EM learning for symbolic-statistical models in statistical abduction
Abstract
We first review a logical-statistical framework called statistical abduction and identify its three computational tasks, one of which is learning parameters from observations by ML (maximum likelihood) estimation. Traditionally, in the presence of missing values, the EM algorithm has been used for ML estimation. We report that the graphical EM algorithm, a new EM algorithm developed for statistical abduction, achieves the same time complexity as the specialized EM algorithms developed in individual disciplines, such as the Inside-Outside algorithm for PCFGs (probabilistic context-free grammars). Furthermore, learning experiments using two corpora revealed that it can outperform the Inside-Outside algorithm by orders of magnitude. We then look specifically into a family of extensions of PCFGs that incorporate context sensitivity. Experiments show that they are learnable by the graphical EM algorithm using at most twice as much time as plain PCFGs, even though these extensions have higher time complexity.
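
As a rough illustration of the kind of ML estimation the abstract refers to, the sketch below runs the classical Inside-Outside procedure (the EM baseline against which the graphical EM algorithm is compared) on a deliberately tiny one-nonterminal PCFG in Chomsky normal form. The grammar, rule probabilities, and toy corpus are invented for illustration and do not come from the chapter; the graphical EM algorithm itself operates on support graphs and is not reproduced here.

# Illustrative sketch only: classical Inside-Outside EM for a tiny CNF PCFG
# with a single nonterminal S and rules S -> S S | 'a' | 'b'. All grammar
# symbols, probabilities, and the corpus are hypothetical.
from collections import defaultdict

rules = {("S", "S", "S"): 0.4, ("S", "a"): 0.3, ("S", "b"): 0.3}

def inside(sent, rules):
    n = len(sent)
    beta = defaultdict(float)              # beta[(i, j)] = P(S =>* sent[i..j])
    for i, w in enumerate(sent):
        beta[(i, i)] = rules.get(("S", w), 0.0)
    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span - 1
            beta[(i, j)] = sum(rules.get(("S", "S", "S"), 0.0)
                               * beta[(i, k)] * beta[(k + 1, j)]
                               for k in range(i, j))
    return beta

def outside(sent, rules, beta):
    n = len(sent)
    alpha = defaultdict(float)
    alpha[(0, n - 1)] = 1.0                # root span
    p_bin = rules.get(("S", "S", "S"), 0.0)
    for span in range(n - 1, 0, -1):
        for i in range(n - span + 1):
            j = i + span - 1
            # (i, j) as left child of a larger span (i, k)
            alpha[(i, j)] += sum(alpha[(i, k)] * p_bin * beta[(j + 1, k)]
                                 for k in range(j + 1, n))
            # (i, j) as right child of a larger span (k, j)
            alpha[(i, j)] += sum(alpha[(k, j)] * p_bin * beta[(k, i - 1)]
                                 for k in range(0, i))
    return alpha

def em_step(corpus, rules):
    counts = defaultdict(float)
    p_bin = rules.get(("S", "S", "S"), 0.0)
    for sent in corpus:
        n = len(sent)
        beta = inside(sent, rules)
        alpha = outside(sent, rules, beta)
        z = beta[(0, n - 1)]               # sentence probability
        if z == 0.0:
            continue
        # E-step: expected rule counts from inside/outside probabilities
        for i in range(n):
            counts[("S", sent[i])] += alpha[(i, i)] * rules.get(("S", sent[i]), 0.0) / z
        for span in range(2, n + 1):
            for i in range(n - span + 1):
                j = i + span - 1
                for k in range(i, j):
                    counts[("S", "S", "S")] += (alpha[(i, j)] * p_bin
                                                * beta[(i, k)] * beta[(k + 1, j)] / z)
    # M-step: renormalise expected counts into new rule probabilities
    total = sum(counts.values())
    return {r: c / total for r, c in counts.items()}

corpus = [list("aab"), list("ab"), list("ba"), list("aaab")]
for _ in range(20):
    rules = em_step(corpus, rules)
print(rules)

Each EM iteration computes inside and outside probabilities per sentence, accumulates expected rule counts, and renormalizes them. The chapter's graphical EM algorithm reaches the same ML estimates but works over support graphs that share subcomputations across explanations, which is where its complexity advantage comes from.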
Publication details
Published in:
Arikawa Setsuo, Shinohara Ayumi (eds.) (2002) Progress in discovery science: final report of the Japanese discovery science project. Dordrecht, Springer.
Pages: 189-200
Full citation:
Sato Taisuke (2002) "EM learning for symbolic-statistical models in statistical abduction", In: S. Arikawa & A. Shinohara (eds.), Progress in discovery science, Dordrecht, Springer, 189-200.