Invited articleInformation geometry of the EM and em algorithms for neural networks
References (53)
Dualistic geometry of the manifold of higher-order neurons
Neural Networks
(1991)- et al.
Dualistic dynamical systems in the framework of information geometry
Physica D
(1995) - et al.
Noisy reinforcement training for pRAM nets
Neural Networks
(1994) - et al.
Differential geometric structures of stable feedback systems with dual connections
Differential geometry of curved exponential families—curvatures and information loss
Annals of Statistics
(1982)Differential geometrical methods in statistics
Differential geometry of a parametric family of invertible linear systems—Riemannian metric, dual affine connections and divergence
Mathematical Systems Theory
(1987)Differential geometrical theory of statistics
Fisher information under restriction of Shannon information in multiterminal situations
Annals of Institute of Statistical Mathematics
(1989)Mathematical foundations of neurocomputing
The EM algorithm and information geometry in neural network learning
Neural Computation
(1995)
Differential geometry in statistical inferences
Statistical inference under multiterminal rate restrictions—a differential geometrical approach
IEEE Transactions on Information Theory
(1989)
Information geometry of estimating functions in semiparametric statistical models
Information geometry of Boltzmann machines
IEEE Transactions on Neural Networks
(1992)
Smooth on-line learning algorithm for hidden Markov models
Neural Computation
(1994)
Information and exponential families in statistical theory
(1978)
Parametric statistical model and likelihood
The role of differential geometry in statistical theory
International Statistical Review
(1986)
Approximating exponential models
Annals of Institute of Statistical Mathematics
(1989)
A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
Annals of Mathematical Statistics
(1970)
Spatial statistics and Bayesian computation
Journal of Royal Statistical Society
(1993)
Alternating minimization and Boltzmann machine learning
IEEE Transactions on Neural Networks
(1992)
Neural networks—a review from statistical perspectives—comments and rejoinders
Statistical Science
(1994)
Statistical decision rules and optimal inference
(1972)
Theoretical statistics
(1974)
Cited by (257)
Natural Reweighted Wake–Sleep
2022, Neural NetworksCitation Excerpt :In their work the authors show conditions for the theoretical convergence of a modified version of the Wake–Sleep algorithm, identified as a variant of the geometric em algorithm. The convergence of the em and their relation to the Expectation–Maximization (EM) optimization process is known in literature and in particular has been studied by Amari (1995) and Fujiwara and Amari (1995). Notice that the algorithm by Ikeda et al. is using the exact FIM, while in the present work we are employing an estimation of the gradients and of the FIM based on the minibatch.
Geometry of EM and related iterative algorithms
2023, Information Geometry
Copyright © 1996 Published by Elsevier Ltd.