Reconstruction of Markov Random Fields from Samples: Some Observations and Algorithms

Bresler, Guy; Mossel, Elchanan; Sly, Allan

doi:10.1007/978-3-540-85363-3_28

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 5171)

Included in the following conference series:

International Workshop on Approximation Algorithms for Combinatorial Optimization
International Workshop on Randomization and Approximation Techniques in Computer Science

Abstract

Markov random fields are used to model high-dimensional distributions in a number of applied areas. Much recent interest has been devoted to reconstructing the dependency structure from independent samples of a Markov random field. We analyze a simple algorithm for reconstructing the underlying graph of a Markov random field on $n$ nodes with maximum degree $d$ from observations. We show that under mild non-degeneracy conditions it reconstructs the generating graph with high probability using $\Theta(d \log n)$ samples, which is optimal up to a multiplicative constant. Our results seem to be the first for general models that guarantee that the generating model is reconstructed. Furthermore, we provide an explicit $O(d\, n^{d+2} \log n)$ running time bound. In cases where the measure on the graph has correlation decay, the running time is $O(n^2 \log n)$ for all fixed $d$. In the full-length version we also discuss the effect of observing noisy samples. There we show that as long as the noise level is low, our algorithm is effective. On the other hand, we construct an example where large noise implies non-identifiability even for generic noise and interactions. Finally, we briefly show that in some cases, models with hidden nodes can also be recovered.
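The abstract does not spell out the algorithm, so the following is only an illustrative sketch of the general idea of exhaustive neighborhood search for a bounded-degree binary model, not the authors' procedure. For each node it looks for the smallest candidate set of at most the maximum degree many nodes such that, once that set is conditioned on, no single remaining node visibly changes the node's empirical conditional law. The function names, the threshold `eps`, the restriction to single-node tests, and the toy chain sampler are all assumptions introduced here for illustration.

```python
import itertools
import random


def cond_prob_one(samples, v, cond):
    """Empirical P(X_v = 1 | X_i = a_i for all (i, a_i) in cond).

    Returns None when the conditioning event never occurs in the samples.
    """
    matching = [s[v] for s in samples if all(s[i] == a for i, a in cond)]
    if not matching:
        return None
    return sum(matching) / len(matching)


def influences(samples, v, U, w, eps):
    """True if changing X_w shifts the empirical law of X_v given X_U by more than eps."""
    for x_U in itertools.product((0, 1), repeat=len(U)):
        base = list(zip(U, x_U))
        p0 = cond_prob_one(samples, v, base + [(w, 0)])
        p1 = cond_prob_one(samples, v, base + [(w, 1)])
        if p0 is not None and p1 is not None and abs(p0 - p1) > eps:
            return True
    return False


def estimate_neighborhood(samples, v, n, d, eps):
    """Smallest set U (|U| <= d) that screens node v off from every other single node.

    Simplifying assumption: only single nodes w are tested, which is enough
    for the toy chain below but is not claimed to work for general models.
    """
    others = [u for u in range(n) if u != v]
    for size in range(d + 1):
        for U in itertools.combinations(others, size):
            rest = [w for w in others if w not in U]
            if not any(influences(samples, v, U, w, eps) for w in rest):
                return set(U)
    return None  # no candidate of size <= d passed the test


def sample_chain(num_samples, n, flip, seed):
    """Toy data (hypothetical): binary chain X_0 - X_1 - ... - X_{n-1}.

    X_0 is uniform; each later variable copies its predecessor and is
    flipped independently with probability `flip`.
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(num_samples):
        x = [rng.randint(0, 1)]
        for _ in range(n - 1):
            x.append(x[-1] ^ int(rng.random() < flip))
        samples.append(tuple(x))
    return samples


if __name__ == "__main__":
    data = sample_chain(20000, 3, 0.2, seed=0)
    for node in range(3):
        print(node, estimate_neighborhood(data, node, n=3, d=2, eps=0.1))
```

Enumerating all candidate sets of bounded size is what makes the cost of this kind of search grow polynomially in the number of nodes with an exponent that depends on the maximum degree, which is the flavor of running time bound the abstract reports. The sketch makes no attempt to match the stated sample complexity or the faster bound under correlation decay.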



Author information

Authors and Affiliations

Guy Bresler: Dept. of Electrical Engineering and Computer Sciences, U.C. Berkeley
Elchanan Mossel: Dept. of Statistics and Dept. of Electrical Engineering and Computer Sciences, U.C. Berkeley
Allan Sly: Dept. of Statistics, U.C. Berkeley

Editor information

Ashish Goel, Klaus Jansen, José D. P. Rolim, Ronitt Rubinfeld

Cite this paper

Bresler, G., Mossel, E., Sly, A. (2008). Reconstruction of Markov Random Fields from Samples: Some Observations and Algorithms. In: Goel, A., Jansen, K., Rolim, J.D.P., Rubinfeld, R. (eds) Approximation, Randomization and Combinatorial Optimization. Algorithms and Techniques. APPROX 2008, RANDOM 2008. Lecture Notes in Computer Science, vol 5171. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85363-3_28

DOI: https://doi.org/10.1007/978-3-540-85363-3_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85362-6
Online ISBN: 978-3-540-85363-3
eBook Packages: Computer Science, Computer Science (R0)
