Skip to main content

On Spectral Learning of Mixtures of Distributions

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3559))

Abstract

We consider the problem of learning mixtures of distributions via spectral methods and derive a characterization of when such methods are useful. Specifically, given a mixture-sample, let \(\bar\mu_{i}, {\bar C_{i}}, \bar w_{i}\) denote the empirical mean, covariance matrix, and mixing weight of the samples from the i-th component. We prove that a very simple algorithm, namely spectral projection followed by single-linkage clustering, properly classifies every point in the sample provided that each pair of means \(\bar\mu_{i},\bar\mu_{j}\) is well separated, in the sense that \(\|\bar\mu_{i} - \bar\mu_{j}\|^{2}\) is at least \(\|{\bar C_{i}\|_{2}(1/\bar w_{i}+1/\bar w_{j})}\) plus a term that depends on the concentration properties of the distributions in the mixture. This second term is very small for many distributions, including Gaussians, Log-concave, and many others. As a result, we get the best known bounds for learning mixtures of arbitrary Gaussians in terms of the required mean separation. At the same time, we prove that there are many Gaussian mixtures {(μ i ,C i ,w i )} such that each pair of means is separated by ||C i ||2(1/w i  + 1/w j ), yet upon spectral projection the mixture collapses completely, i.e., all means and covariance matrices in the projected mixture are identical.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Arora, S., Kannan, R.: Learning mixtures of arbitrary Gaussians. In: Proc. 33rd ACM Symposium on Theory of Computation, pp. 247–257 (2001)

    Google Scholar 

  2. Dasgupta, S.: Learning mixtures of Gaussians. In: Proc. 40th IEEE Symposium on Foundations of Computer Science, pp. 634–644 (1999)

    Google Scholar 

  3. Dasgupta, S., Schulman, L.: A 2-round variant of EM for Gaussian mixtures. In: Proc. 16th Conference on Uncertainty in Artificial Intelligence, pp. 152–159 (2000)

    Google Scholar 

  4. Lindsay, B.: Mixture models: theory, geometry and applications. American Statistical Association, Virginia (2002)

    Google Scholar 

  5. Titterington, D.M., Smith, A.F.M., Makov, U.E.: Statistical analysis of finite mixture distributions. Wiley, Chichester (1985)

    MATH  Google Scholar 

  6. Vempala, S., Wang, G.: A Spectral Algorithm of Learning Mixtures of Distributions. In: Proc. 43rd IEEE Symposium on Foundations of Computer Science, pp. 113–123 (2002)

    Google Scholar 

  7. Soshnikov, A.: A Note on Universality of the Distribution of the Largest Eigenvalues in Certain Sample Covariance Matrices. J. Stat. Phys. 108(5/6), 1033–1056 (2002)

    Article  MATH  MathSciNet  Google Scholar 

  8. Salmasian, H., Kannan, R., Vempala, S.: The Spectral Method for Mixture Models. In: Electronic Colloquium on Computational Complexity (ECCC), vol. (067) (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Achlioptas, D., McSherry, F. (2005). On Spectral Learning of Mixtures of Distributions. In: Auer, P., Meir, R. (eds) Learning Theory. COLT 2005. Lecture Notes in Computer Science(), vol 3559. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11503415_31

Download citation

  • DOI: https://doi.org/10.1007/11503415_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26556-6

  • Online ISBN: 978-3-540-31892-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics