skip to main content
10.1145/1076034.1076148acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article

Relation between PLSA and NMF and implications

Published: 15 August 2005 Publication History

Abstract

Non-negative Matrix Factorization (NMF, [5]) and Probabilistic Latent Semantic Analysis (PLSA, [4]) have been successfully applied to a number of text analysis tasks such as document clustering. Despite their different inspirations, both methods are instances of multinomial PCA [1]. We further explore this relationship and first show that PLSA solves the problem of NMF with KL divergence, and then explore the implications of this relationship.

References

[1]
W. Buntine. Variational extensions to EM and multinomial PCA. In ECML'02, 2002.
[2]
E. Gaussier, C. Goutte, K. Popat, and F. Chen. A hierarchical model for clustering and categorising documents. In Advances in Information Retrieval, Lecture Notes in Computer Science. Springer, 2002.
[3]
C. Goutte, K. Yamada, and E. Gaussier. Aligning words using matrix factorisation. In ACL'04, 2004.
[4]
T. Hofmann. Probabilistic latent semantic analysis. In UAI'99, pages 289--296. Morgan Kaufmann, 1999.
[5]
D. D. Lee and H. S. Seung. Learning the parts of objects by non-negative matrix factorization. Nature, 401:788--791, 1999.
[6]
D. D. Lee and H. S. Seung. Algorithms for non-negative matrix factorization. In NIPS'13. MIT Press, 2001.
[7]
G. McLachlan and D. Peel. Finite Mixture Models. Wiley, 2000.
[8]
W. Xu, X. Liu, and Y. Gong. Document clustering based on non-negative matrix factorization. In SIGIR'03, 2003.

Cited By

View all
  • (2024)Revisiting Probabilistic Latent Semantic Analysis: Extensions, Challenges and InsightsTechnologies10.3390/technologies1201000512:1(5)Online publication date: 3-Jan-2024
  • (2023)Block-Active ADMM to Minimize NMF with Bregman DivergencesSensors10.3390/s2316722923:16(7229)Online publication date: 17-Aug-2023
  • (2023)Application of Latent Semantic Analysis in Accounting ResearchJournal of Information Systems10.2308/ISYS-2022-01337:3(139-155)Online publication date: 25-Oct-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
August 2005
708 pages
ISBN:1595930345
DOI:10.1145/1076034
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 August 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. NMF
  2. PLSA
  3. document clustering
  4. probabilistic models

Qualifiers

  • Article

Conference

SIGIR05
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)36
  • Downloads (Last 6 weeks)4
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Revisiting Probabilistic Latent Semantic Analysis: Extensions, Challenges and InsightsTechnologies10.3390/technologies1201000512:1(5)Online publication date: 3-Jan-2024
  • (2023)Block-Active ADMM to Minimize NMF with Bregman DivergencesSensors10.3390/s2316722923:16(7229)Online publication date: 17-Aug-2023
  • (2023)Application of Latent Semantic Analysis in Accounting ResearchJournal of Information Systems10.2308/ISYS-2022-01337:3(139-155)Online publication date: 25-Oct-2023
  • (2023)Benefits of pre-trained mono- and cross-lingual speech representations for spoken language understanding of Dutch dysarthric speechEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-023-00280-z2023:1Online publication date: 7-Apr-2023
  • (2023)GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membershipGenome Biology10.1186/s13059-023-03067-924:1Online publication date: 19-Oct-2023
  • (2023)Recent Research Trend Analysis for the Journal of Society of Korea Industrial and Systems Engineering Using Topic ModelingJournal of Society of Korea Industrial and Systems Engineering10.11627/jksie.2023.46.3.17046:3(170-185)Online publication date: 30-Sep-2023
  • (2023)Incorporating Embedding to Topic Modeling for More Effective Short Text AnalysisCompanion Proceedings of the ACM Web Conference 202310.1145/3543873.3587316(73-76)Online publication date: 30-Apr-2023
  • (2023)Orthogonal parametric non-negative matrix tri-factorization with α-divergence for co-clusteringExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.120680231:COnline publication date: 30-Nov-2023
  • (2023)Neural nonnegative matrix factorization for hierarchical multilayer topic modelingSampling Theory, Signal Processing, and Data Analysis10.1007/s43670-023-00077-322:1Online publication date: 19-Dec-2023
  • (2022)Recent Trends in AI-Based Intelligent SensingElectronics10.3390/electronics1110166111:10(1661)Online publication date: 23-May-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media