Ad-hoc Information Retrieval based on Boosted Latent Dirichlet Allocated Topics | IEEE Conference Publication | IEEE Xplore

Abstract:

Latent Dirichlet Allocation (LDA) is a fundamental method in the text mining field. We propose strategies for topic and model selection based on LDA that exploit the semantic coherence of the inferred topics, boosting the quality of the models found. We then study how our boosted topic models perform in ad-hoc information retrieval tasks. Experimental results on four datasets show that our proposal improves the quality of the topics found, to the benefit of document retrieval tasks. Our method outperforms traditional LDA-based methods, showing that model selection based on semantic coherence is useful for document modeling and information retrieval tasks.
Date of Conference: 05-09 November 2018
Date Added to IEEE Xplore: 06 May 2019
ISBN Information:
Print on Demand (PoD) ISSN: 1522-4902
Conference Location: Santiago, Chile
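
The abstract describes selecting among LDA models by the semantic coherence of their topics, but does not say which coherence measure or selection rule is used. The following is a minimal sketch of the general idea only, not the authors' method. It assumes a UMass-style coherence score computed from document co-occurrence counts, and it assumes each candidate model is already summarized by the ranked top words of its topics (fitting LDA itself is left out). The names umass_coherence, mean_coherence, select_model, and the toy corpus are all hypothetical.

```python
import math
from itertools import combinations


def umass_coherence(top_words, docs):
    """UMass-style coherence of one topic (assumed measure, not from the paper).

    top_words: the topic's top words, highest-ranked first.
    docs: tokenized documents (lists of words).
    """
    doc_sets = [set(d) for d in docs]

    def doc_freq(*words):
        # Number of documents that contain all of the given words.
        return sum(1 for s in doc_sets if all(w in s for w in words))

    score = 0.0
    for i, j in combinations(range(len(top_words)), 2):
        higher, lower = top_words[i], top_words[j]  # i < j, so `higher` ranks first
        df_higher = doc_freq(higher)
        if df_higher == 0:
            continue  # word never occurs in the corpus; skip the pair
        # +1 smoothing keeps the logarithm defined when the pair never co-occurs.
        score += math.log((doc_freq(higher, lower) + 1) / df_higher)
    return score


def mean_coherence(topics, docs):
    """Average coherence over all topics of one candidate model."""
    return sum(umass_coherence(t, docs) for t in topics) / len(topics)


def select_model(candidates, docs):
    """Return the name of the candidate whose topics are most coherent on average."""
    return max(candidates, key=lambda name: mean_coherence(candidates[name], docs))


# Toy corpus and two hypothetical candidate models (illustrative data only).
docs = [
    ["topic", "model", "lda", "text"],
    ["lda", "topic", "inference"],
    ["retrieval", "query", "document"],
    ["document", "retrieval", "ranking", "query"],
]
candidates = {
    "model_a": [["lda", "topic"], ["retrieval", "query"]],
    "model_b": [["lda", "query"], ["retrieval", "topic"]],
}

print(select_model(candidates, docs))  # prints: model_a
```

In this toy setup, model_a groups words that co-occur in the same documents, so its mean coherence is higher than that of model_b, whose topics mix unrelated words. In practice the candidates would be LDA models fitted with different numbers of topics or hyperparameters.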
