Abstract
This study considers the problem of automatic trend detection in document collections related to several specific domains. The suggested trend detection algorithm is based on the domain-specific trend model. The algorithm was evaluated on documents from shipbuilding and power engineering domains.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The code, instructions and results can be found on https://bitbucket.org/ilnurgadelshin/trends.
- 2.
The full lists of the found trends could be found on https://bitbucket.org/ilnurgadelshin/trends.
References
Porter, F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)
Natural Language Toolkit. http://www.nltk.org/
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Vorontsov, K.V., Potapenko, A.A.: Regularization, robustness and sparseness of probabilistic topic models. Comput. Res. Model. 2, 161–174 (2012) (in Russian)
Teh, Y.W., Jordan, M.I.: Hierarchical Bayesian nonparametric models with applications. In: Hjort, N., Holmes, C., Müller, P., Walker, S. (eds.) Bayesian Nonparametrics Principles and Practice. Cambridge University Press, Cambridge (2009)
Teh, Y.W.: Dirichlet processes. In: Sammut, C., Webb, G.I. (eds.) Encyclopedia of Machine Learning, pp. 280–287. Springer, Heidelberg (2010)
Blei, D, Lafferty, J.: Dynamic topic models. In: ICML (2006)
Glance, N., Hurst, M., Tomokiyo, T.: BlogPulse: automated trend discovery for weblogs. In: WWW 2004, ACM (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Gadelshin, I., Antonova, A., Ilvovsky, D. (2014). Detection of Domain-Specific Trends in Text Collections. In: Ignatov, D., Khachay, M., Panchenko, A., Konstantinova, N., Yavorsky, R. (eds) Analysis of Images, Social Networks and Texts. AIST 2014. Communications in Computer and Information Science, vol 436. Springer, Cham. https://doi.org/10.1007/978-3-319-12580-0_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-12580-0_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12579-4
Online ISBN: 978-3-319-12580-0
eBook Packages: Computer ScienceComputer Science (R0)