Abstract
In temporal text mining, some importance indices such as simple appearance frequency, tf-idf, and differences of some indices play the key role to identify remarkable trends of terms in sets of documents. However, most of conventional methods have treated their remarkable trends as discrete statuses for each time-point or fixed period. In order to find their trends as continuous statuses, we have considered the values of importance indices of the terms in each time-point as temporal behaviors of the terms. In this paper, we describe the method to identify the temporal behaviors of terms on several importance indices by using the linear trends. Then, we show a comparison between visualizations on each time-point by using composed indices with PCA and the trends of the emergent terms, which are detected the burst word detection method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Lent, B., Agrawal, R., Srikant, R.: Discovering trends in text databases. In: Proc. of the Third International Conference on Knowledge Discovery and Data Mining (KDD 1997), pp. 227–230 (1997)
Kontostathis, A., Galitsky, L., Pottenger, W.M., Roy, S., Phelps, D.J.: A survey of emerging trend detection in textual data mining. A Comprehensive Survey of Text Mining (2003)
Abe, H., Tsumoto, S.: Detecting temporal trends of technical phrases by using importance indices and linear regression. In: Proc. of the 18th International Symposium on Methodologies for Intelligent Systems (ISMIS 2009), pp. 251–259. Springer, Heidelberg (2009)
Mei, Q., Zhai, C.: Discovering evolutionary theme patterns from text: an exploration of temporal text mining. In: KDD 2005: Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, pp. 198–207. ACM, New York (2005)
Keogh, E., Chu, S., Hart, D., Pazzani, M.: Segmenting time series: A survey and novel approach. In: An Edited Volume, Data mining in Time Series Databases, pp. 1–22. World Scientific, Singapore (2003)
Liao, T.W.: Clustering of time series data: a survey. Pattern Recognition 38, 1857–1874 (2005)
Anderberg, M.R.: Cluster Analysis for Applications. In: Monographs and Textbooks on Probability and Mathematical Statistics. Academic Press, Inc., New York (1973)
Kleinberg, J.M.: Bursty and hierarchical structure in streams. Data Min. Knowl. Discov. 7(4), 373–397 (2003)
Ohsaki, M., Abe, H., Yamaguchi, T.: Numerical time-series pattern extraction based on irregular piecewise aggregate approximation and gradient specification. New Generation Comput. 25(3), 213–222 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abe, H., Tsumoto, S. (2009). Comparing Temporal Behavior of Phrases on Multiple Indexes with a Burst Word Detection Method. In: Sakai, H., Chakraborty, M.K., Hassanien, A.E., Ślęzak, D., Zhu, W. (eds) Rough Sets, Fuzzy Sets, Data Mining and Granular Computing. RSFDGrC 2009. Lecture Notes in Computer Science(), vol 5908. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10646-0_61
Download citation
DOI: https://doi.org/10.1007/978-3-642-10646-0_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10645-3
Online ISBN: 978-3-642-10646-0
eBook Packages: Computer ScienceComputer Science (R0)