Feature Word Tracking in Time Series Documents

Takasu, Atsuhiro; Tanaka, Katsuaki

doi:10.1007/978-3-540-28651-6_97

Atsuhiro Takasu¹⁹ &
Katsuaki Tanaka²⁰

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3177))

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning

1036 Accesses

Abstract

Data mining from time series documents is a new challenge in text mining and, for this purpose, time dependent feature extraction is an important problem. This paper proposes a method to track feature terms in time series documents. When analyzing and mining time series data, the key is to handle time information. The proposed method applies non-linear principal component analysis to document vectors that consist of term frequencies and time information. This paper reports preliminary experimental results in which the proposed method is applied to a corpus of topic detection and tracking, and we show that the proposed method is effective in extracting time dependent terms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Using LDA and Time Series Analysis for Timestamping Documents

Using Time Series Analysis for Estimating the Time Stamp of a Text

Enriching feature engineering for short text samples by language time series analysis

Article Open access 31 August 2020

References

Allan, J.: Topic Detection and Tracking: Event-based Infromation Organization. Kluwer Academic Publishers, Dordrecht (2002)
Google Scholar
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Infromation Retrieval. Addison-Wesley, Reading (1999)
Google Scholar
Brants, T., Chen, F., Farahat, A.: A system for new event detection. In: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 330–337 (2003)
Google Scholar
Deerwester, S., Dumais, S.T., Harshman, R.: Indexing by latent semantic analysis. Journal of American Society of Information Systems 41(2), 391–407 (1990)
Article Google Scholar
Franz, M., Scott McCarley, J.: Unsupervised and supervised clustering for topic tracking. In: Proceedings of the 21th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 310–317 (2000)
Google Scholar
Ishikawa, Y., Chen, Y., Kitagawa, H.: An on-line document clustering method based on forgetting factors. In: Proceedings of the 5th European Conference on Research and Advanced Technology for Digital Libraries, pp. 325–339 (2001)
Google Scholar
Scholkopf, B., Smola, A., Muller, K.-R.: Nonlinear component analysis as a kernel eigenvalue problem. Technical Report 44, Max-Plank- Institute fur Biolgische Kybernetik (1996)
Google Scholar

Download references

Author information

Authors and Affiliations

National Institute of Informatics, 2-1-2 Hitotsubashi, Chiyoda-ku, Tokyo, 101-8430, Japan
Atsuhiro Takasu
AI Laboratory, RCAST, University of Tokyo, 4-6-1 Komaba, Meguro-ku, Tokyo, 101-8430, Japan
Katsuaki Tanaka

Authors

Atsuhiro Takasu
View author publications
You can also search for this author in PubMed Google Scholar
Katsuaki Tanaka
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Engineering, Computing, and Mathematics, University of Exeter, EX4 4QF, Exeter, UK
Zheng Rong Yang
School of Electrical and Electronic Engineering, University of Manchester, UK
Hujun Yin
School of Engineering, Computer Science and Mathematics, University of Exeter, EX4 4QF, UK
Richard M. Everson

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Takasu, A., Tanaka, K. (2004). Feature Word Tracking in Time Series Documents. In: Yang, Z.R., Yin, H., Everson, R.M. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2004. IDEAL 2004. Lecture Notes in Computer Science, vol 3177. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28651-6_97

Download citation

DOI: https://doi.org/10.1007/978-3-540-28651-6_97
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22881-3
Online ISBN: 978-3-540-28651-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics