Abstract
News event detection is the task of discovering relevant, yet previously unreported real-life events and reporting it to users in human-readable form, while event tracking aims to automatically assign event labels to news stories when they arrive. A new method and system for performing the event detection and tracking task is proposed in this paper. The event detection and tracking method is based on subject extraction and an improved support vector machine (SVM), in which subject concepts can concisely and precisely express the meaning of a longer text. The improved SVM first prunes the negative examples, reserves and deletes a negative sample according to distance and class label, then trains the new set with SVM to obtain a classifier and maps the SVM outputs into probabilities. The experimental results with the real-world data sets indicate the proposed method is feasible and advanced.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Allan, J., Carbonell, J., Doddington, G., Yamron, J., Yang, Y.: Topic detection and tracking pilot study final report. In: Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, pp. 194–218. Morgan Kaufmann Publishers Inc., San Francisco (1998)
Gao, J.F.: An empirical study of CLIR at MSCN. In: Proceedings of the International Workshop ILT&CIP-2001 on Innovative Language Technology and Chinese Information Processing, German Research Center for Artificial Intelligence and Shanghai Jiao Tong University, Shanghai, pp. 55–62 (2001)
Papka, R.: On-line New Event Detection, Clustering, and Tracking. Ph. D. Thesis, University of Massachusetts at Amherst (1999)
Chen, G.L., Wang, Y.C.: The research on automatic abstract of Internet information. High Technology Letters 11(2), 33–36 (1999) (in Chinese)
Salton, G., Buckley, C.: Term-weighting approach in automatic text retrieval. Information Processing & Management 24(5), 513–523 (1988)
Kim, K., Jung, K., Park, S., Kim, H.: Support Vector Machines for Texture Classification. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(11), 1542–1550 (2002)
Wahba, G.: Support Vector Machines, Reproducing Kernel Hilbert Spaces and The Randomized GACV. In: Advances in Kernel Methods Support Vector Learning, pp. 69–88. MIT Press, Massachusetts (1999)
Lei, Z., Wu, L.D., Lei, L., Liu, Y.C.: A System for Event Detection and Tracking Based on Constructive-Competition Clustering and KNNFL. To appear in the System Engineering Theory and Practice (2006) (in Chinese)
Chakrabarti, S.: Integrating The Document Object Model With Hyperlinks For Enhanced Topic Distillation and Information Extraction. In: Proceedings of the 10th ACM-WWW International Conference, pp. 211–220. ACM Press, Hong Kong (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lei, Z., Wu, Ld., Zhang, Y., Liu, Yc. (2005). A System for Detecting and Tracking Internet News Event. In: Ho, YS., Kim, H.J. (eds) Advances in Multimedia Information Processing - PCM 2005. PCM 2005. Lecture Notes in Computer Science, vol 3767. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581772_66
Download citation
DOI: https://doi.org/10.1007/11581772_66
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30027-4
Online ISBN: 978-3-540-32130-9
eBook Packages: Computer ScienceComputer Science (R0)