A System for Detecting and Tracking Internet News Event

  • Conference paper
Advances in Multimedia Information Processing - PCM 2005 (PCM 2005)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3767)

Abstract

News event detection is the task of discovering relevant yet previously unreported real-life events and reporting them to users in human-readable form, while event tracking aims to automatically assign event labels to news stories as they arrive. This paper proposes a new method and system for performing event detection and tracking. The method is based on subject extraction and an improved support vector machine (SVM); the extracted subject concepts concisely and precisely express the meaning of a longer text. The improved SVM first prunes the negative examples, retaining or deleting each negative sample according to distance and class label, then trains an SVM on the pruned set to obtain a classifier, and finally maps the SVM outputs into probabilities. Experimental results on real-world data sets indicate that the proposed method is feasible and advanced.
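
The three stages of the improved SVM named in the abstract (prune negatives, train, map outputs to probabilities) can be illustrated with a minimal sketch. The abstract does not give the actual pruning criterion, kernel, or probability mapping, so everything specific here is an assumption: the rule "delete a negative sample whose nearest neighbour is a positive sample closer than `radius`", the linear SVM trained by subgradient descent, the Platt-style sigmoid fitted with plain 0/1 targets, and all function names and toy data are hypothetical and are not taken from the paper.

```python
import math

def dist(a, b):
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))

def prune_negatives(X, y, radius):
    """ASSUMED rule (not from the paper): delete a negative sample whose
    nearest neighbour is a positive sample closer than `radius`."""
    kept_X, kept_y = [], []
    for i, (xi, yi) in enumerate(zip(X, y)):
        if yi == -1:
            j = min((k for k in range(len(X)) if k != i),
                    key=lambda k: dist(xi, X[k]))
            if y[j] == 1 and dist(xi, X[j]) < radius:
                continue  # negative sample sits inside the positive class
        kept_X.append(xi)
        kept_y.append(yi)
    return kept_X, kept_y

def decision(w, b, x):
    """Signed SVM output f(x) = w . x + b."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b

def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Linear soft-margin SVM fitted by subgradient descent on hinge loss."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            if yi * decision(w, b, xi) < 1:  # margin violated
                w = [wj - lr * (lam * wj - yi * xj) for wj, xj in zip(w, xi)]
                b += lr * yi
            else:  # only the regulariser contributes
                w = [wj - lr * lam * wj for wj in w]
    return w, b

def fit_sigmoid(scores, y, lr=0.1, epochs=500):
    """Fit P(y=1 | f) = 1 / (1 + exp(A*f + B)) by gradient descent on the
    negative log-likelihood (simplified: plain 0/1 targets)."""
    A, B, n = 0.0, 0.0, len(scores)
    for _ in range(epochs):
        grad_A = grad_B = 0.0
        for f, yi in zip(scores, y):
            t = 1.0 if yi == 1 else 0.0
            p = 1.0 / (1.0 + math.exp(A * f + B))
            grad_A += (t - p) * f
            grad_B += t - p
        A -= lr * grad_A / n
        B -= lr * grad_B / n
    return A, B

def probability(w, b, A, B, x):
    """Probability that story vector x belongs to the event (label +1)."""
    return 1.0 / (1.0 + math.exp(A * decision(w, b, x) + B))

# Toy 2-D data (made up): three on-event stories, three off-event stories,
# and one off-event sample lying in the middle of the on-event cluster.
X = [[2.0, 2.0], [2.5, 1.5], [1.5, 2.5],
     [-2.0, -2.0], [-2.5, -1.5], [-1.5, -2.5],
     [1.9, 2.1]]
y = [1, 1, 1, -1, -1, -1, -1]

X_pruned, y_pruned = prune_negatives(X, y, radius=0.5)
w, b = train_linear_svm(X_pruned, y_pruned)
A, B = fit_sigmoid([decision(w, b, x) for x in X_pruned], y_pruned)
print(len(X_pruned), probability(w, b, A, B, [2.0, 2.0]))
```

In this sketch the probability output is what would let a tracking component compare a story against several event classifiers on a common scale; whether the paper uses the scores this way is not stated in the abstract.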

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lei, Z., Wu, Ld., Zhang, Y., Liu, Yc. (2005). A System for Detecting and Tracking Internet News Event. In: Ho, YS., Kim, H.J. (eds) Advances in Multimedia Information Processing - PCM 2005. PCM 2005. Lecture Notes in Computer Science, vol 3767. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581772_66

  • DOI: https://doi.org/10.1007/11581772_66

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-30027-4

  • Online ISBN: 978-3-540-32130-9

  • eBook Packages: Computer Science, Computer Science (R0)
