Abstract
Traditional method of Event Detection and Characterization (EDC) regards event detection task as classification problem. It makes words as samples to train classifier, which can lead to positive and negative samples of classifier imbalance. Meanwhile, there is data sparseness problem of this method when the corpus is small. This paper doesn’t classify event using word as samples, but cluster event in judging event types. It adopts self-similarity to convergence the value of K in K-means algorithm by the guidance of event triggers, and optimizes clustering algorithm. Then, combining with named entity and its comparative position information, the new method further make sure the pinpoint type of event. The new method avoids depending on template of event in tradition methods, and its result of event detection can well be used in automatic text summarization, text retrieval, and topic detection and tracking.
Supported by the national high-teach research and development plan of China under grant (863). NO. 2007AA01Z439.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
ACE (Automatic Content Extraction) Chinese Annotation Guidelines for Events, National Institute of Standards and Technology (2005)
Surdeanu, M., Harabagiu, S., Williams, J., et al.: Using Predicate-Argument Structures for Information Extraction. In: Proceedings of ACL, pp. 8–15 (2003)
Surdeanu, M., Harabagiu, S.: Infrastructure for open-domain information extraction. In: Proceedings of the Human Language Technology Conference, pp. 325–330 (2002)
Chieu, H.L., Ng, H.T.: A Maximum entropy Approach to Information Extraction from Semi-Structured and Free Text. In: Proceedings of the 18th National Conference on Artificial Intelligence, pp. 786–791 (2002)
Ahn, D.: The Stages of Event Extraction. In: Proceedings of the Workshop on Annotations and Reasoning about Time and Events, pp. 1–8 (2006)
Yanyan, Z., bing, Q., Wanxiang, C., Ting, L.: Technology Research on Chinese Event Extraction. Journal of Chinese Information 22(1), 3–8 (2008)
Ding, C., He, X.: Cluster Merging and Splitting in Hierarchical Clustering Algorithms. In: Proceedings of the 2002 IEEE International Conference on Data Mining, Maebashi City, Japan, pp. 139–146. Maebashi TERRSA (2002)
Ding, C., He, X., Zha, H., et al.: A Min-Max Cut Algorithm for Graph Partitioning and Data Clustering. In: Proceedings of the IEEE International Conference on Data Mining, San Jose, California, USA, pp. 107–114 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, X., Li, B., Tian, Y. (2009). Self-similarity Clustering Event Detection Based on Triggers Guidance. In: Liu, W., Luo, X., Wang, F.L., Lei, J. (eds) Web Information Systems and Mining. WISM 2009. Lecture Notes in Computer Science, vol 5854. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-05250-7_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-05250-7_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-05249-1
Online ISBN: 978-3-642-05250-7
eBook Packages: Computer ScienceComputer Science (R0)