Abstract
This paper presents a novel integrated second-order Hidden Markov Model (HMM) to extract event related named entities (NEs) and activities from short messages simultaneously. It uses second-order Markov chain to better model the context dependency in the string sequence. For decoding second-order HMM, a two-order Viterbi algorithm is used. The experiments demonstrate that combing NE and activities as an integrated model achieves better results than process them separately by NER for NEs and POS decoding for activities. The experimental results also showed that second-order HMM outperforms than first-order HMM. Furthermore, the proposed algorithm significantly reduces the complexity that can run in the handheld device in the real time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ingelbrecht, N., Gupta, S.B.M., Hart, T.J., Shen, S., Sato, A.: Forecast: Mobile Messaging, Major Markets Worldwide, 2004–2013 (2009), http://www.gartner.com
Yuanyong Feng, L.S., Zhang, J.: Early Results for Chinese Named Entity Recognition Using Conditional Random Fields Model, HMM and Maximum Entropy. In: Proceeding of NLP-KE 2005, pp. 549–552 (2005)
Yimo Guo, H.G.: A Chinese Person Name Recognization System Based on Agentbased HMM Position Tagging Model. In: Proceedings of the 6th World Congress on Intelligent Control and Automation, pp. 4069–4072 (2006)
Alireza Mansouri, L.S.A., Mamat, A.: A New Fuzzy Support Vector Machine Method for Named Entity Recognition. In: Computer Science and Information Technology (ICCSIT 2008), pp. 24–28 (2008)
Hongping Hu, H.Z.: Chinese Named Entity Recognition with CRFs: Two Levels. In: International Conference on Computational Intelligence and Security 2, CIS 2008, vol. 2, pp. 1–6 (2008)
Helmut Schmid, F.L.: Estimation of conditional probabilities with decision trees and an application to fine-grained POS tagging. In: Proceedings of the 22nd International Conference on Computational Linguistics (2008)
Manju, K.S.S., Idicula, S.M.: Development of A Pos Tagger for Malayalam-An Experience. In: 2009 International Conference on Advances in Recent Technologies in Communication and Computing, pp. 709–713 (2009)
Juan, W.: Research and Application of Statistical Language Model. In: Beijing University of Posts and Telecommunications 2009, pp. 81–82 (2009)
Thede, S.M., Harper, M.P.: A Second-Order Hidden Markov Model for Part-of-Speech Tagging. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, pp. 175–182 (1999)
Richard Zens, H.N.: Word Graphs for statistical Machine Translation. In: Proceedings of the ACL Workshop on Building and Using Parallel Texts, pp. 191–198 (2005)
Franz Josef Och, N.U., Ney, H.: An Efficient A* Search Algorithm for Statistical Machine Translation. In: Proceedings of the ACL Workshop on Data- Driven methods in Machine Translation, Toulouse, France, vol. 14, pp. 1–8 (2001)
Ye-Yi Wang, A.W.: Decoding algorithm in statistical machine translation. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, pp. 366–372 (1997)
Yu, H.: Chinese Lexical Analisis and Named Entity Identification Using Hierachical Hidden Markov Model, Beijing University of Chemical Technology (2004)
Stanley, F., Chen, J.G.: An Empirical Study of Smoothing Techniques for Language Modeling. In: Technical Report TR-10-98, Computer Science Group, Harvard University (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jiang, H., Wang, X., Tian, J. (2010). Second-Order HMM for Event Extraction from Short Message. In: Hopfe, C.J., Rezgui, Y., Métais, E., Preece, A., Li, H. (eds) Natural Language Processing and Information Systems. NLDB 2010. Lecture Notes in Computer Science, vol 6177. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13881-2_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-13881-2_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13880-5
Online ISBN: 978-3-642-13881-2
eBook Packages: Computer ScienceComputer Science (R0)