Abstract
This paper presents a simple rule-based approach to organization name recognition in Chinese text. Drawing on Chinese knowledge sources, the approach detects potential left and right boundaries in a text, then decides whether a left-right boundary pair encloses an organization name by applying a length constraint together with constraints on words and POS tags that cannot occur inside an organization name. Organization names with nested structure are also handled. The approach is easy to implement, and the evaluation results are satisfactory.
This work is funded by the National Natural Science Foundation of China (No. 60473138).
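The boundary-pair idea described in the abstract can be illustrated with a minimal sketch. The abstract does not list the actual knowledge sources or thresholds, so everything concrete below is an assumption made for illustration: the right-boundary word list, the POS tags treated as left boundaries, the forbidden words and tags, the `MAX_LEN` value, and the function name `find_org_names` are all hypothetical. Input is assumed to be already segmented and POS-tagged.

```python
from typing import List, Tuple

Token = Tuple[str, str]  # (word, POS tag); tag set is hypothetical

# Hypothetical knowledge sources, NOT taken from the paper.
RIGHT_BOUNDARY_WORDS = {"公司", "大学", "银行", "研究所"}  # typical organization suffixes
LEFT_BOUNDARY_POS = {"ns", "nz"}       # e.g. place names, other proper nouns
FORBIDDEN_WORDS = {"的", "在", "是"}    # words assumed never to occur inside a name
FORBIDDEN_POS = {"v", "p", "w"}        # verbs, prepositions, punctuation
MAX_LEN = 6                            # assumed maximum name length in tokens


def find_org_names(tokens: List[Token], max_len: int = MAX_LEN) -> List[Tuple[int, int, str]]:
    """Return (start, end, name) for every accepted left-right boundary pair."""
    lefts = [i for i, (_, pos) in enumerate(tokens) if pos in LEFT_BOUNDARY_POS]
    rights = [i for i, (word, _) in enumerate(tokens) if word in RIGHT_BOUNDARY_WORDS]
    found = []
    for right in rights:
        for left in lefts:
            # The left boundary must precede the right one (length constraint below).
            if left >= right or right - left + 1 > max_len:
                continue
            span = tokens[left:right + 1]
            # Reject the pair if it encloses a forbidden word or POS tag.
            if any(w in FORBIDDEN_WORDS or pos in FORBIDDEN_POS for w, pos in span):
                continue
            found.append((left, right, "".join(w for w, _ in span)))
    return found


sentence = [("他", "r"), ("在", "p"), ("北京", "ns"), ("大学", "n"),
            ("计算机", "n"), ("研究所", "n"), ("工作", "v")]
for start, end, name in find_org_names(sentence):
    print(start, end, name)
```

Because every accepted pair is kept, a name nested inside a longer one (here 北京大学 inside 北京大学计算机研究所) is reported alongside the enclosing name. How the paper itself treats nested names is not stated in the abstract, so this is only one possible reading.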
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Houfeng, W., Wuguang, S. (2005). A Simple Rule-Based Approach to Organization Name Recognition in Chinese Text. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2005. Lecture Notes in Computer Science, vol 3406. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30586-6_86
DOI: https://doi.org/10.1007/978-3-540-30586-6_86
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24523-0
Online ISBN: 978-3-540-30586-6
eBook Packages: Computer Science, Computer Science (R0)