Abstract
This paper focuses on named entity recognition corresponding to people, organizations, locations, etc. in Chinese scientific documents. Two key benefits are shown by performing NER: (i) improved quality of semantic retrieval, and (ii) improvement in subsequent machine translation. Experiments using the Semantex platform for information extraction illustrate and quantify the two benefits outlined.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Papineni, K., Roukos, S., Ward, T., Zhu, W.: Bleu: a method for automatic evaluation of machine translation (2001)
Wan, S., Verspoor, C.: Automatic english-chinese name transliteration for development of multilingual resources. In: COLING-ACL, pp. 1352–1356 (1998)
Doddington, G., Mitchell, A., Pryzbocki, M., Ramshaw, L., Weischedel, R.: The Automatic Content Extraction (ACE) Program, Tasks, Data, and Evaluation (2004)
Ji, H., Blume, M., Freitag, D., Grishman, R., Khadivi, S., Zens, R.: Nyu-fair isaac-rwth chinese to english entity translation 2007 system. In: Proceedings of NIST ET 2007 PI/Evaluation Workshop, Washington, USA (2007)
Srihari, R.K., Li, W., Cornell, T., Niu, C.: Infoxtract: A customizable intermediate level information extraction engine. Natural Language Engineering 12 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Srihari, R.K., Peterson, E. (2008). Named Entity Recognition for Improving Retrieval and Translation of Chinese Documents. In: Buchanan, G., Masoodian, M., Cunningham, S.J. (eds) Digital Libraries: Universal and Ubiquitous Access to Information. ICADL 2008. Lecture Notes in Computer Science, vol 5362. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89533-6_56
Download citation
DOI: https://doi.org/10.1007/978-3-540-89533-6_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89532-9
Online ISBN: 978-3-540-89533-6
eBook Packages: Computer ScienceComputer Science (R0)