Abstract
Noun phrase chunking is a sub-category of shallow parsing that can be used for many natural language processing tasks. In this paper, we propose a noun phrase chunker system for Turkish texts. We use a weighted constraint dependency parser to represent the relationship between sentence components and to determine noun phrases. The dependency parser uses a set of hand-crafted rules which can combine morphological and semantic information for constraints. The rules are suitable for handling complex noun phrase structures because of their flexibility. The developed dependency parser can be easily used for shallow parsing of all phrase types by changing the employed rule set. The lack of reliable human tagged datasets is a significant problem for natural language studies about Turkish. Therefore, we constructed a noun phrase dataset for Turkish. According to our evaluation results, our noun phrase chunker gives promising results on this dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abney, S.: Parsing by chunks. In: Principle-Based Parsing, pp. 257–278 (1991)
Argamon, S., Dagan, I., Krymolowski, Y.: A memory-based approach to learning shallow natural language patterns. In: Proceedings of ACL’98. Association for Computational Linguistics, pp. 67–73 (1998)
Cardie, C., Pierce, D.: Error-driven pruning of Treebank grammars for base noun phrase identification. In: Proceedings of COLING’98. Association for Computational Linguistics, pp. 218–224 (1998)
Church, K.W.: A stochastic parts program and noun phrase parser for unrestricted text. In: Proceedings of the Second Conference on Applied Natural Language Processing, pp. 136–143. Austin, Texas (1988)
Daelemans, W., Bosch, A.V., Zavrel, J.: Forgetting exceptions is harmful in language learning. Mach. Learn. 34, 11–14 (1999)
Daybelge, T., Cicekli, I.: A rule-based morphological disambiguator for Turkish. Proceedings of Recent Advances in Natural Language Processing (RANLP 2007), pp. 145–149. Borovets, Bulgaria (2007)
Eryiğit, G., Oflazer, K.: Statistical dependency parsing of Turkish. In: Proceedings of 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2006), Trento, Italy (2006)
Hammerton, J., Osborne, M., Armstrong, S., Daelemans, W.: Introduction to special issue on machine learning approaches to shallow parsing. J. Mach. Learn. Res. 2, 551–558 (2002)
Istek, O., Cicekli, I.: A link grammar for an Agglutinative language. In: Proceedings of Recent Advances in Natural Language Processing (RANLP 2007), pp. 285–290. Borovets, Bulgaria (2007)
Kutlu, M., Cicekli, I.: A hybrid morphological disambiguation system for Turkish. In: Proceedings of the 6th International Joint Conference on Natural Language Processing (IJCNLP 2013), Nagoya, Japan (2013)
Munoz, M., Punyakanok, V., Roth, D., Zimak, D.: A learning approach to shallow parsing. In: Proceedings of EMNLP-WVLC’99. Association for Computational Linguistics (1999)
Nivre, J.: Dependency grammar and dependency parsing. Vaxjö University: School of Mathematics and Systems Engineering: Technical Report MSI report 05133 (2005)
Oflazer, K.: Dependency parsing with an extended finite-state approach. Comput. Linguist. 29, 515–544 (2003)
Pattabhi, R.K., Vijay, S.R., Vijayakrishna, R., Sobha, L.: A text chunker and hybrid POS tagger for Indian languages. In: Proceedings of IJCAI-07 Workshop on “Shallow Parsing for South Asian Languages” (2007)
Ramshaw, L.A., Marcus, M.P.: Text chunking using transformation-based learning. In: Proceedings of the Third Workshop on Very Large Corpora, ACL (1995)
Sastry, G.R., Chaudhuri, S., Reddy, P.N.: An HMM based part-of-speech tagger and statistical chunker for 3 Indian languages. In: Proceedings of IJCAI-07 Workshop on “Shallow Parsing for South Asian Languages” (2007)
Sobha, L., Vijay, S.R. Noun phrase chunking in Tamil. In: Proceedings of the MSPIL-06, Bombay, pp. 194–198 (2006)
Veenstra, J.: Fast NP chunking using memory-based learning techniques. In: Proceedings of the Eighth Belgian-Dutch Conference on Machine Learning (1998)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Kutlu, M., Cicekli, I. (2016). Noun Phrase Chunking for Turkish Using a Dependency Parser. In: Abdelrahman, O., Gelenbe, E., Gorbil, G., Lent, R. (eds) Information Sciences and Systems 2015. Lecture Notes in Electrical Engineering, vol 363. Springer, Cham. https://doi.org/10.1007/978-3-319-22635-4_35
Download citation
DOI: https://doi.org/10.1007/978-3-319-22635-4_35
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22634-7
Online ISBN: 978-3-319-22635-4
eBook Packages: EngineeringEngineering (R0)