Abstract
A complex noun sequence is one in which a head noun is recursively modified by one or more bare nouns and/or genitives. Constituency analysis of a complex noun sequence is a prerequisite for finding the dependency relations (semantic relations) between the components of the sequence. Identifying these dependency relations is useful for applications such as question answering, information extraction, textual entailment, and paraphrasing.
In Hindi, syntactic agreement rules can handle the parsing of recursive genitives to a large extent (Sharma, 2012) [12]. This paper implements frequency-based, corpus-driven approaches for parsing the recursive genitive structures that syntactic rules cannot handle, as well as recursive compound nouns and combinations of genitive and compound noun sequences. Using syntactic rules and the dependency global algorithm, an accuracy of 92.85% is obtained.
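To make the idea of frequency-based bracketing concrete, the following is a minimal sketch, not the algorithm evaluated in the paper. It assumes a simple adjacency-style comparison for a three-noun sequence: the pair of neighbouring nouns with the stronger corpus association is grouped first. The Jaccard coefficient is used here only as one possible association measure, and the function names, the transliterated example words, and all counts are hypothetical, invented for illustration.

```python
from collections import Counter


def jaccard(pair_count, left_count, right_count):
    """Jaccard association between two words, computed from corpus counts."""
    denom = left_count + right_count - pair_count
    return pair_count / denom if denom > 0 else 0.0


def bracket_three(n1, n2, n3, unigrams, bigrams):
    """Bracket a three-noun sequence by comparing adjacent-pair association.

    Returns ((n1, n2), n3) for left bracketing, (n1, (n2, n3)) otherwise.
    Ties are resolved in favour of left bracketing (an arbitrary choice).
    """
    left = jaccard(bigrams[(n1, n2)], unigrams[n1], unigrams[n2])
    right = jaccard(bigrams[(n2, n3)], unigrams[n2], unigrams[n3])
    if left >= right:
        return ((n1, n2), n3)
    return (n1, (n2, n3))


# Toy counts, invented for illustration only (not taken from any corpus).
unigrams = Counter({"kendra": 50, "sarkar": 80, "karmachari": 40})
bigrams = Counter({("kendra", "sarkar"): 30, ("sarkar", "karmachari"): 10})

result = bracket_three("kendra", "sarkar", "karmachari", unigrams, bigrams)
print(result)
```

With these toy counts the first pair has the higher association score, so the sequence is bracketed to the left. A system of the kind the abstract describes would draw its counts from a large corpus and would also need to handle longer sequences and genitive markers, which this sketch does not attempt.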
References
Chapter 9: Regular expressions, the Open Group Base Specifications Issue 6, IEEE Std 1003.1, 2004 Edition. The Open Group (2004)
Hindi dependency treebank, workshop on MT and Parsing in Indian Languages. 24th International Conference on Computational Linguistics (2012)
Girju, R., Moldovan, D., Tatu, M., Antohe, D.: On the semantics of noun compounds. Computer Speech and Language (2005)
Jaccard, P.: The distribution of the flora in the alpine zone. The New Phytologist (1912)
Kavuluru, R., Harris, D.: A knowledge-based approach to syntactic disambiguation of biomedical noun compounds. In: Proceedings of COLING (2012)
Kulkarni, A., Kumar, A.: Statistical constituency parser for Sanskrit compounds. In: Proceedings of ICON (2011)
Kulkarni, A., Paul, S., Kulkarni, M., Kumar, A., Surtani, N.: Semantic processing of compounds in Indian languages. In: Proceedings of COLING (2012)
Lapata, M., Keller, F.: Web-based models for natural language processing. ACM (2005)
Lauer, M.: Corpus statistics meet the noun compound: Some empirical results (1995)
Pecina, P.: Lexical association measures. Institute of Formal and Applied Linguistics (2009)
Pustejovsky, J., Bergler, S., Anick, P.: Lexical semantic techniques for corpus analysis. Association for Computational Linguistics (1993)
Sharma, S.: Disambiguating the parsing of Hindi recursive genitive constructions. IIIT Hyderabad, India (2012)
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Batra, A., Paul, S., Kulkarni, A. (2014). Constituency Parsing of Complex Noun Sequences in Hindi. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2014. Lecture Notes in Computer Science, vol 8403. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54906-9_23
DOI: https://doi.org/10.1007/978-3-642-54906-9_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54905-2
Online ISBN: 978-3-642-54906-9
eBook Packages: Computer Science (R0)