Abstract
Syntactic parsing is an important processing step for various language processing applications including Information Extraction, Question Answering, and Machine Translation. Parsing base Noun Phrases is one particular parsing issue that is not handled by current state-of-the-art syntactic parsers. In this paper we present research that investigates the base Noun Phrase parsing problem. We develop a base Noun Phrase parser based on several statistical models that provide promising results on a test set of 538 base Noun Phrases. The parameters of the models are estimated from the web in the form of web counts. This makes our method unsupervised with no training data being needed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cao, Y., Li, H.: Base Noun Phrase Translation Using Web Data and the EM Algorithm. In: International Conference on Computational Linguistics (COLING) (2002)
Charniak, E.: Statistical Parsing with a context-free grammar and word statistics. In: Proceedings of the Fourteenth National Conference on Artificial Intelligence, AAAI Press, Menlo Park (1997)
Collins, M.: A New Statistical Parser Based on Bigram Lexical Dependencies. In: Proceedings of the 34th Annual Meeting of the ACL, Santa Cruz, CA (1996)
Daume III, H., Marcu, D.: NP Bracketing by Maximum Entropy Tagging and SVM Reranking. In: Proceedings of Empirical Methods in Natural Language Processing (2004)
Gildea, D.: Loosely Tree-Based Alignment for Machine Translation. In: Proceedings of the 41th Annual Conference of the Association for Computational Linguistics, Sapporo, Japan (2003)
Hobbs, J.R.: Information Extraction from Biomedical Text. In: Journal of Biomedical Informatics (2003)
Jurafsky, D., Martin, J.: Speech and Language Processing. Prentice-Hall, Englewood Cliffs (2000)
Keller, F., Lapata, M.: Using the Web to Obtain Frequencies for Unseen Bigrams. Computational Linguistics 29(3), 459–484 (2003)
Krymolowski, Y., Dagan, I.: Compositional Memory-Based Partial Parsing. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), pp. 45–52 (2000)
Lappin, S., Leass, H.J.: An Algorithm for Pronominal Anaphora Resolution. Computational Linguistics 20(4), 535–561 (1994)
Magerman, D.M.: Natural Language Parsing as Statistical Pattern Recognition, PhD Thesis, Stanford University (February 1994)
Marcus, M.P., Santorini, B., Marcinkiewicz, M.A.: Building a Large Annotated Corpus of English: The Penn Treebank. Computational Linguistics 19(2), 313–330 (1993)
Markham, J., Rus, V.: On the Implementation of a Baseline Part-of-Speech Tagger. In: Third Mid-South College Computing Conference (MSCCC-05), Oxford, Mississippi, April (2005)
Miller, G.A.: WordNet: a lexical database for English. Communications of the ACM 38(11), 39–41 (1995)
Nakov, P., Hearst, M.: Search Engine Statistics Beyond the n-gram: Application to Noun Compound Bracketing. In: Proceedings of Ninth Conference on Computational Natural Language Learning, Ann Arbor, MI, June (2005)
Ramshaw, L.A., Marcus, M.P.: Text Chunking Using Transformation-Based Learning. In: Proceedings of the Third Workshop on Very Large Corpora, Cambridge, MA, USA (1995)
Rus, V.: High Precision Logic Form Transformation. In: International Conference on Tools with Artificial Intelligence (ICTAI), Dallas, TX, November (2001)
Rus, V., Moldovan, D.I., Bolohan, O.: Bracketing Compound Nouns for Logic Form Derivation. In: Proceedings of the FLAIRS 2002 Conference, Pensacola, Florida, May (2002)
Sang, T.K., Erik, F.: Noun Phrase Recognition by System Combination. In: Proceedings of ANLP-NAACL 2000, Seattle, Washington, USA, pp. 50–55. Morgan Kaufmann Publishers, San Francisco (2000)
Voorhees, E.: Overview of the TREC 2002 Question Answering Track. In: Proceedings of the Eleventh Text Retrieval Conference (TREC 2002) (2002)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rus, V., Ravi, S., Lintean, M.C., McCarthy, P.M. (2007). Unsupervised Method for Parsing Coordinated Base Noun Phrases. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2007. Lecture Notes in Computer Science, vol 4394. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70939-8_21
Download citation
DOI: https://doi.org/10.1007/978-3-540-70939-8_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70938-1
Online ISBN: 978-3-540-70939-8
eBook Packages: Computer ScienceComputer Science (R0)