Abstract
We present a nearest-neighbor algorithm for resolving prepositional phrase attachment ambiguities. Its performance is significantly higher than previous corpus-based methods for PP-attachment that do not rely on manually constructed knowledge bases. We will also show that the PP-attachment task provides a way to evaluate methods for computing distributional word similarities. Our experiments indicate that the cosine of pointwise mutual information vector is a significantly better similarity measure than several other commonly used similarity measures.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abney, S., Schapire, R.E., Singer, Y.: Boosting Applied to Tagging and PPattachment. In: Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, EMNLP-VLC, College Park, MD, pp. 38–45 (1999)
Altmann, G., Steedman, M.: Interaction with Context During Human Sentence Processing. Cognition 30, 191–238 (1988)
Brill, E.: Transformation-based Error-driven Learning and Natural Language Processing: A case study in part of speech tagging. Computational Linguistics (December 1995)
Brill, E., Resnik, P.: A Rule-Based Approach to Prepositional Phrase Attachment Disambiguation. In: Proceedings of COLING 1994, Kyoto, Japan, pp. 1198–1204 (1994)
Collins, M., Brooks, J.: Prepositional Phrase Attachment through a Backed-off Model. In: Proceedings of the Third Workshop on Very Large Corpora, Cambridge, Massachusetts, pp. 27–38 (1995)
Grefenstette, G.: Explorations in Automatic Thesaurus Discovery. Kluwer Academic Publishers, Boston (1994)
Harris, Z.S.: Mathematical Structures of Language. Wiley, New York (1968)
Hays, D.: Dependency Theory: a Formalism and Some Observations. Language 40, 511–525 (1964)
Hindle, D.: Noun Classification from Predicate-Argument Structures. In: Proceedings of ACL 1990, Pittsburgh, Pennsylvania, pp. 268–275 (1990)
Hindle, D., Rooth, M.: Structural Ambiguity and Lexical Relations. Computational Linguistics 19(1), 103–120 (1993)
Hudson, R.: Word Grammar. Basil Blackwell Publishers Limited, Oxford (1984)
Li, H.: Word Clustering and Disambiguation based on Co-occurrence Data. Natural Language Engineering 8(1), 25–42 (2002)
Lin, D.: Automatic Retrieval and Clustering of Similar Words. In: Proceedings of COLING-ACL 1998, Montreal, Canada (1998)
Lin, D.: Principar - an Efficient, Broad-Coverage, Principle-Based Parser. In: Proceedings of COLING 1994, Kyoto, Japan (1994)
Lin, D.: Dependency-based evaluation of MINIPAR. In: Abeille, A. (ed.) Building and using syntactically annotated corpora, pp. 317–330. Kluwer, Dordrecht (2003)
Lin, J.: Divergence measures based on the Shannon entropy. IEEE Transactions on Information Theory 37(1), 145–151 (1991)
Mel’čuk, I.A.: Dependency Syntax: theory and practice. State University of New York Press, Albany (1987)
Miller, G.: WordNet: an On-Line Lexical Database. International Journal of Lexicography (1990)
Pereira, F., Tishby, N., Lee, L.: Distributional Clustering of English Words. In: Proceedings of ACL 1993, Columbus, Ohio, pp. 183–190 (1993)
Rao, C.R.: Diversity: Its measurement, decomposition, apportionment and analysis. Sankyha: The Indian Journal of Statistics 44(A), 1–22 (1982)
Ratnaparkhi, A., Reynar, J., Roukos, S.: A Maximum Entropy Model for Prepositional Phrase Attachment. In: Proceedings of the ARPA Human Language Technology Workshop, Plainsboro, N.J, pp. 250–255 (1994)
Stetina, J., Nagao, M.: Corpus Based PP Attachment Ambiguity Resolution with a Semantic Dictionary. In: Proceedings of the Fifth Workshop on Very Large Corpora, Beijing, Hong Kong, pp. 66–80 (1997)
Terra, E.L., Clarke, C.: Frequency Estimates for Statistical Word Similarity Measures. In: the Proceedings of the 2003 Human Language Technology Conference, Edmonton, Canada, May 2003, pp. 244–251 (2003)
Turney, P.D.: Mining the Web for synonyms: PMI-IR versus LSA on TOEFL. In: Flach, P.A., De Raedt, L. (eds.) ECML 2001. LNCS (LNAI), vol. 2167, pp. 491–502. Springer, Heidelberg (2001)
Zavrel, J., Daelemans, W., Veenstra, J.: Resolving PP attachment Ambiguities with Memory-Based Learning. In: Proceedings of the Conference on Computational Natural Language Learning, CoNLL 1997, Madrid, Spain, pp. 136–144 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhao, S., Lin, D. (2005). A Nearest-Neighbor Method for Resolving PP-Attachment Ambiguity. In: Su, KY., Tsujii, J., Lee, JH., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2004. IJCNLP 2004. Lecture Notes in Computer Science(), vol 3248. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30211-7_58
Download citation
DOI: https://doi.org/10.1007/978-3-540-30211-7_58
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24475-2
Online ISBN: 978-3-540-30211-7
eBook Packages: Computer ScienceComputer Science (R0)