Skip to main content

A Nearest-Neighbor Method for Resolving PP-Attachment Ambiguity

  • Conference paper
Natural Language Processing – IJCNLP 2004 (IJCNLP 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3248))

Included in the following conference series:

Abstract

We present a nearest-neighbor algorithm for resolving prepositional phrase attachment ambiguities. Its performance is significantly higher than previous corpus-based methods for PP-attachment that do not rely on manually constructed knowledge bases. We will also show that the PP-attachment task provides a way to evaluate methods for computing distributional word similarities. Our experiments indicate that the cosine of pointwise mutual information vector is a significantly better similarity measure than several other commonly used similarity measures.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abney, S., Schapire, R.E., Singer, Y.: Boosting Applied to Tagging and PPattachment. In: Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, EMNLP-VLC, College Park, MD, pp. 38–45 (1999)

    Google Scholar 

  2. Altmann, G., Steedman, M.: Interaction with Context During Human Sentence Processing. Cognition 30, 191–238 (1988)

    Article  Google Scholar 

  3. Brill, E.: Transformation-based Error-driven Learning and Natural Language Processing: A case study in part of speech tagging. Computational Linguistics (December 1995)

    Google Scholar 

  4. Brill, E., Resnik, P.: A Rule-Based Approach to Prepositional Phrase Attachment Disambiguation. In: Proceedings of COLING 1994, Kyoto, Japan, pp. 1198–1204 (1994)

    Google Scholar 

  5. Collins, M., Brooks, J.: Prepositional Phrase Attachment through a Backed-off Model. In: Proceedings of the Third Workshop on Very Large Corpora, Cambridge, Massachusetts, pp. 27–38 (1995)

    Google Scholar 

  6. Grefenstette, G.: Explorations in Automatic Thesaurus Discovery. Kluwer Academic Publishers, Boston (1994)

    MATH  Google Scholar 

  7. Harris, Z.S.: Mathematical Structures of Language. Wiley, New York (1968)

    MATH  Google Scholar 

  8. Hays, D.: Dependency Theory: a Formalism and Some Observations. Language 40, 511–525 (1964)

    Article  Google Scholar 

  9. Hindle, D.: Noun Classification from Predicate-Argument Structures. In: Proceedings of ACL 1990, Pittsburgh, Pennsylvania, pp. 268–275 (1990)

    Google Scholar 

  10. Hindle, D., Rooth, M.: Structural Ambiguity and Lexical Relations. Computational Linguistics 19(1), 103–120 (1993)

    Google Scholar 

  11. Hudson, R.: Word Grammar. Basil Blackwell Publishers Limited, Oxford (1984)

    Google Scholar 

  12. Li, H.: Word Clustering and Disambiguation based on Co-occurrence Data. Natural Language Engineering 8(1), 25–42 (2002)

    Article  Google Scholar 

  13. Lin, D.: Automatic Retrieval and Clustering of Similar Words. In: Proceedings of COLING-ACL 1998, Montreal, Canada (1998)

    Google Scholar 

  14. Lin, D.: Principar - an Efficient, Broad-Coverage, Principle-Based Parser. In: Proceedings of COLING 1994, Kyoto, Japan (1994)

    Google Scholar 

  15. Lin, D.: Dependency-based evaluation of MINIPAR. In: Abeille, A. (ed.) Building and using syntactically annotated corpora, pp. 317–330. Kluwer, Dordrecht (2003)

    Google Scholar 

  16. Lin, J.: Divergence measures based on the Shannon entropy. IEEE Transactions on Information Theory 37(1), 145–151 (1991)

    Article  MATH  Google Scholar 

  17. Mel’čuk, I.A.: Dependency Syntax: theory and practice. State University of New York Press, Albany (1987)

    Google Scholar 

  18. Miller, G.: WordNet: an On-Line Lexical Database. International Journal of Lexicography (1990)

    Google Scholar 

  19. Pereira, F., Tishby, N., Lee, L.: Distributional Clustering of English Words. In: Proceedings of ACL 1993, Columbus, Ohio, pp. 183–190 (1993)

    Google Scholar 

  20. Rao, C.R.: Diversity: Its measurement, decomposition, apportionment and analysis. Sankyha: The Indian Journal of Statistics 44(A), 1–22 (1982)

    MATH  Google Scholar 

  21. Ratnaparkhi, A., Reynar, J., Roukos, S.: A Maximum Entropy Model for Prepositional Phrase Attachment. In: Proceedings of the ARPA Human Language Technology Workshop, Plainsboro, N.J, pp. 250–255 (1994)

    Google Scholar 

  22. Stetina, J., Nagao, M.: Corpus Based PP Attachment Ambiguity Resolution with a Semantic Dictionary. In: Proceedings of the Fifth Workshop on Very Large Corpora, Beijing, Hong Kong, pp. 66–80 (1997)

    Google Scholar 

  23. Terra, E.L., Clarke, C.: Frequency Estimates for Statistical Word Similarity Measures. In: the Proceedings of the 2003 Human Language Technology Conference, Edmonton, Canada, May 2003, pp. 244–251 (2003)

    Google Scholar 

  24. Turney, P.D.: Mining the Web for synonyms: PMI-IR versus LSA on TOEFL. In: Flach, P.A., De Raedt, L. (eds.) ECML 2001. LNCS (LNAI), vol. 2167, pp. 491–502. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  25. Zavrel, J., Daelemans, W., Veenstra, J.: Resolving PP attachment Ambiguities with Memory-Based Learning. In: Proceedings of the Conference on Computational Natural Language Learning, CoNLL 1997, Madrid, Spain, pp. 136–144 (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhao, S., Lin, D. (2005). A Nearest-Neighbor Method for Resolving PP-Attachment Ambiguity. In: Su, KY., Tsujii, J., Lee, JH., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2004. IJCNLP 2004. Lecture Notes in Computer Science(), vol 3248. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30211-7_58

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30211-7_58

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-24475-2

  • Online ISBN: 978-3-540-30211-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics