Skip to main content

The Automatic Acquisition of Verb Subcategorisations and Their Impact on the Performance of an HPSG Parser

  • Conference paper
Natural Language Processing – IJCNLP 2004 (IJCNLP 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3248))

Included in the following conference series:

  • 1627 Accesses

Abstract

We describe the automatic acquisition of a lexicon of verb subcategorisations from a domain-specific corpus, and an evaluation of the impact this lexicon has on the performance of a “deep”, HPSG parser of English. We conducted two experiments to determine whether the empirically extracted verb stems would enhance the lexical coverage of the grammar and to see whether the automatically extracted verb subcategorisations would result in enhanced parser coverage. In our experiments, the empirically extracted verbs enhance lexical coverage by 8.5%. The automatically extracted verb subcategorisations enhance the parse success rate by 15% in theoretical terms and by 4.5% in practice. This is a promising approach for improving the robustness of deep parsing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Pollard, C., Sag, I.: Head-Driven Phrase Structure Grammar. Chicago University Press, Chicago (1994)

    Google Scholar 

  2. Copestake, A., Flickinger, D.: An open-source grammar development environment and broad-coverage English grammar using HPSG. In: Proceedings of LREC 2000, Athens, Greece (2000)

    Google Scholar 

  3. Briscoe, E., Carroll, J.: Robust accurate statistical annotation of general text. In: Proceedings of the 3rd International Conference on Language Resources and Evaluation, Las Palmas, Gran Canaria., pp. 1499–1504 (2002)

    Google Scholar 

  4. Minnen, G., Carroll, J., Pearce, D.: Applied morphological processing of English. Natural Language Engineering 7(3), 207–223 (2001)

    Article  Google Scholar 

  5. Briscoe, E., Carroll, J.: Automatic extraction of subcategorization from corpora. In: Proceedings of the 5th ACL Conference on Applied Natural Language Processing, Washington, DC, pp. 356–363 (1997)

    Google Scholar 

  6. Korhonen, A.: Subcategorization Acquisition. PhD thesis published as Techical Report UCAM-CL-TR-530. Computer Laboratory, University of Cambridge (2002)

    Google Scholar 

  7. Grishman, R., Macleod, C., Meyers, A.: Comlex syntax: Building a computational lexicon. In: Proceedings of the 15th International Conference on Computational Linguistics, Kyoto, Japan, pp. 268–272 (1994)

    Google Scholar 

  8. Callmeier, U.: PET. A platform for experimentation with efficient HPSG processing techniques. Natural Language Engineering 6(1), 99–108 (2000) (Special Issue on Efficient Processing with HPSG)

    Article  Google Scholar 

  9. Oepen, S.: [incr tsdb()]: Competence and Performance Laboratory: User & Reference Manual, Computational Linguistics, Saarland University, Saarbrücken, Germany (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Carroll, J., Fang, A.C. (2005). The Automatic Acquisition of Verb Subcategorisations and Their Impact on the Performance of an HPSG Parser. In: Su, KY., Tsujii, J., Lee, JH., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2004. IJCNLP 2004. Lecture Notes in Computer Science(), vol 3248. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30211-7_68

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30211-7_68

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-24475-2

  • Online ISBN: 978-3-540-30211-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics