Abstract
We describe the automatic acquisition of a lexicon of verb subcategorisations from a domain-specific corpus, and an evaluation of the impact this lexicon has on the performance of a “deep”, HPSG parser of English. We conducted two experiments to determine whether the empirically extracted verb stems would enhance the lexical coverage of the grammar and to see whether the automatically extracted verb subcategorisations would result in enhanced parser coverage. In our experiments, the empirically extracted verbs enhance lexical coverage by 8.5%. The automatically extracted verb subcategorisations enhance the parse success rate by 15% in theoretical terms and by 4.5% in practice. This is a promising approach for improving the robustness of deep parsing.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Pollard, C., Sag, I.: Head-Driven Phrase Structure Grammar. Chicago University Press, Chicago (1994)
Copestake, A., Flickinger, D.: An open-source grammar development environment and broad-coverage English grammar using HPSG. In: Proceedings of LREC 2000, Athens, Greece (2000)
Briscoe, E., Carroll, J.: Robust accurate statistical annotation of general text. In: Proceedings of the 3rd International Conference on Language Resources and Evaluation, Las Palmas, Gran Canaria., pp. 1499–1504 (2002)
Minnen, G., Carroll, J., Pearce, D.: Applied morphological processing of English. Natural Language Engineering 7(3), 207–223 (2001)
Briscoe, E., Carroll, J.: Automatic extraction of subcategorization from corpora. In: Proceedings of the 5th ACL Conference on Applied Natural Language Processing, Washington, DC, pp. 356–363 (1997)
Korhonen, A.: Subcategorization Acquisition. PhD thesis published as Techical Report UCAM-CL-TR-530. Computer Laboratory, University of Cambridge (2002)
Grishman, R., Macleod, C., Meyers, A.: Comlex syntax: Building a computational lexicon. In: Proceedings of the 15th International Conference on Computational Linguistics, Kyoto, Japan, pp. 268–272 (1994)
Callmeier, U.: PET. A platform for experimentation with efficient HPSG processing techniques. Natural Language Engineering 6(1), 99–108 (2000) (Special Issue on Efficient Processing with HPSG)
Oepen, S.: [incr tsdb()]: Competence and Performance Laboratory: User & Reference Manual, Computational Linguistics, Saarland University, Saarbrücken, Germany (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Carroll, J., Fang, A.C. (2005). The Automatic Acquisition of Verb Subcategorisations and Their Impact on the Performance of an HPSG Parser. In: Su, KY., Tsujii, J., Lee, JH., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2004. IJCNLP 2004. Lecture Notes in Computer Science(), vol 3248. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30211-7_68
Download citation
DOI: https://doi.org/10.1007/978-3-540-30211-7_68
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24475-2
Online ISBN: 978-3-540-30211-7
eBook Packages: Computer ScienceComputer Science (R0)