A Feasibility Study on Low Level Techniques for Improving Parsing Accuracy for Spanish Using Maltparser

Ballesteros, Miguel; Herrera, Jesús; Francisco, Virginia; Gervás, Pablo

doi:10.1007/978-3-642-12842-4_8

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6040)

Included in the following conference series:

Hellenic Conference on Artificial Intelligence

Abstract

In recent years, dependency parsing has been carried out by machine learning–based systems that show high accuracy, though usually below 90% Labelled Attachment Score (LAS). Maltparser is one such system. Machine learning makes it possible to obtain a parser for any language that has an adequate training corpus. Since such systems generally cannot be modified, the following question arises: can we exceed this 90% LAS by using better training corpora? Some previous work suggests that high level techniques are not sufficient for building more accurate training corpora. Thus, by analyzing the words that are most frequently incorrectly attached or labelled, we study the feasibility of some low level techniques, based on n–version parsing models, for obtaining better parsing accuracy.
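To make the evaluation measure concrete, here is a minimal Python sketch (not the authors' code) of how LAS and a per-word error tally could be computed. It assumes each token is a (form, head index, dependency label) triple. The function names, the toy sentence, and its labels are illustrative assumptions. The sketch scores every token, whereas some evaluation setups exclude punctuation.

```python
from collections import Counter
from typing import List, Tuple

# Assumed representation: (word form, head index, dependency label).
Token = Tuple[str, int, str]


def labelled_attachment_score(gold: List[Token], predicted: List[Token]) -> float:
    """Fraction of tokens whose predicted head AND label both match the gold ones."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must contain the same number of tokens")
    if not gold:
        return 0.0
    correct = sum(
        1 for g, p in zip(gold, predicted) if g[1] == p[1] and g[2] == p[2]
    )
    return correct / len(gold)


def error_counts_by_word(gold: List[Token], predicted: List[Token]) -> Counter:
    """Count, per lower-cased word form, tokens with a wrong head or a wrong label."""
    errors: Counter = Counter()
    for g, p in zip(gold, predicted):
        if g[1] != p[1] or g[2] != p[2]:
            errors[g[0].lower()] += 1
    return errors


# Toy example with hypothetical labels; head index 0 denotes the root.
gold = [("El", 2, "spec"), ("gato", 3, "suj"), ("come", 0, "ROOT"), ("pescado", 3, "cd")]
pred = [("El", 2, "spec"), ("gato", 3, "cd"), ("come", 0, "ROOT"), ("pescado", 2, "cd")]

las = labelled_attachment_score(gold, pred)
errors = error_counts_by_word(gold, pred)
print(f"LAS = {las:.2f}")
print(errors.most_common())
```

In this toy sentence, "gato" has the right head but the wrong label and "pescado" has the right label but the wrong head, so both count as LAS errors and both appear in the tally. Sorting such a tally over a whole test set is one plausible way to find the words that are most frequently misattached or mislabelled.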

Author information

Authors and Affiliations

Miguel Ballesteros & Jesús Herrera: Departamento de Ingeniería del Software e Inteligencia Artificial
Virginia Francisco & Pablo Gervás: Instituto de Tecnología del Conocimiento, Universidad Complutense de Madrid, C/ Profesor José García Santesmases, s/n, E–28040, Madrid, Spain

Editor information

Editors and Affiliations

Institute of Informatics and Telecommunications, NCSR Demokritos, Ag. Paraskevi, 15310, Athens, Greece
Stasinos Konstantopoulos, Stavros Perantonis, Vangelis Karkaletsis & Constantine D. Spyropoulos
Department of Information and Communication Systems Engineering, University of the Aegean, 83200, Karlovassi, Samos, Greece
George Vouros

About this paper

Cite this paper

Ballesteros, M., Herrera, J., Francisco, V., Gervás, P. (2010). A Feasibility Study on Low Level Techniques for Improving Parsing Accuracy for Spanish Using Maltparser. In: Konstantopoulos, S., Perantonis, S., Karkaletsis, V., Spyropoulos, C.D., Vouros, G. (eds) Artificial Intelligence: Theories, Models and Applications. SETN 2010. Lecture Notes in Computer Science, vol 6040. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12842-4_8

DOI: https://doi.org/10.1007/978-3-642-12842-4_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12841-7
Online ISBN: 978-3-642-12842-4
eBook Packages: Computer Science, Computer Science (R0)
