PSO-Tagger: A New Biologically Inspired Approach to the Part-of-Speech Tagging Problem

Silva, Ana Paula; Silva, Arlindo; Rodrigues, Irene

doi:10.1007/978-3-642-37213-1_10

Ana Paula Silva¹⁷,
Arlindo Silva¹⁷ &
Irene Rodrigues¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7824))

Included in the following conference series:

International Conference on Adaptive and Natural Computing Algorithms

Abstract

In this paper we present an approach to the part-of-speech tagging problem based on particle swarm optimization. The part-of-speech tagging is a key input feature for several other natural language processing tasks, like phrase chunking and named entity recognition. A tagger is a system that should receive a text, made of sentences, and, as output, should return the same text, but with each of its words associated with the correct part-of-speech tag. The task is not straightforward, since a large percentage of words have more than one possible part-of-speech tag, and the right choice is determined by the part-of-speech tags of the surrounding words, which can also have more than one possible tag. In this work we investigate the possibility of using a particle swarm optimization algorithm to solve the part-of-speech tagging problem supported by a set of disambiguation rules. The results we obtained on two different corpora are amongst the best ones published for those corpora.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Brants, T.: Tnt: a statistical part-of-speech tagger. In: Proceedings of the Sixth Conference on Applied Natural Language Processing, ANLC 2000, pp. 224–231. Association for Computational Linguistics, Stroudsburg (2000)
Chapter Google Scholar
Araujo, L.: Part-of-Speech Tagging with Evolutionary Algorithms. In: Gelbukh, A. (ed.) CICLing 2002. LNCS, vol. 2276, pp. 230–239. Springer, Heidelberg (2002)
Chapter Google Scholar
Araujo, L., Luque, G., Alba, E.: Metaheuristics for Natural Language Tagging. In: Deb, K., Tari, Z. (eds.) GECCO 2004. LNCS, vol. 3102, pp. 889–900. Springer, Heidelberg (2004)
Chapter Google Scholar
Alba, E., Luque, G., Araujo, L.: Natural language tagging with genetic algorithms. Inf. Process. Lett. 100(5), 173–182 (2006)
Article MathSciNet MATH Google Scholar
Brill, E.: Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging. Comput. Linguist. 21, 543–565 (1995)
Google Scholar
Wilson, G., Heywood, M.: Use of a genetic algorithm in brill’s transformation-based part-of-speech tagger. In: Proceedings of the 2005 Conference on Genetic and Evolutionary Computation, GECCO 2005, pp. 2067–2073. ACM, New York (2005)
Chapter Google Scholar
Nogueira dos Santos, C., Milidiú, R.L., Rentería, R.P.: Portuguese Part-of-Speech Tagging Using Entropy Guided Transformation Learning. In: Teixeira, A., de Lima, V.L.S., de Oliveira, L.C., Quaresma, P. (eds.) PROPOR 2008. LNCS (LNAI), vol. 5190, pp. 143–152. Springer, Heidelberg (2008)
Chapter Google Scholar
Steven Bird, E.K., Loper, E.: Natural Language Processing with Python. O’Reilly Media (2009)
Google Scholar
Poli, R.: Analysis of the publications on the applications of particle swarm optimisation. J. Artif. Evol. App., 4:1–4:10 (January 2008)
Google Scholar
Kennedy, J., Eberhart, R.C.: Swarm intelligence. Morgan Kaufmann Publishers Inc., San Francisco (2001)
Google Scholar
Hindle, D.: Acquiring disambiguation rules from text (1989)
Google Scholar

Download references

Author information

Authors and Affiliations

Escola Superior de Tecnologia do Instituto Politécnico de Castelo Branco, Portugal
Ana Paula Silva & Arlindo Silva
Universidade de Évora, Portugal
Irene Rodrigues

Authors

Ana Paula Silva
View author publications
You can also search for this author in PubMed Google Scholar
Arlindo Silva
View author publications
You can also search for this author in PubMed Google Scholar
Irene Rodrigues
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Départment des Systémes d’Information, Quartier UNIL-Dorigny, Bâtiment Internef, Université de Lausanne, 105, Lausanne, Switzerland
Marco Tomassini , Alberto Antonioni , Fabio Daolio & Pierre Buesser , , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Silva, A.P., Silva, A., Rodrigues, I. (2013). PSO-Tagger: A New Biologically Inspired Approach to the Part-of-Speech Tagging Problem. In: Tomassini, M., Antonioni, A., Daolio, F., Buesser, P. (eds) Adaptive and Natural Computing Algorithms. ICANNGA 2013. Lecture Notes in Computer Science, vol 7824. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37213-1_10

Download citation

DOI: https://doi.org/10.1007/978-3-642-37213-1_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37212-4
Online ISBN: 978-3-642-37213-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics