Abstract
We describe the verb lexicon of OpenWordNet-PT, a wordnet-like resource for (mostly Brazilian) Portuguese and a series of experiments that we designed to extend its coverage. These experiments include checking online lists of most common verbs, checking corpora freely available such as the Bosque-UD (the Bosque corpus annotated with Universal Dependencies) and especially checking a dictionary of Brazilian politicians’ biographies (the DHBB) that we consider an ideal corpus for the kind of information extraction we are after. We certainly succeeded into extending the coverage of the verb lexicon, however it remains to be seen whether this new coverage is enough for the original application.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
For up to date numbers check http://wnpt.brlcloud.com/wn/stats.
- 2.
- 3.
To be found at http://www.nilc.icmc.usp.br/verbnetbr/.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
References
Afonso, S., Bick, E., Haber, R., Santos, D.: Floresta sintá(c)tica: a treebank for Portuguese. In: Proceedings of LREC 2002 (2002)
van Assem, M., Gangemi, A., Schreiber, G.: RDF/OWL representation of WordNet. W3C Working Draft, World Wide Web Consortium, June 2006. http://www.w3.org/TR/2006/WD-wordnet-rdf-20060619/
Baptista, J.: Viper: a lexicon-grammar of european portuguese verbs. In: 31e Colloque International sur le Lexique et la Grammaire (2012)
Bick, E.: The Parsing System Palavras - Automatic Grammatical Analysis of Portuguese in a Constraint Grammar Famework. Ph.D. thesis, Aarhus University (2000)
Bond, F., Paik, K.: A survey of wordnets and their licenses. In: Proceedings of the 6th Global WordNet Conference (GWC 2012), Matsue, pp. 64–71 (2012)
Cançado, M., Godoy, L., Amaral, L.: The construction of a catalog of Brazilian Portuguese verbs. In: Jancsary, J. (ed.) Empirical Methods in Natural Language Processing: Proceedings of the Conference on Natural Language Processing (2012)
Carreras, X., Chao, I., Padró, L., Padró, M.: Freeling: an open-source suite of language analyzers. In: Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC 2004) (2004)
Chklovski, T., Pantel, P.: VerbOcean: mining the web for fine-grained semantic verb relations. In: Proceedings of EMNLP (2004)
Fellbaum, C. (ed.): WordNet: An Electronic Lexical Database (Language, Speech, and Communication). The MIT Press, Cambridge (1998)
Ferreira, J.P., Janssen, M., Correia, M., De Oliveira, G.M.: The common orthographic vocabulary of the Portuguese language: a set of open lexical resources for a pluricentric language. In: Proceedings of LREC (2012)
Lewis, M.P., Simons, G.F., Fennig, C.D. (eds.): Ethnologue: Languages of the World, 18th edn. SIL International, Dallas (2015). http://www.ethnologue.com
da-Silva, B.C.D., de Moraes, H.R.: A construção de um thesaurus para o português do Brasil. Alfa 47(2), 101–115 (2003)
de Paiva, V., Freitas, C., Real, L., Rademaker, A.: Improving the verb lexicon of OpenWordnet-PT. In: Alemany, L.A., Padró, M., Rademaker, A., Villavicencio, A. (eds.) Proceedings of Workshop on Tools and Resources for Automatically Processing Portuguese and Spanish (ToRPorEsp). Biblioteca Digital Brasileira de Computação, UFMG, Brazil, São Carlos, Brazil, October 2014. http://www.lbd.dcc.ufmg.br/bdbcomp/servlet/Evento?id=755
de Paiva, V., Rademaker, A., de Melo, G.: Openwordnet-pt: an open Brazilian Wordnet for reasoning. In: Proceedings of COLING 2012: Demonstration Papers, pp. 353–360. The COLING 2012 Organizing Committee, Mumbai, December 2012. http://www.aclweb.org/anthology/C12-3044, Published also as Techreport. http://hdl.handle.net/10438/10274
Paiva, V.D., Oliveira, D., Higuchi, S., Rademaker, A., Melo, G.D.: Exploratory information extraction from a historical dictionary. In: IEEE 10th International Conference on e-Science (e-Science). vol. 2, pp. 11–18. IEEE, October 2014
Pease, A.: Ontology: A Practical Guide. Articulate Software Press, Angwin (2011)
Real, L., Chalub, F., de Paiva, V., Freitas, C., Rademaker, A.: Seeing is correcting: curating lexical resources using social interfaces. In: Proceedings of 53rd Annual Meeting of the Association for Computational Linguistics and The 7th International Joint Conference on Natural Language Processing of Asian Federation of Natural Language Processing - Fourth Workshop on Linked Data in Linguistics: Resources and Applications (LDL 2015), Beijing, China, July 2015
Scarton, C., Aluísio, S.: Towards a cross-linguistic Verbnet-style lexicon for Brazilian Portuguese. In: Workshop on Creating Cross-language Resources for Disconnected Languages and Styles Workshop Programme, p. 11 (2012)
Scarton, C., Sun, L., Kipper-Schuler, K., Duran, M.S., Palmer, M., Korhonen, A.: Verb clustering for brazilian portuguese. In: Gelbukh, A. (ed.) CICLing 2014, Part I. LNCS, vol. 8403, pp. 25–39. Springer, Heidelberg (2014)
Scarton, C.E.: VerbNet.BR: construção semiautomática de um léxico computacional de verbos para o português do brasil. In: 8th Brazilian Symposium in Information and Human Language Technology, pp. 20–29 (2011)
Silveira, N., Dozat, T., de Marneffe, M.C., Bowman, S., Connor, M., Bauer, J., Manning, C.D.: A gold standard dependency corpus for English. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014) (2014)
Simões, A., Guinovart, X.G.: Bootstrapping a Portuguese wordnet from Galician, Spanish and English wordnets. In: Navarro Mesa, J.L., Ortega, A., Teixeira, A., Hernández Pérez, E., Quintana Morales, P., Ravelo García, A., Guerra Moreno, I., Toledano, D.T. (eds.) IberSPEECH 2014. LNCS, vol. 8854, pp. 239–248. Springer, Heidelberg (2014). http://dx.doi.org/10.1007/978-3-319-13623-3_25
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
de Paiva, V., Chalub, F., Real, L., Rademaker, A. (2016). Making Virtue of Necessity: A Verb Lexicon. In: Silva, J., Ribeiro, R., Quaresma, P., Adami, A., Branco, A. (eds) Computational Processing of the Portuguese Language. PROPOR 2016. Lecture Notes in Computer Science(), vol 9727. Springer, Cham. https://doi.org/10.1007/978-3-319-41552-9_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-41552-9_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41551-2
Online ISBN: 978-3-319-41552-9
eBook Packages: Computer ScienceComputer Science (R0)