Abstract
The growth of social media and user-generated content (UGC) on the Internet provides a huge quantity of information that allows discovering the experiences, opinions, and feelings of users or customers. Opinion Mining (OM) is a sub-field of text mining in which the main task is to extract opinions from UGC. Given that Portuguese is one of the most common spoken languages in the world, and it is also the second most frequent on Twitter, the goal of this work is to plot the landscape of current studies that relates the application of OM for Portuguese. A systematic mapping review (SMR) method was applied to search, select and to extract data from the included studies. Manual and automated searches retrieved 6075 studies up to year 2014, from which 25 articles were included. Almost 70 % of all approaches focus on the Brazilian Portuguese variant. Naïve Bayes and Support Vector Machine were the main classifiers and SentiLex-PT was the most used lexical resource. Portugal and Brazil are the main contributors in processing the Portuguese language.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
Brazil (202,7 M), Mozambique (24,7 M), Angola (24,3 M), Portugal (10,9 M), Guinea-Bissau (1,7 M), East Timor (1,2 M), Equatorial Guinea (722,254), Macau (587,914), Cabo Verde (538,535) and São Tomé e Príncipe (190,428). From US/CIA - World Factbook (July, 2014).
- 2.
IEEE Xplore, ACM, Science Direct, Scopus, Portal de Periódicos Capes and SciELO.
- 3.
Inter. Conf. Comput. Processing of Portuguese (PROPOR), Text Mining and Applications (TEMA), Brazilian Workshop of Social Network Analysis and Mining (BRASNAM), Brazilian Symposium on Information and Human Language Technology (STIL), ACM symposium on Document engineering (DocEng), Linguateca Database (www.linguateca.pt), Message Understanding Conferences (MUC), Text Analysis Conference (TAC), Text REtrieval Conference (TREC), Document Understanding Conference (DUC).
References
Marine-Roig, E.: Anton Clavé, S.: Tourism analytics with massive user-generated content: a case study of Barcelona. J. Destin Mark. Manag. 4, 1–11 (2015)
Ravi, K., Ravi, V.: A survey on opinion mining and sentiment analysis: tasks, approaches and applications. Knowl. Based Syst. 89, 14–46 (2015)
Balazs, J.A., Velásquez, J.D.: Opinion mining and information fusion: a survey. Inf. Fusion. 27, 95–110 (2016)
Hotho, A., Andreas, N., Paaß, G., Augustin, S.: A brief survey of text mining, pp. 1–37 (2005)
Tan, A.: Text mining: the state of the art and the challenges concept-based. In: Proceedings of the PAKDD 1999 Workshop on Knowledge Disocovery from Advanced Databases, pp.65–70 (1999)
Poblete, B., Garcia, R., Mendoza, M., Jaimes, A.: Do all birds tweet the same? Characterizing twitter around the world categories and subject descriptors. In: Society, pp. 1025–1030 (2011)
Petersen, K., Feldt, R., Mujtaba, S., Mattsson, M.: Systematic mapping studies in software engineering, pp. 1–10 (2007)
Kitchenham, B., Charters, S.: Guidelines for performing systematic literature reviews in software engineering. Technical Report EBSE-2007-01, School of Computer Science and Mathematics, Keele University (2007)
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends® Inf. Retr. 2, 1–135 (2008)
Varghese, R., Jayasree, M.: A Survey on Sentiment Analysis and Opinion Mining, pp. 312–317 (2013). www.Ijret.Org
Vinodhini, G., Chandrasekaran, R.: Sentiment analysis and opinion mining: a Survey. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 2, 282–292 (2012)
Medhat, W., Hassan, A., Korashy, H.: Sentiment analysis algorithms and applications: a survey. Ain Shams Eng. J. 5, 1093–1113 (2014)
Pardo, T., Gasperin, C., Caseli, H., Nunes, M.D.G.V.: Computational linguistics in Brazil: an overview. In: Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas, pp. 1–7 (2010)
da Silva Conrado, M., Felippo, A., Pardo, T.S., Rezende, S.: A survey of automatic term extraction for Brazilian Portuguese. J. Braz. Comput. Soc. 20, 12 (2014)
de Freitas, L.A., Vieira, R.: Ontology-based feature level opinion mining for Portuguese reviews. In: WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web, pp. 367–370 (2013)
Souza, M., Vieira, R., Chishman, R., Alves, I.M.: Construction of a Portuguese opinion lexicon from multiple resources. In: Proceedings of the 8th Brazilian Symposium in Information and Human Language Technology, pp. 59–66 (2011)
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37, 267–307 (2011)
da Fonseca Carvalho, P.C.Q.: Análise e representação de construções adjectivais para processamento automático de texto: adjectivos intransitivos humanos (2007)
Evangelista, T.R., Padilha, T.P.P.: Monitoramento de Posts Sobre Empresas de E-Commerce em Redes Sociais Utilizando Análise de Sentimentos (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Souza, E., Vitório, D., Castro, D., Oliveira, A.L.I., Gusmão, C. (2016). Characterizing Opinion Mining: A Systematic Mapping Study of the Portuguese Language. In: Silva, J., Ribeiro, R., Quaresma, P., Adami, A., Branco, A. (eds) Computational Processing of the Portuguese Language. PROPOR 2016. Lecture Notes in Computer Science(), vol 9727. Springer, Cham. https://doi.org/10.1007/978-3-319-41552-9_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-41552-9_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41551-2
Online ISBN: 978-3-319-41552-9
eBook Packages: Computer ScienceComputer Science (R0)