research-article

Enrichment of dictionaries to improve the automatic classification of feelings in postings related to the use of systems

Authors:

Afonso Matheus Sousa Lima,

Marilia Soares Mendes,

Lívia Almada CruzAuthors Info & Claims

SBSI '19: Proceedings of the XV Brazilian Symposium on Information Systems

Article No.: 10, Pages 1 - 8

https://doi.org/10.1145/3330204.3330219

Published: 20 May 2019 Publication History

Abstract

This work proposes an investigation to improve the efficiency of a lexical-based classifier, the SentiStrength, for automatic sentiment detection in postings related to the use of systems. To achieve this goal, the TF-IDF metric was used to select words that are related to the domain of the posts, which will enrich the dictionary used by the tool to generate the polarity of the posts. The efficiency of a dictionarie enriched with words in their root form and a dictionarie enriched with lematized words will also be investigated. The research was conducted with 2108 sentences extracted from the reviews section of the Play Store on urban mobility applications, such as Waze, Google Maps and GPS Brazil. One of the results obtained was a 7.3 % increase in the accuracy of the classifier when using enriched dictionaries.

References

[1]

Steven Bird and Edward Loper. 2004. NLTK: the natural language toolkit. In Proceedings of the ACL 2004 on Interactive poster and demonstration sessions. Association for Computational Linguistics, 31.

Digital Library

[2]

Tawunrat Chalothorn and Jeremy Ellman. 2012. Sentiment analysis of web forums: Comparison between sentiwordnet and sentistrength. The 4th International Conference on Computer Technology and Development (ICCTD 2012). 24-25 November 2012.

[3]

Thiago Hellen O da Silva, Lavínia Matoso Freitas, and Marília Soares Mendes. 2017. Beyond traditional evaluations: user's view in app stores. In Proceedings of the XVI Brazilian Symposium on Human Factors in Computing Systems. ACM, 15.

Digital Library

[4]

JL De Lucca and Maria das Graças Volpe Nunes. 2002. Lematização versus Stemming. USP, UFSCar, UNESP, São Carlos, São Paulo (2002).

[5]

Steffen Hedegaard and Jakob Grue Simonsen. 2013. Extracting usability and user experience information from online user reviews. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM, 2089--2098.

Digital Library

[6]

Hannu Korhonen, Juha Arrasvuori, and Kaisa Väänänen-Vainio-Mattila. 2010. Let users tell the story: evaluating user experience with experience reports. In CHI'10 Extended Abstracts on Human Factors in Computing Systems. ACM, 4051--4056.

Digital Library

[7]

Afonso Matheus Sousa Lima, Paloma Bispo dos Santos Silva Silva, Lívia Almada Cruz, and Marilia Soares Mendes. 2017. Investigating the polarity of user postings in a Social System. In International Conference on Social Computing and Social Media. Springer, 246--257.

[8]

Steven Loria, P Keen, M Honnibal, R Yankovsky, D Karesh, E Dempsey, et al. 2014. Textblob: simplified text processing. Secondary TextBlob: Simplified Text Processing (2014).

[9]

Marilia S. Mendes. 2015. MALTU -- Um modelo para avaliação da interação em sistemas sociais a partir da linguagem textual do usuário. Ph.D. Dissertation. Universidade Federal do Ceará, Programa de Pós-Graduação em Ciência da Computação, Fortaleza.

[10]

Marilia S Mendes and Elizabeth Furtado. 2018. An Experience of Textual Evaluation Using the MALTU Methodology. In International Conference on Social Computing and Social Media. Springer, 236--246.

[11]

Marília S Mendes, Elizabeth Furtado, Vasco Furtado, and Miguel F de Castro. 2014. How do users express their emotions regarding the social system in use? A classification of their postings by using the emotional analysis of Norman. In International Conference on Social Computing and Social Media. Springer, 229--241.

Digital Library

[12]

Marília S Mendes, Elizabeth Furtado, Vasco Furtado, and Miguel F de Castro. 2015. Investigating Usability and User Experience from the user postings in Social Systems. In International Conference on Social Computing and Social Media. Springer, 216--228.

[13]

Marília Soares Mendes and Elizabeth Sucupira Furtado. 2017. UUX-Posts: a tool for extracting and classifying postings related to the use of a system. In Proceedings of the 8th Latin American Conference on Human-Computer Interaction. ACM, 2.

Digital Library

[14]

Joel Larocca Neto, Alexandre D Santos, Celso AA Kaestner, Neto Alexandre, D Santos, et al. 2000. Document clustering and text summarization. (2000).

[15]

Developers of Scrapy. 2016. Scrapy 1.5 documentation. https://docs.scrapy.org/en/latest/

[16]

Thomas Olsson and Markus Salo. 2012. Narratives of satisfying and unsatisfying experiences of current mobile augmented reality applications. In Proceedings of the SIGCHI conference on human factors in computing systems. ACM, 2779--2788.

Digital Library

[17]

Viviane Orengo and Christian Huyck. 2001. A stemming algorithmm for the portuguese language. In spire. IEEE, 0186.

[18]

Timo Partala and Aleksi Kallinen. 2011. Understanding the most satisfying and unsatisfying user experiences: Emotions, psychological needs, and context. Interacting with computers 24, 1 (2011), 25--34.

Digital Library

[19]

Juan Ramos et al. 2003. Using tf-idf to determine word relevance in document queries. In Proceedings of the first instructional conference on machine learning, Vol. 242. 133--142.

[20]

Vitor Rolim, Rafael Ferreira, and Evandro Costa. 2016. Identificação Automática de Dúvidas em Fóruns Educacionais. In Brazilian Symposium on Computers in Education (Simpósio Brasileiro de Informática na Educação-SBIE), Vol. 27. Sociedade Brasileira de Computação, Uberlândia, 936.

[21]

Gayane Shalunts, Gerhard Backfried, and Prinz Prinz. 2014. Sentiment analysis of German social media data for natural disasters. In ISCRAM.

[22]

Mike Thelwall. 2017. The Heart and soul of the web? Sentiment strength detection in the social web with SentiStrength. In Cyberemotions. Springer, 119--134.

[23]

Bruno Trstenjak, Sasa Mikac, and Dzenana Donko. 2014. KNN with TF-IDF based Framework for Text Categorization. Procedia Engineering 69 (2014), 1356--1364.

[24]

Alexandre N Tuch, Rune Trusell, and Kasper Hornbæk. 2013. Analyzing users' narratives to understand experience with interactive products. In Proceedings of the SIGCHI Conference on human factors in computing systems. ACM, 2079--2088.

Digital Library

[25]

David Vilares, Mike Thelwall, and Miguel A Alonso. 2015. The megaphone of the people? Spanish SentiStrength for real-time analysis of political tweets. Journal of Information Science 41, 6 (2015), 799--813.

Digital Library

Cited By

Silva PSilva TMendes MFurtado MGuimaraes RSaibel Santos Cde Santana VKronbauer A(2019)Temporal analysis of posts related to useProceedings of the 18th Brazilian Symposium on Human Factors in Computing Systems10.1145/3357155.3358482(1-10)Online publication date: 22-Oct-2019
https://dl.acm.org/doi/10.1145/3357155.3358482

Index Terms

Enrichment of dictionaries to improve the automatic classification of feelings in postings related to the use of systems
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Lexical semantics
  2. Machine learning
2. General and reference
  1. Cross-computing tools and techniques
    1. Evaluation

Recommendations

Data-driven integration of multiple sentiment dictionaries for lexicon-based sentiment classification of product reviews

In lexicon-based sentiment classification, the problem of contextual polarity must be explicitly handled since it is a major cause for classification error. One way to handle contextual polarity is to revise the prior polarity of the sentiment ...
An automatic non-English sentiment lexicon builder using unannotated corpus

Sentiment lexicons in the English language are widely accessible while in many other languages, these resources are extremely deficient. Current techniques and methods for sentiment analysis focus mainly on the English language, whereas other languages ...
Brand-Related Events Detection, Classification and Summarization on Twitter
WI-IAT '12: Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01

The huge and ever increasing amount of text generated by Twitter users everyday embeds a wealth of information, in particular, about themes that become suddenly relevant to many users as well as about the sentiment polarity that users tend to associate ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

SBSI '19: Proceedings of the XV Brazilian Symposium on Information Systems

May 2019

623 pages

ISBN:9781450372374

DOI:10.1145/3330204

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

SBC: Brazilian Computer Society

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 May 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

SBSI'19

SBSI'19: XV Brazilian Symposium on Information Systems

May 20 - 24, 2019

Aracaju, Brazil

Acceptance Rates

Overall Acceptance Rate 181 of 557 submissions, 32%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
55
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 07 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Silva PSilva TMendes MFurtado MGuimaraes RSaibel Santos Cde Santana VKronbauer A(2019)Temporal analysis of posts related to useProceedings of the 18th Brazilian Symposium on Human Factors in Computing Systems10.1145/3357155.3358482(1-10)Online publication date: 22-Oct-2019
https://dl.acm.org/doi/10.1145/3357155.3358482

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten