ABSTRACT
This work investigates how to improve the efficiency of SentiStrength, a lexicon-based classifier, for automatic sentiment detection in postings related to the use of systems. To that end, the TF-IDF metric was used to select words related to the domain of the posts; these words enrich the dictionary that the tool uses to determine the polarity of each post. The efficiency of a dictionary enriched with words in their root form is also compared with that of a dictionary enriched with lemmatized words. The research was conducted with 2108 sentences extracted from the Play Store reviews sections of urban mobility applications such as Waze, Google Maps, and GPS Brazil. Among the results, the classifier's accuracy increased by 7.3% when enriched dictionaries were used.
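The word-selection step can be illustrated with a minimal sketch. It assumes each review is already tokenized, scores every term by its highest TF-IDF value across reviews, and proposes the top-scoring terms that the sentiment dictionary does not yet contain. The function names, the toy English reviews, the `top_k` cutoff, and the max-over-documents aggregation are all illustrative assumptions; the abstract does not specify how the scores were aggregated or how polarity values were assigned to the selected words.

```python
import math
from collections import Counter


def tfidf_scores(documents):
    """Return, for each term, its highest TF-IDF score across the documents.

    `documents` is a list of token lists. Terms that occur in every
    document get an IDF of zero and are left out of the result.
    """
    n_docs = len(documents)
    doc_freq = Counter()
    for tokens in documents:
        doc_freq.update(set(tokens))  # count each term once per document

    best = {}
    for tokens in documents:
        if not tokens:
            continue
        counts = Counter(tokens)
        for term, count in counts.items():
            tf = count / len(tokens)
            idf = math.log(n_docs / doc_freq[term])
            score = tf * idf
            if score > best.get(term, 0.0):
                best[term] = score
    return best


def candidate_terms(documents, known_terms, top_k=3):
    """Pick the top_k highest-scoring terms missing from the dictionary."""
    scores = tfidf_scores(documents)
    fresh = {t: s for t, s in scores.items() if t not in known_terms}
    # Sort by descending score, breaking ties alphabetically for stability.
    return sorted(fresh, key=lambda t: (-fresh[t], t))[:top_k]


# Toy data (hypothetical): three tokenized reviews and a one-word lexicon.
reviews = [
    "the app crashes on every route".split(),
    "the route recalculates fast".split(),
    "the map is great".split(),
]
lexicon = {"great"}  # words the sentiment dictionary already contains

candidates = candidate_terms(reviews, lexicon)
print(candidates)
```

In this sketch the selected terms are only candidates: each one would still need a polarity value before being added to the dictionary, and applying stemming or lemmatization to the tokens first would yield the two dictionary variants that the abstract compares.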
Index Terms
- Enrichment of dictionaries to improve the automatic classification of feelings in postings related to the use of systems