skip to main content
10.1145/3330204.3330219acmotherconferencesArticle/Chapter ViewAbstractPublication PagessbsiConference Proceedingsconference-collections
research-article

Enrichment of dictionaries to improve the automatic classification of feelings in postings related to the use of systems

Published:20 May 2019Publication History

ABSTRACT

This work proposes an investigation to improve the efficiency of a lexical-based classifier, the SentiStrength, for automatic sentiment detection in postings related to the use of systems. To achieve this goal, the TF-IDF metric was used to select words that are related to the domain of the posts, which will enrich the dictionary used by the tool to generate the polarity of the posts. The efficiency of a dictionarie enriched with words in their root form and a dictionarie enriched with lematized words will also be investigated. The research was conducted with 2108 sentences extracted from the reviews section of the Play Store on urban mobility applications, such as Waze, Google Maps and GPS Brazil. One of the results obtained was a 7.3 % increase in the accuracy of the classifier when using enriched dictionaries.

References

  1. Steven Bird and Edward Loper. 2004. NLTK: the natural language toolkit. In Proceedings of the ACL 2004 on Interactive poster and demonstration sessions. Association for Computational Linguistics, 31. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Tawunrat Chalothorn and Jeremy Ellman. 2012. Sentiment analysis of web forums: Comparison between sentiwordnet and sentistrength. The 4th International Conference on Computer Technology and Development (ICCTD 2012). 24-25 November 2012.Google ScholarGoogle ScholarCross RefCross Ref
  3. Thiago Hellen O da Silva, Lavínia Matoso Freitas, and Marília Soares Mendes. 2017. Beyond traditional evaluations: user's view in app stores. In Proceedings of the XVI Brazilian Symposium on Human Factors in Computing Systems. ACM, 15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. JL De Lucca and Maria das Graças Volpe Nunes. 2002. Lematização versus Stemming. USP, UFSCar, UNESP, São Carlos, São Paulo (2002).Google ScholarGoogle Scholar
  5. Steffen Hedegaard and Jakob Grue Simonsen. 2013. Extracting usability and user experience information from online user reviews. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM, 2089--2098. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Hannu Korhonen, Juha Arrasvuori, and Kaisa Väänänen-Vainio-Mattila. 2010. Let users tell the story: evaluating user experience with experience reports. In CHI'10 Extended Abstracts on Human Factors in Computing Systems. ACM, 4051--4056. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Afonso Matheus Sousa Lima, Paloma Bispo dos Santos Silva Silva, Lívia Almada Cruz, and Marilia Soares Mendes. 2017. Investigating the polarity of user postings in a Social System. In International Conference on Social Computing and Social Media. Springer, 246--257.Google ScholarGoogle Scholar
  8. Steven Loria, P Keen, M Honnibal, R Yankovsky, D Karesh, E Dempsey, et al. 2014. Textblob: simplified text processing. Secondary TextBlob: Simplified Text Processing (2014).Google ScholarGoogle Scholar
  9. Marilia S. Mendes. 2015. MALTU -- Um modelo para avaliação da interação em sistemas sociais a partir da linguagem textual do usuário. Ph.D. Dissertation. Universidade Federal do Ceará, Programa de Pós-Graduação em Ciência da Computação, Fortaleza.Google ScholarGoogle Scholar
  10. Marilia S Mendes and Elizabeth Furtado. 2018. An Experience of Textual Evaluation Using the MALTU Methodology. In International Conference on Social Computing and Social Media. Springer, 236--246.Google ScholarGoogle Scholar
  11. Marília S Mendes, Elizabeth Furtado, Vasco Furtado, and Miguel F de Castro. 2014. How do users express their emotions regarding the social system in use? A classification of their postings by using the emotional analysis of Norman. In International Conference on Social Computing and Social Media. Springer, 229--241. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Marília S Mendes, Elizabeth Furtado, Vasco Furtado, and Miguel F de Castro. 2015. Investigating Usability and User Experience from the user postings in Social Systems. In International Conference on Social Computing and Social Media. Springer, 216--228.Google ScholarGoogle ScholarCross RefCross Ref
  13. Marília Soares Mendes and Elizabeth Sucupira Furtado. 2017. UUX-Posts: a tool for extracting and classifying postings related to the use of a system. In Proceedings of the 8th Latin American Conference on Human-Computer Interaction. ACM, 2. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Joel Larocca Neto, Alexandre D Santos, Celso AA Kaestner, Neto Alexandre, D Santos, et al. 2000. Document clustering and text summarization. (2000).Google ScholarGoogle Scholar
  15. Developers of Scrapy. 2016. Scrapy 1.5 documentation. https://docs.scrapy.org/en/latest/Google ScholarGoogle Scholar
  16. Thomas Olsson and Markus Salo. 2012. Narratives of satisfying and unsatisfying experiences of current mobile augmented reality applications. In Proceedings of the SIGCHI conference on human factors in computing systems. ACM, 2779--2788. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Viviane Orengo and Christian Huyck. 2001. A stemming algorithmm for the portuguese language. In spire. IEEE, 0186.Google ScholarGoogle Scholar
  18. Timo Partala and Aleksi Kallinen. 2011. Understanding the most satisfying and unsatisfying user experiences: Emotions, psychological needs, and context. Interacting with computers 24, 1 (2011), 25--34. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Juan Ramos et al. 2003. Using tf-idf to determine word relevance in document queries. In Proceedings of the first instructional conference on machine learning, Vol. 242. 133--142.Google ScholarGoogle Scholar
  20. Vitor Rolim, Rafael Ferreira, and Evandro Costa. 2016. Identificação Automática de Dúvidas em Fóruns Educacionais. In Brazilian Symposium on Computers in Education (Simpósio Brasileiro de Informática na Educação-SBIE), Vol. 27. Sociedade Brasileira de Computação, Uberlândia, 936.Google ScholarGoogle ScholarCross RefCross Ref
  21. Gayane Shalunts, Gerhard Backfried, and Prinz Prinz. 2014. Sentiment analysis of German social media data for natural disasters.. In ISCRAM.Google ScholarGoogle Scholar
  22. Mike Thelwall. 2017. The Heart and soul of the web? Sentiment strength detection in the social web with SentiStrength. In Cyberemotions. Springer, 119--134.Google ScholarGoogle Scholar
  23. Bruno Trstenjak, Sasa Mikac, and Dzenana Donko. 2014. KNN with TF-IDF based Framework for Text Categorization. Procedia Engineering 69 (2014), 1356--1364.Google ScholarGoogle ScholarCross RefCross Ref
  24. Alexandre N Tuch, Rune Trusell, and Kasper Hornbæk. 2013. Analyzing users' narratives to understand experience with interactive products. In Proceedings of the SIGCHI Conference on human factors in computing systems. ACM, 2079--2088. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. David Vilares, Mike Thelwall, and Miguel A Alonso. 2015. The megaphone of the people? Spanish SentiStrength for real-time analysis of political tweets. Journal of Information Science 41, 6 (2015), 799--813. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Enrichment of dictionaries to improve the automatic classification of feelings in postings related to the use of systems

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          SBSI '19: Proceedings of the XV Brazilian Symposium on Information Systems
          May 2019
          623 pages
          ISBN:9781450372374
          DOI:10.1145/3330204

          Copyright © 2019 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 20 May 2019

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          Overall Acceptance Rate181of557submissions,32%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader