skip to main content
10.1145/3194932.3194938acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
research-article

Sentiment and politeness analysis tools on developer discussions are unreliable, but so are people

Published:02 June 2018Publication History

ABSTRACT

Many software engineering researchers use sentiment and politeness analysis tools to study the emotional environment within collaborative software development. However, papers that use these tools rarely establish their reliability. In this paper, we evaluate popular existing tools for sentiment and politeness detection over a dataset of 589 manually rated GitHub comments that represent developer discussions. We also develop a coding scheme on how to quantify politeness for conversational texts found on collaborative platforms. We find that not only do the tools have a low agreement with human ratings on sentiment and politeness, human raters also have a low agreement among themselves.

References

  1. Toufique Ahmed, Amiangshu Bosu, Anindya Iqbal, and Shahram Rahimi. 2017. SentiCR: a customized sentiment analysis tool for code review interactions. In Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering. IEEE Press, 106--111. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Steven Bird, Ewan Klein, and Edward Loper. 2009. Natural language processing with Python: analyzing text with the natural language toolkit. "O'Reilly Media, Inc.". Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Cássio Castaldi Araujo Blaz and Karin Becker. 2016. Sentiment analysis in tickets for it support. In Mining Software Repositories (MSR), 2016 IEEE/ACM 13th Working Conference on. IEEE, 235--246. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Chris Brown, Justin Middleton, Esha Sharma, and Emerson Murphy-Hill. {n. d.}. How Software Users Recommend Tools to Each Other. ({n. d.}).Google ScholarGoogle Scholar
  5. Penelope Brown and Stephen C Levinson. 1987. Politeness: Some universals in language usage. Vol. 4. Cambridge university press.Google ScholarGoogle ScholarCross RefCross Ref
  6. Fabio Calefato, Filippo Lanubile, Federico Maiorano, and Nicole Novielli. 2017. Sentiment Polarity Detection for Software Development. Empirical Software Engineering (2017), 1--31. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Wikipedia contributors. 2018. Politeness --- Wikipedia, The Free Encyclopedia. (2018). https://en.wikipedia.org/wiki/Politeness {Online; accessed 18 Jan, 2018}.Google ScholarGoogle Scholar
  8. Jonathan Culpeper. 1996. Towards an anatomy of impoliteness. Journal of pragmatics 25, 3 (1996), 349--367.Google ScholarGoogle ScholarCross RefCross Ref
  9. Bill Curtis, Herb Krasner, and Neil Iscoe. 1988. A field study of the software design process for large systems. Commun. ACM 31, 11 (1988), 1268--1287. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Cristian Danescu-Niculescu-Mizil, Moritz Sudhof, Dan Jurafsky, Jure Leskovec, and Christopher Potts. 2013. A computational approach to politeness with application to social factors. arXiv preprint arXiv:1306.6078 (2013).Google ScholarGoogle Scholar
  11. Prasun Dewan. 2015. Towards emotion-based collaborative software engineering. In Proceedings of the Eighth International Workshop on Cooperative and Human Aspects of Software Engineering. IEEE Press, 109--112. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Felipe Ebert, Fernando Castor, Nicole Novielli, and Alexander Serebrenik. 2017. Confusion detection in code reviews. In Software Maintenance and Evolution (ICSME), 2017 IEEE International Conference on. IEEE, 549--553.Google ScholarGoogle ScholarCross RefCross Ref
  13. Daviti Gachechiladze, Filippo Lanubile, Nicole Novielli, and Alexander Serebrenik. 2017. Anger and its direction in collaborative software development. In Proceedings of the 39th International Conference on Software Engineering: New Ideas and Emerging Results Track. IEEE Press, 11--14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Michael Gamon, Anthony Aue, Simon Corston-Oliver, and Eric Ringger. 2005. Pulse: Mining customer opinions from free text. Lecture notes in computer science 3646 (2005), 121--132. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. David Garcia, Marcelo Serrano Zanetti, and Frank Schweitzer. 2013. The role of emotions in contributors activity: A case study on the gentoo community. In Cloud and green computing (CGC), 2013 third international conference on. IEEE, 410--417. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Georgios Gousios, Bogdan Vasilescu, Alexander Serebrenik, and Andy Zaidman. 2014. Lean GHTorrent: GitHub data on demand. In Proceedings of the 11th working conference on mining software repositories. ACM, 384--387. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Daniel Graziotin, Xiaofeng Wang, and Pekka Abrahamsson. 2014. Happy software developers solve problems better: psychological measurements in empirical software engineering. PeerJ 2 (2014), e289.Google ScholarGoogle ScholarCross RefCross Ref
  18. Emitza Guzman, David Azócar, and Yang Li. 2014. Sentiment analysis of commit comments in GitHub: an empirical study. In Proceedings of the 11th Working Conference on Mining Software Repositories. ACM, 352--355. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Emitza Guzman and Bernd Bruegge. 2013. Towards emotional awareness in software development teams. In Proceedings of the 2013 9th joint meeting on foundations of software engineering. ACM, 671--674. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Md Rakibul Islam and Minhaz F Zibran. 2016. Towards understanding and exploiting developers' emotional variations in software engineering. In Software Engineering Research, Management and Applications (SERA), 2016 IEEE 14th International Conference on. IEEE, 185--192.Google ScholarGoogle Scholar
  21. Robbert Jongeling, Proshanta Sarkar, Subhajit Datta, and Alexander Serebrenik. 2017. On negative results when using sentiment analysis tools for software engineering research. Empirical Software Engineering (2017), 1--42. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Iftikhar Ahmed Khan, Willem-Paul Brinkman, and Robert M Hierons. 2011. Do moods affect programmersâÃŹ debug performance? Cognition, Technology & Work 13, 4 (2011), 245--258. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J Richard Landis and Gary G Koch. 1977. The measurement of observer agreement for categorical data. biometrics (1977), 159--174.Google ScholarGoogle Scholar
  24. Bin Lin, Fiorella Zampetti, Gabriele Bavota, Massimiliano Di Penta, Michele Lanza, and Rocco Oliveto. 2018. Sentiment Analysis for So ware Engineering: How Far Can We Go? (2018).Google ScholarGoogle Scholar
  25. Saif Mohammad. 2016. A Practical Guide to Sentiment Annotation: Challenges and Solutions.. In WASSA@ NAACL-HLT. 174--179.Google ScholarGoogle Scholar
  26. Alessandro Murgia, Parastou Tourani, Bram Adams, and Marco Ortu. 2014. Do developers feel emotions? an exploratory analysis of emotions in software artifacts. In Proceedings of the 11th working conference on mining software repositories. ACM, 262--271. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Nicole Novielli, Fabio Calefato, and Filippo Lanubile. 2014. Towards discovering the role of emotions in stack overflow. In Proceedings of the 6th international workshop on social software engineering. ACM, 33--36. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Nicole Novielli, Fabio Calefato, and Filippo Lanubile. 2015. The challenges of sentiment detection in the social programmer ecosystem. In Proceedings of the 7th International Workshop on Social Software Engineering. ACM, 33--40. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Marco Ortu, Bram Adams, Giuseppe Destefanis, Parastou Tourani, Michele Marchesi, and Roberto Tonelli. 2015. Are bullies more productive?: empirical study of affectiveness vs. issue fixing time. In Proceedings of the 12th Working Conference on Mining Software Repositories. IEEE Press, 303--313. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Marco Ortu, Giuseppe Destefanis, Mohamad Kassab, Steve Counsell, Michele Marchesi, and Roberto Tonelli. 2015. Would you mind fixing this issue?. In International Conference on Agile Software Development. Springer, 129--140.Google ScholarGoogle ScholarCross RefCross Ref
  31. Marco Ortu, Alessandro Murgia, Giuseppe Destefanis, Parastou Tourani, Roberto Tonelli, Michele Marchesi, and Bram Adams. 2016. The emotional side of software developers in JIRA. In 2016 IEEE/ACM 13th Working Conference on Mining Software Repositories (MSR). IEEE, 480--483. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Bo Pang, Lillian Lee, et al. 2008. Opinion mining and sentiment analysis. Foundations and Trends® in Information Retrieval 2, 1--2 (2008), 1--135. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Brian Parkinson, Peter Totterdell, Rob B Briner, and SA Reynolds. 1996. Changing moods: The psychology of mood and mood regulation. Longman.Google ScholarGoogle Scholar
  34. Daniel Pletea, Bogdan Vasilescu, and Alexander Serebrenik. 2014. Security and emotion: sentiment analysis of security discussions on GitHub. In Proceedings of the 11th working conference on mining software repositories. ACM, 348--351. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Athanasios-Ilias Rousinopoulos, Gregorio Robles, and Jesús M González-Barahona. 2014. SENTIMENT ANALYSIS OF FREE/OPEN SOURCE DEVELOPERS: PRELIMINARY FINDINGS FROM A CASE STUDY/ANÁLISE DE SENTIMENTOS DE DESENVOLVEDORES DE SOFTWARE LIVRE: ACHADOS PRELIMINARES DE UM ESTUDO DE CASO. Revista Electronica de Sistemas de Informaçao 13, 2 (2014), 1.Google ScholarGoogle Scholar
  36. Vinayak Sinha, Alina Lazar, and Bonita Sharif. 2016. Analyzing developer sentiment in commit logs. In Proceedings of the 13th International Conference on Mining Software Repositories. ACM, 520--523. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Richard Socher, Alex Perelygin, Jean Wu, Jason Chuang, Christopher D Manning, Andrew Ng, and Christopher Potts. 2013. Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of the 2013 conference on empirical methods in natural language processing. 1631--1642.Google ScholarGoogle Scholar
  38. Igor Steinmacher, Tayana Conte, Marco Aurélio Gerosa, and David Redmiles. 2015. Social barriers faced by newcomers placing their first contribution in open source software projects. In Proceedings of the 18th ACM conference on Computer supported cooperative work & social computing. ACM, 1379--1392. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Margaret-Anne Storey, Christoph Treude, Arie van Deursen, and Li-Te Cheng. 2010. The impact of social media on software engineering practices and tools. In Proceedings of the FSE/SDP workshop on Future of software engineering research. ACM, 359--364. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Mike Thelwall, Kevan Buckley, Georgios Paltoglou, Di Cai, and Arvid Kappas. 2010. Sentiment strength detection in short informal text. Journal of the Association for Information Science and Technology 61, 12 (2010), 2544--2558. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Jason Tsay, Laura Dabbish, and James Herbsleb. 2014. Let's talk about it: evaluating contributions through discussion in GitHub. In Proceedings of the 22nd ACM SIGSOFT international symposium on foundations of software engineering. ACM, 144--154. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Ning Wang, W Lewis Johnson, Richard E Mayer, Paola Rizzo, Erin Shaw, and Heather Collins. 2008. The politeness effect: Pedagogical agents and learning outcomes. International Journal of Human-Computer Studies 66, 2 (2008), 98--112. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    SEmotion '18: Proceedings of the 3rd International Workshop on Emotion Awareness in Software Engineering
    June 2018
    76 pages
    ISBN:9781450357517
    DOI:10.1145/3194932

    Copyright © 2018 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 2 June 2018

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

    Upcoming Conference

    ICSE 2025

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader