ABSTRACT
Developers from open-source communities have reported high stress levels from frequent demands for features and bug fixes and from the sometimes aggressive tone of these demands. Toxic conversations may demotivate and burn out developers, creating challenges for sustaining open source. We outline a path toward finding, understanding, and possibly mitigating such unhealthy interactions. We take a first step toward finding them, by developing and demonstrating a measurement instrument (an SVM classifier tailored for software engineering) to detect toxic discussions in GitHub issues. We used our classifier to analyze trends over time and in different GitHub communities, finding that toxicity varies by community and that toxicity decreased between 2012 and 2018.
- Anonymous. Leaving toxic open source communities, 2014. modelviewculture.com/pieces/leaving-toxic-open-source-communities.Google Scholar
- Christopher Bogart, Christian Kästner, James Herbsleb, and Ferdian Thung. How to break an api: Cost negotiation and community values in three software ecosystems. In Proc. FSE, pages 109--120, 2016.Google ScholarDigital Library
- Luke Breitfeller, Emily Ahn, David Jurgens, and Yulia Tsvetkov. Finding microaggressions in the wild: A case for locating elusive phenomena in social media posts. In Proc. Conf. Empirical Methods in Natural Language Processing, 2019.Google ScholarCross Ref
- Fabio Calefato, Filippo Lanubile, Federico Maiorano, and Nicole Novielli. Sentiment polarity detection for software development. Empirical Software Engineering, 23(3):1352--1382, 2018.Google ScholarDigital Library
- Brett Cannon, Adam Stacoviak, and Jerod Santo. The changelog, episode 318: A call for kindness in open source, October 2018. Podcast.Google Scholar
- Laura Dabbish, Colleen Stuart, Jason Tsay, and Jim Herbsleb. Social coding in GitHub: transparency and collaboration in an open software repository. In Proc. CSCW, pages 1277--1286, 2012.Google ScholarDigital Library
- Cristian Danescu-Niculescu-Mizil, Moritz Sudhof, Dan Jurafsky, Jure Leskovec, and Christopher Potts. A computational approach to politeness with application to social factors. In Proc. ACL, pages 250--259, 2013.Google Scholar
- Nemanja Djuric, Jing Zhou, Robin Morris, Mihajlo Grbovic, Vladan Radosavljevic, and Narayan Bhamidipati. Hate speech detection with comment embeddings. In Proc. WWW, pages 29--30, 2015.Google ScholarDigital Library
- Nadia Eghbal. Roads and bridges: The unseen labor behind our digital infrastructure. Technical report, Ford Foundation, 2016.Google Scholar
- Tom Fearn. Double cross-validation. NIR news, 21(5):14--15, 2010.Google ScholarCross Ref
- Darja Fišer, Ruihong Huang, Vinodkumar Prabhakaran, Rob Voigt, Zeerak Waseem, and Jacqueline Wernimont, editors. Proc. Works. Abusive Language Online, 2018.Google Scholar
- Paula Fortuna and Sérgio Nunes. A survey on automatic detection of hate speech in text. ACM Computing Surveys (CSUR), 51(4):85, 2018.Google Scholar
- Daviti Gachechiladze, Filippo Lanubile, Nicole Novielli, and Alexander Serebrenik. Anger and its direction in collaborative software development. In Proc. ICSE-NIER, pages 11--14, 2017.Google ScholarDigital Library
- Georgios Gousios. The GHTorrent dataset and tool suite. In Proc. MSR, pages 233--236, 2013.Google Scholar
- Joshua Guberman, Carol Schmitz, and Libby Hemphill. Quantifying toxicity and verbal violence on Twitter. In Proc. CSCW Companion, pages 277--280, 2016.Google ScholarDigital Library
- Emitza Guzman, David Azócar, and Yang Li. Sentiment analysis of commit comments in GitHub: an empirical study. In Proc. MSR, pages 352--355. ACM, 2014.Google ScholarDigital Library
- Eran Hammer. A new social contract for open source, February 2018. Blog post, hueniverse.com/86d1fcf3e353.Google Scholar
- Guido Hertel, Sven Niedner, and Stefanie Herrmann. Motivation of software developers in open source projects: an internet-based survey of contributors to the Linux kernel. Research Policy, 32(7):1159--1177, 2003.Google ScholarCross Ref
- Eric von Hippel and Georg von Krogh. Open source software and the "private-collective" innovation model. Organization Science, 14(2):209--223, 2003.Google ScholarDigital Library
- Clayton J Hutto and Eric Gilbert. Vader: A parsimonious rule-based model for sentiment analysis of social media text. In Proc. ICWSM, 2014.Google Scholar
- Carlos Jensen, Scott King, and Victor Kuechler. Joining free/open source software communities: An analysis of newbies' first interactions on project mailing lists. In Proc. HICSS, pages 1--10. IEEE, 2011.Google Scholar
- Robbert Jongeling, Proshanta Sarkar, Subhajit Datta, and Alexander Serebrenik. On negative results when using sentiment analysis tools for software engineering research. Empirical Software Engineering, 22(5):2543--2584, 2017.Google ScholarDigital Library
- Dan Jurafsky and James Martin. Speech and language processing. Pearson, 2014.Google ScholarDigital Library
- Sandeep Krishnamurthy. On the intrinsic and extrinsic motivation of floss developers. Knowledge, Technology & Policy, 18(4):17--39, 2006.Google ScholarCross Ref
- Nolan Lawson. What it feels like to be an open-source maintainer, March 2017. Blog post, nolanlawson.com/2017/03/05/.Google Scholar
- Jan Lehnardt. Sustainable open source: The maintainers perspective or: How i learned to stop caring and love open source, March 2017. Blog post, writing.jan.io.Google Scholar
- Pia Mancini et al. Sustain: A one day conversation for open source software sustainers - the report. Technical report, Sustain Conference Organization, 2017.Google Scholar
- Molly K. Maskrey. Career Direction, pages 25--70. Apress, Berkeley, CA, 2016.Google Scholar
- Burt L Monroe, Michael P Colaresi, and Kevin M Quinn. Fightin'words: Lexical feature selection and evaluation for identifying the content of political conflict. Political Analysis, 16(4):372--403, 2008.Google ScholarCross Ref
- Bo Pang, Lillian Lee, et al. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2(1--2):1--135, 2008.Google Scholar
- Haji Mohammad Saleem, Kelly P Dillon, Susan Benesch, and Derek Ruths. A web of hate: Tackling hateful speech in online social spaces. First Workshop on Text Analytics for Cybersecurity and Online Safety at LREC 2016, 2016.Google Scholar
- Anna Schmidt and Michael Wiegand. A survey on hate speech detection using natural language processing. In Proc. Int'l Works. Natural Language Processing for Social Media, pages 1--10, 2017.Google ScholarCross Ref
- Robert Speer, Joshua Chin, Andrew Lin, Lance Nathan, and Sara Jewett. wordfreq: v1. 5.1, 2016.Google Scholar
- Igor Steinmacher, Marco Aurelio Graciotto Silva, Marco Aurelio Gerosa, and David F Redmiles. A systematic literature review on the barriers faced by newcomers to open source software projects. Information and Software Technology, 59:67--85, 2015.Google ScholarDigital Library
- Yla R Tausczik and James W Pennebaker. The psychological meaning of words: Liwc and computerized text analysis methods. Journal of Language and Social Psychology, 29(1):24--54, 2010.Google ScholarCross Ref
- Zeerak Waseem, Wendy Hui Kyong Chung, Dirk Hovy, and Joel Tetreault, editors. Proc. Works. Abusive Language Online. ACL, 2017.Google Scholar
- Tim Wood. moment().endof('term'), July 2016. Blog post, https://medium.com/timrwood/moment-endof-term-522d8965689.Google Scholar
- Frances Zlotnick. Github open source survey 2017, June 2017. Google ScholarCross Ref
Recommendations
Open source software licenses: Strong-copyleft, non-copyleft, or somewhere in between?
Studies on open source software (OSS) have shown that the license under which an OSS is released has an impact on the success or failure of the software. In this paper, we model the relationship between an OSS developer's utility, the effort that goes ...
Open Source Developer Layer Assessment: Open Onion
Open Source developers play fundamental determinant role in the life of any open source project. This paper investigates developer motivation in contributing tirelessly to an open source project. Open source Onions were investigated and the developer ...
Comments