skip to main content
10.1145/3106426.3106480acmconferencesArticle/Chapter ViewAbstractPublication PageswiConference Proceedingsconference-collections
research-article

Collaboration strength metrics and analyses on GitHub

Published: 23 August 2017 Publication History

Abstract

We perform social analyses over an important community: the open code collaboration network. Specifically, we study the correlation among features that measure the strength of social coding collaboration on GitHub - a Web-based source code repository that can be modeled as a social coding network. We also make publicly available a curated dataset called GitSED, GitHub Socially Enhanced Dataset. Our results have many practical applications such as to improve the recommendation of developers, the evaluation of team formation and existing analysis algorithms.

References

[1]
Lada A. Adamic and Eytan Adar. 2003. Friends and neighbors on the Web. Social Networks 25, 3 (2003), 211 -- 230.
[2]
Luca Maria Aiello, Rossano Schifanella, and Bogdan State. 2014. Reading the source code of social ties. In WebSci. Bloomington, IN, USA, 139--148.
[3]
Gabriela B. Alves et al. 2016. The Strength of Social Coding Collaboration on GitHub. In SBBD. Salvador, Brazil, 247--252.
[4]
Albert-László Barabási and Réka Albert. 1999. Emergence of scaling in random networks. Science 286, 5439 (1999), 509--512.
[5]
Jacob G. Barnett et al. 2016. The Relationship Between Commit Message Detail and Defect Proneness in Java Projects on GitHub. In MSR. Austin, USA, 496--499.
[6]
Roman Bartusiak et al. 2016. Cooperation prediction in GitHub developers network with restricted Boltzmann machine. In ACIIDS. Vietnam, 96--107.
[7]
Nicolas Bettenburg and Ahmed E Hassan. 2010. Studying the impact of social structures on software quality. In ICPC. Braga, Portugal, 124--133.
[8]
Michele A. Brandão and Mirella M. Moro. 2015. Analyzing the Strength of Co-authorship Ties with Neighborhood Overlap. In DEXA. Valencia, Spain, 527--542.
[9]
Michele A. Brandão and Mirella M. Moro. 2017. Social professional networks: A survey and taxonomy. Computer Communications 100 (2017), 20--31.
[10]
Michele A Brandão and Mirella M Moro. 2017. The strength of co-authorship ties through different topological properties. J. Braz. Comp. Society 23, 1 (2017), 5.
[11]
Casey Casalnuovo et al. 2015. Developer Onboarding in GitHub: The Role of Prior Social Links and Language Experience. In ESEC/FSE. Bergamo, Italy, 817--828.
[12]
Valerio Cosentino, Javier Luis, and Jordi Cabot. 2016. Findings from github: Methods, datasets and limitations. In MSR. Austin, USA, 137--141.
[13]
Laura Dabbish et al. 2012. Social coding in GitHub: transparency and collaboration in an open software repository. In CSCW. Seattle, WA, USA, 1277--1286.
[14]
David Easley and Jon Kleinberg. 2010. Networks, crowds, and markets: Reasoning about a highly connected world. Cambridge University Press.
[15]
G. Gousios. 2013. The GHTorrent dataset and tool suite. In MSR. San Francisco, USA, 233--236.
[16]
M. S. Granovetter. 1973. The strength of weak ties. The American Journal of Sociology 78, 6 (1973), 1360--1380.
[17]
Abigail Z. Jacobs et al. 2015. Assembling the facebook: Using Heterogeneity to Understand Online Social Network Assembly. In WebSci. Oxford, UK, 18:1--18:10.
[18]
Oskar Jarczyk et al. 2014. On the Effectiveness of Emergent Task Allocation of Virtual Programmer Teams. In IEEE/WIC/ACM WI. Warsaw, Poland, 369--376.
[19]
Mary Beth Kery, Claire Le Goues, and Brad A. Myers. 2016. Examining Programmer Practices for Locally Handling Exceptions. In MSR. Austin, USA, 484--487.
[20]
David Liben-Nowell and Jon Kleinberg. 2003. The Link Prediction Problem for Social Networks. In CIKM. New Orleans, USA, 556--559.
[21]
Pablo Loyola and In-Young Ko. 2012. Biological Mutualistic Models Applied to Study Open Source Software Development. In IEEE/WIC/ACM WI, Vol. 1. Macau, China, 248--253.
[22]
Radoslaw Nielek et al. 2016. Choose a Job You Love: Predicting Choices of GitHub Developers. In IEEE/WIC/ACM WI. Omaha, NE, USA, 200--207.
[23]
Rohan Padhye, Senthil Mani, and Vibha Singhal Sinha. 2014. A study of external community contribution to open-source projects on GitHub. In MSR. Hyderabad, India, 332--335.
[24]
Jason Tsay, Laura Dabbish, and James Herbsleb. 2014. Influence of Social and Technical Factors for Evaluating Contribution in GitHub. In ICSE. Hyderabad, India, 356--366.
[25]
Bogdan Vasilescu, Alexander Serebrenik, and Vladimir Filkov. 2015. A Data Set for Social Diversity Studies of GitHub Teams. In MSR. Florence, Italy, 514--517.
[26]
Katrin Weller and Katharina E. Kinder-Kurlanda. 2016. A manifesto for data sharing in social media research. In WebSci. Hannover, Germany, 166--172.
[27]
Morteza Zihayat, Mehdi Kargar, and Aijun An. 2014. Two-phase pareto set discovery for team formation in social networks. In IEEE/WIC/ACM WI, Vol. 2. Warsaw, Poland, 304--311.

Cited By

View all
  • (2024)The vital role of community in open source software developmentJournal of Software: Evolution and Process10.1002/smr.264336:7Online publication date: 14-Jul-2024
  • (2023)Understanding the Data Needs for Developing a Computational Model of Team Dynamics2023 IEEE Frontiers in Education Conference (FIE)10.1109/FIE58773.2023.10343263(1-9)Online publication date: 18-Oct-2023
  • (2023)Relationships Between Social Interactions and Belbin Role Types in Collaborative Agile TeamsIEEE Access10.1109/ACCESS.2023.324532511(17002-17020)Online publication date: 2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WI '17: Proceedings of the International Conference on Web Intelligence
August 2017
1284 pages
ISBN:9781450349512
DOI:10.1145/3106426
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 August 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. online cooperative work
  2. social network analysis
  3. tie strength

Qualifiers

  • Research-article

Funding Sources

Conference

WI '17
Sponsor:

Acceptance Rates

WI '17 Paper Acceptance Rate 118 of 178 submissions, 66%;
Overall Acceptance Rate 118 of 178 submissions, 66%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)19
  • Downloads (Last 6 weeks)2
Reflects downloads up to 30 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)The vital role of community in open source software developmentJournal of Software: Evolution and Process10.1002/smr.264336:7Online publication date: 14-Jul-2024
  • (2023)Understanding the Data Needs for Developing a Computational Model of Team Dynamics2023 IEEE Frontiers in Education Conference (FIE)10.1109/FIE58773.2023.10343263(1-9)Online publication date: 18-Oct-2023
  • (2023)Relationships Between Social Interactions and Belbin Role Types in Collaborative Agile TeamsIEEE Access10.1109/ACCESS.2023.324532511(17002-17020)Online publication date: 2023
  • (2023)A community detection approach based on network representation learning for repository miningExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.120597231:COnline publication date: 30-Nov-2023
  • (2022)Open Source Software Development ChallengesResearch Anthology on Agile Software, Software Development, and Testing10.4018/978-1-6684-3702-5.ch102(2134-2164)Online publication date: 2022
  • (2022)A systematic process for Mining Software RepositoriesInformation and Software Technology10.1016/j.infsof.2021.106791144:COnline publication date: 9-May-2022
  • (2022)How do developers collaborate? Investigating GitHub heterogeneous networksSoftware Quality Journal10.1007/s11219-022-09598-x31:1(211-241)Online publication date: 7-Sep-2022
  • (2021)Open Source Software Development ChallengesResearch Anthology on Usage and Development of Open Source Software10.4018/978-1-7998-9158-1.ch003(33-62)Online publication date: 2021
  • (2020)Data from Multiple Web Sources: Crawling, Integrating, Preprocessing, and Designing ApplicationsSpecial Topics in Multimedia, IoT and Web Technologies10.1007/978-3-030-35102-1_8(213-242)Online publication date: 3-Mar-2020
  • (2018)Dealing with Data from Multiple Web SourcesProceedings of the 24th Brazilian Symposium on Multimedia and the Web10.1145/3243082.3264609(3-6)Online publication date: 16-Oct-2018
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media