DOI: 10.1145/3034950.3034957
research-article

Success or Failure Identification for GitHub's Open Source Projects

Published: 14 January 2017

ABSTRACT

In this research we try to identify successful and unsuccessful projects on GitHub from a sample of 5000 randomly picked projects in five randomly selected languages (Java, PHP, JavaScript, C#/C++, HTML). We selected 1000 projects for each of these languages through the publicly available GitHub API, refined our dataset, and applied different machine learning algorithms to it. We first ran numerous queries against the dataset and found meaningful relationships and correlations between some of the fetched attributes that affect the popularity of these projects. Later, we could develop an application that determines the success or failure of a specific open source project.
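The abstract does not say which API endpoint, attributes, or statistics were used, so the following is only a minimal sketch of the two steps it describes: querying the GitHub API per language and checking correlations between fetched attributes. It assumes the public repository search endpoint and the `stargazers_count`, `forks_count`, and `open_issues_count` fields of the repository JSON; the helper names (`search_url`, `pearson`, `column`) and the toy records are hypothetical and are not data from the paper. The search endpoint returns ranked rather than random results, so the random sampling mentioned in the abstract would need an extra step that is not shown here.

```python
from math import sqrt
from urllib.parse import urlencode

# GitHub REST search endpoint; the query built below is an assumption,
# not the query used by the authors.
SEARCH_ENDPOINT = "https://api.github.com/search/repositories"


def search_url(language: str, page: int = 1, per_page: int = 100) -> str:
    """Build the URL for one page of repositories written in `language`."""
    params = {"q": f"language:{language}", "per_page": per_page, "page": page}
    return f"{SEARCH_ENDPOINT}?{urlencode(params)}"


def pearson(xs, ys) -> float:
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


def column(records, key):
    """Extract one attribute from a list of repository records."""
    return [record[key] for record in records]


# Synthetic toy records for illustration only (NOT from the paper's dataset).
TOY_REPOS = [
    {"stargazers_count": 5, "forks_count": 1, "open_issues_count": 0},
    {"stargazers_count": 40, "forks_count": 9, "open_issues_count": 3},
    {"stargazers_count": 120, "forks_count": 30, "open_issues_count": 12},
    {"stargazers_count": 900, "forks_count": 210, "open_issues_count": 45},
    {"stargazers_count": 15, "forks_count": 2, "open_issues_count": 7},
    {"stargazers_count": 300, "forks_count": 55, "open_issues_count": 20},
]

if __name__ == "__main__":
    # Ten pages of 100 results would cover 1000 repositories per language.
    print(search_url("java", page=1))
    stars = column(TOY_REPOS, "stargazers_count")
    for key in ("forks_count", "open_issues_count"):
        print(key, round(pearson(stars, column(TOY_REPOS, key)), 3))
```

Treating the star count as the popularity signal is a design choice of this sketch; any other fetched attribute could be passed to `pearson` in the same way.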

  • Published in

    ICMSS '17: Proceedings of the 2017 International Conference on Management Engineering, Software Engineering and Service Sciences
    January 2017
    339 pages
    ISBN: 9781450348348
    DOI: 10.1145/3034950

    Copyright © 2017 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Qualifiers

    • research-article
    • Research
    • Refereed limited
