research-article

Open Source License Inconsistencies on GitHub

Authors:

Thomas Wolter,

Ann Barcomb,

Dirk Riehle,

Nikolay HarutyunyanAuthors Info & Claims

ACM Transactions on Software Engineering and Methodology, Volume 32, Issue 5

Article No.: 110, Pages 1 - 23

https://doi.org/10.1145/3571852

Published: 22 July 2023 Publication History

Get Access

Abstract

Almost all software, open or closed, builds on open source software and therefore needs to comply with the license obligations of the open source code. Not knowing which licenses to comply with poses a legal danger to anyone using open source software. This article investigates the extent of inconsistencies between licenses declared by an open source project at the top level of the repository and the licenses found in the code. We analyzed a sample of 1,000 open source GitHub repositories. We find that about half of the repositories did not fully declare all licenses found in the code. Of these, approximately 10% represented a permissive vs. copyleft license mismatch. Furthermore, existing tools cannot fully identify licences. We conclude that users of open source code should not just look at the declared licenses of the open source code they intend to use, but rather examine the software to understand its actual licenses.

Appendix

A Statistics

Table A.1.

	statistic	p-value
size	185.1912	0.0000
stargazers_count	84.9197	0.0000
subscribers_count	101.1065	0.0000
forks_count	103.7719	0.0000
open_issues_count	133.4003	0.0000
fork	10.3001	0.0162
has_issues	6.4375	0.0922
has_projects	11.2455	0.0105
has_downloads	1.2762	0.7348
has_wiki	19.1209	0.0003
has_pages	15.0404	0.0018

Table A.1. Kruskal-Wallis Correlation calculations for Nomos

Table A.2.

	statistic	p-value
size	178.1689	0.0000
stargazers_count	90.8613	0.0000
subscribers_count	106.4716	0.0000
forks_count	112.1345	0.0000
open_issues_count	141.2797	0.0000
fork	3.35069	0.34065
has_issues	3.87766	0.27498
has_projects	12.22392	0.00665
has_downloads	1.03154	0.79362
has_wiki	19.73876	0.00019
has_pages	9.92883	0.01918

Table A.2. Kruskal-Wallis Correlation calculations for ScanCode

Table A.3.

	statistic	p-value
size	200.8115	0.0000
stargazers_count	87.3538	0.0000
subscribers_count	98.3808	0.0000
forks_count	107.8177	0.0000
open_issues_count	133.2991	0.0000
fork	10.5591	0.0144
has_issues	6.587	0.0863
has_projects	11.4895	0.0094
has_downloads	1.3913	0.7076
has_wiki	19.0357	0.0003
has_pages	14.7405	0.0021

Table A.3. Kruskal-Wallis Correlation calculations for hybrid

References

[1]

Pär J. Ågerfalk and Brian Fitzgerald. 2008. Outsourcing to an unknown workforce: Exploring opensurcing as a global sourcing strategy. MIS Quarterly 32, 2 (2008), 385–409.

Abstract

A Statistics

References

Cited By

Index Terms

Recommendations

LiResolver: License Incompatibility Resolution for Open Source Software

Choosing an Open Source License

Open source license alternatives for software applications: is it a solution to stop software piracy?

Comments

Information

Published In

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Full Text

Share

Share this Publication link

Share on social media

Affiliations