ABSTRACT
In this data paper we describe a data set obtained by means of performing an on-line survey to over 2,000 Free Libre Open Source Software (FLOSS) contributors. The survey includes questions related to personal characteristics (gender, age, civil status, nationality, etc.), education and level of English, professional status, dedication to FLOSS projects, reasons and motivations, involvement and goals. We describe as well the possibilities and challenges of using private information from the survey when linked with other, publicly available data sources. In this regard, an example of data sharing will be presented and legal, ethical and technical issues will be discussed.
- J. Bethlehem. How accurate are self-selection web surveys? Technical Report Discussion paper (08014), Statistics Netherlands, 2008.Google Scholar
- P. A. David, A. Waterman, and S. Arora. FLOSS-US: The Free/Libre/Open Source Software Survey for 2003. Technical report, Stanford Institute for Economic and Policy Research, Stanford, USA, 2003.Google Scholar
- R. A. Ghosh, G. Robles, and R. Glott. Software source code survey (free/libre and open source software: Survey and study). Technical report, Univ. of Maastricht, The Netherlands, June 2002. http://www.flossproject.org/report.Google Scholar
- N. Li, T. Li, and S. Venkatasubramanian. t-closeness: Privacy beyond k-anonymity and l-diversity. In ICDE, volume 7, pages 106–115, 2007.Google ScholarCross Ref
- A. Machanavajjhala, D. Kifer, J. Gehrke, and M. Venkitasubramaniam. l-diversity: Privacy beyond k-anonymity. ACM Transactions on Knowledge Discovery from Data (TKDD), 1(1):3, 2007. Google ScholarDigital Library
- P. Ohm. Broken promises of privacy: Responding to the surprising failure of anonymization. UCLA Law Review, 57(6), 2010.Google Scholar
- P. Puhani. The Heckman correction for sample selection and its critique. Journal of economic surveys, 14(1):53–68, 2000.Google Scholar
- H. Shimizu, J. Iio, and K. Hiyane. The realities of Free/Libre/Open Source Software developers in Japan and Asia. First Monday, 9(11), 2004.Google Scholar
- L. Sweeney. k-anonymity: A model for protecting privacy. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 10(05):557–570, 2002. Google ScholarDigital Library
- B. Vasilescu, A. Capiluppi, and A. Serebrenik. Gender, representation and online participation: A quantitative study. Interacting with Computers, page iwt047, 2013.Google Scholar
- B. Vasilescu, V. Filkov, and A. Serebrenik. StackOverflow and GitHub: Associations between software development and crowdsourced knowledge. In Proceedings 2013 Intl Conf on Social Computing, pages 188–195. IEEE, 2013. Google ScholarDigital Library
- H.-Y. Wang and Y.-S. Wang. Gender differences in the perception and acceptance of online games. British Journal of Educational Tech, 39(5):787–806, 2008.Google Scholar
Index Terms
- FLOSS 2013: a survey dataset about free software contributors: challenges for curating, sharing, and combining
Recommendations
The Debsources Dataset: two decades of free and open source software
We present the Debsources Dataset: source code and related metadata spanning two decades of Free and Open Source Software (FOSS) history, seen through the lens of the Debian distribution. The dataset spans more than 3 billion lines of source code as ...
Open source license alternatives for software applications: is it a solution to stop software piracy?
ACM-SE 43: Proceedings of the 43rd annual Southeast regional conference - Volume 2The open source movement has introduced a wealth of software applications that may challenge commercial applications in ease of use, features, and speed. Typically open source applications are available "free-of-charge", but the potential for hidden ...
Do software developers understand open source licenses?
ICPC '17: Proceedings of the 25th International Conference on Program ComprehensionSoftware provided under open source licenses is widely used, from forming high-profile stand-alone applications (e.g., Mozilla Firefox) to being embedded in commercial offerings (e.g., network routers). Despite the high frequency of use of open source ...
Comments