DOI: 10.1145/2901739.2903502
short-paper

Multi-extract and multi-level dataset of Mozilla issue tracking history

Published: 14 May 2016 Publication History

Abstract

Many studies analyze issue tracking repositories to understand and support software development. To facilitate the analyses, we share a Mozilla issue tracking dataset covering a multi-year history. The dataset includes three extracts and multiple levels for each extract. The three extracts were retrieved through two channels, a front-end (web user interface (UI)) and a back-end (official database dump) of Mozilla Bugzilla, at three different times. The variations (dynamics) among extracts provide space for researchers to reproduce and validate their studies, while revealing potential opportunities for studies that otherwise could not be conducted. We provide different data levels for each extract, ranging from raw data to standardized data as well as to the calculated data level for targeting specific research questions. Data retrieving and processing scripts related to each data level are offered too. By employing the multi-level structure, analysts can more efficiently start an inquiry from the standardized level and easily trace the data chain when necessary (e.g., to verify if a phenomenon reflected by the data is an actual event). We applied this dataset to several published studies and intend to expand the multi-level and multi-extract feature to other software engineering datasets.

Published In

MSR '16: Proceedings of the 13th International Conference on Mining Software Repositories
May 2016
544 pages
ISBN:9781450341868
DOI:10.1145/2901739
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

ICSE '16

Cited By

  • (2020) Standing on shoulders or feet? An extended study on the usage of the MSR data papers. Empirical Software Engineering. DOI: 10.1007/s10664-020-09834-7. Online publication date: 18-Jul-2020
  • (2019) From Reports to Bug-Fix Commits. Proceedings of the Fifteenth International Conference on Predictive Models and Data Analytics in Software Engineering, pages 80-89. DOI: 10.1145/3345629.3345639. Online publication date: 18-Sep-2019
  • (2019) Reducing the workload of the Linux kernel maintainers: multiple-committer model. Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, pages 1205-1207. DOI: 10.1145/3338906.3342490. Online publication date: 12-Aug-2019
  • (2019) Standing on shoulders or feet? Proceedings of the 16th International Conference on Mining Software Repositories, pages 565-576. DOI: 10.1109/MSR.2019.00085. Online publication date: 26-May-2019
  • (2018) A multi-level dataset of Linux kernel patchwork. Proceedings of the 15th International Conference on Mining Software Repositories, pages 54-57. DOI: 10.1145/3196398.3196475. Online publication date: 28-May-2018
  • (2017) On the scalability of Linux kernel maintainers' work. Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering, pages 27-37. DOI: 10.1145/3106237.3106287. Online publication date: 21-Aug-2017
