skip to main content
10.1145/2597073.2597081acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
Article

Process mining multiple repositories for software defect resolution from control and organizational perspective

Published: 31 May 2014 Publication History

Abstract

Issue reporting and resolution is a software engineering process supported by tools such as Issue Tracking System (ITS), Peer Code Review (PCR) system and Version Control System (VCS). Several open source software projects such as Google Chromium and Android follow process in which a defect or feature enhancement request is reported to an issue tracker followed by source-code change or patch review and patch commit using a version control system. We present an application of process mining three software repositories (ITS, PCR and VCS) from control flow and organizational perspective for effective process management. ITS, PCR and VCS are not explicitly linked so we implement regular expression based heuristics to integrate data from three repositories for Google Chromium project. We define activities such as bug reporting, bug fixing, bug verification, patch submission, patch review, and source code commit and create an event log of the bug resolution process. The extracted event log contains audit trail data such as caseID, timestamp, activity name and performer. We discover runtime process model for bug resolution process spanning three repositories using process mining tool, Disco, and conduct process performance and efficiency analysis. We identify bottlenecks, define and detect basic and composite anti-patterns. In addition to control flow analysis, we mine event log to perform organizational analysis and discover metrics such as handover of work, subcontracting, joint cases and joint activities.

References

[1]
Burcu Akman and O Demirors. Applicability of process discovery algorithms for software organizations. In Software Engineering and Advanced Applications, 2009. SEAA’09. 35th Euromicro Conference on, pages 195–202. IEEE, 2009.
[2]
Andrew Dittrich, Mehmet Hadi Gunes, and Sergiu Dascalu. Network analysis of software repositories: Identifying subject matter experts. In Complex Networks, pages 187–198. Springer, 2013.
[3]
Rami-Habib Eid-Sabbagh, Remco Dijkman, and Mathias Weske. Business process architecture: use and correctness. In Business Process Management, pages 65–81. Springer, 2012.
[4]
Monika Gupta and Ashish Sureka. Nirikshan: Mining bug report history for discovering process maps, inefficiencies and inconsistencies. In Proceedings of the 7th India Software Engineering Conference. ACM, 2014.
[5]
Kazuki Hamasaki, Raula Gaikovina Kula, Norihiro Yoshida, AE Cruz, Kenji Fujiwara, and Hajimu Iida. Who does what during a code review? datasets of oss peer review repositories. In Proceedings of the Tenth International Workshop on Mining Software Repositories, pages 49–52. IEEE Press, 2013.
[6]
Kwanghoon Pio Kim. Mining workflow processes from distributed workflow enactment event logs. Knowledge Management & E-Learning: An International Journal (KM&EL), 4(4):528–&EL), 4(4):528––553, 2013.
[7]
Ekkart Kindler, Vladimir Rubin, and Wilhelm Schäfer. Activity mining for discovering software process models. Software Engineering, 79:175–180, 2006.
[8]
Patrick Knab, Martin Pinzger, and Harald C Gall. Visual patterns in issue tracking data. In New Modeling Concepts for Today’s Software Processes, pages 222–233. Springer, 2010.
[9]
Andrew Meneely, Mackenzie Corcoran, and Laurie Williams. Improving developer activity metrics with issue tracking annotations. In Proceedings of the 2010 ICSE Workshop on Emerging Trends in Software Metrics, pages 75–80. ACM, 2010.
[10]
Wouter Poncin, Alexander Serebrenik, and Mark van den Brand. Process mining software repositories. In Software Maintenance and Reengineering (CSMR), 2011 15th European Conference on, pages 5–14. IEEE, 2011.
[11]
Anita Sarma, Larry Maccherone, Patrick Wagstrom, and James Herbsleb. Tesseract: Interactive visual exploration of socio-technical relationships in software development. In Software Engineering, 2009. ICSE 2009. IEEE 31st International Conference on, pages 23–33. IEEE, 2009.
[12]
Jinliang Song, Tiejian Luo, and Su Chen. Behavior pattern mining: Apply process mining technology to common event logs of information systems. In Networking, Sensing and Control, 2008. ICNSC 2008. IEEE International Conference on, pages 1800–1805. IEEE, 2008.
[13]
Wikan Sunindyo, Thomas Moser, Dietmar Winkler, and Deepak Dhungana. Improving open source software process quality based on defect data mining. In Software Quality. Process Automation in Software Development, pages 84–102. Springer, 2012.
[14]
Ashish Sureka, Atul Goyal, and Ayushi Rastogi. Using social network analysis for mining collaboration data in a defect tracking system for risk and vulnerability analysis. In Proceedings of the 4th India Software Engineering Conference, pages 195–204. ACM, 2011.
[15]
Wil MP Van Der Aalst, Hajo A Reijers, and Minseok Song. Discovering social networks from event logs. Computer Supported Cooperative Work (CSCW), 14(6):549–593, 2005.
[16]
Wil MP van der Aalst, Hajo A Reijers, Anton JMM Weijters, Boudewijn F van Dongen, AK Alves de Medeiros, Minseok Song, and HMW Verbeek. Business process mining: An industrial application. Information Systems, 32(5):713–732, 2007.

Cited By

View all
  • (2024)Data-Driven Identification and Analysis of Waiting Times in Business ProcessesBusiness & Information Systems Engineering10.1007/s12599-024-00868-5Online publication date: 15-May-2024
  • (2024)Teaching Mining Software RepositoriesHandbook on Teaching Empirical Software Engineering10.1007/978-3-031-71769-7_12(325-362)Online publication date: 25-Dec-2024
  • (2023)Advancing Process Audits With Process Mining: A Systematic Review of Trends, Challenges, and OpportunitiesIEEE Access10.1109/ACCESS.2023.329211711(68340-68357)Online publication date: 2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MSR 2014: Proceedings of the 11th Working Conference on Mining Software Repositories
May 2014
427 pages
ISBN:9781450328630
DOI:10.1145/2597073
  • General Chair:
  • Premkumar Devanbu,
  • Program Chairs:
  • Sung Kim,
  • Martin Pinzger
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

  • TCSE: IEEE Computer Society's Tech. Council on Software Engin.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 May 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Empirical Software Engineering and Measurements
  2. Issue Tracking System
  3. Peer Code Review System
  4. Process Mining
  5. Social Network Analysis
  6. Software Maintenance

Qualifiers

  • Article

Conference

ICSE '14
Sponsor:

Upcoming Conference

ICSE 2025

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)26
  • Downloads (Last 6 weeks)4
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Data-Driven Identification and Analysis of Waiting Times in Business ProcessesBusiness & Information Systems Engineering10.1007/s12599-024-00868-5Online publication date: 15-May-2024
  • (2024)Teaching Mining Software RepositoriesHandbook on Teaching Empirical Software Engineering10.1007/978-3-031-71769-7_12(325-362)Online publication date: 25-Dec-2024
  • (2023)Advancing Process Audits With Process Mining: A Systematic Review of Trends, Challenges, and OpportunitiesIEEE Access10.1109/ACCESS.2023.329211711(68340-68357)Online publication date: 2023
  • (2023)Mining Repository for Module Reuse: A Machine Learning-Based ApproachInternational Conference on IoT, Intelligent Computing and Security10.1007/978-981-19-8136-4_6(71-81)Online publication date: 2-Apr-2023
  • (2023)Finding Behavioral Indicators from Contextualized Commits in Software Engineering Courses with Process MiningFrontiers in Software Engineering Education10.1007/978-3-031-48639-5_5(56-68)Online publication date: 1-Dec-2023
  • (2022)Process Mining Model Integrated with Control Flow, Case, Organizational and Time Perspectives in a Software Development Project2022 10th International Conference in Software Engineering Research and Innovation (CONISOFT)10.1109/CONISOFT55708.2022.00022(92-101)Online publication date: Oct-2022
  • (2022)Taxonomy of bug tracking process smellsInformation and Software Technology10.1016/j.infsof.2022.106972150:COnline publication date: 4-Aug-2022
  • (2022)Towards a taxonomy of code review smellsInformation and Software Technology10.1016/j.infsof.2021.106737142:COnline publication date: 1-Feb-2022
  • (2021)A Systematic Mapping Study on Analysis of Code RepositoriesInformatica10.15388/21-INFOR454(1-42)Online publication date: 2-Jun-2021
  • (2021)Towards a Taxonomy of Bug Tracking Process Smells: A Quantitative Analysis2021 47th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)10.1109/SEAA53835.2021.00026(138-147)Online publication date: Sep-2021
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media