skip to main content
10.1145/2597073.2597081acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
Article

Process mining multiple repositories for software defect resolution from control and organizational perspective

Published:31 May 2014Publication History

ABSTRACT

Issue reporting and resolution is a software engineering process supported by tools such as Issue Tracking System (ITS), Peer Code Review (PCR) system and Version Control System (VCS). Several open source software projects such as Google Chromium and Android follow process in which a defect or feature enhancement request is reported to an issue tracker followed by source-code change or patch review and patch commit using a version control system. We present an application of process mining three software repositories (ITS, PCR and VCS) from control flow and organizational perspective for effective process management. ITS, PCR and VCS are not explicitly linked so we implement regular expression based heuristics to integrate data from three repositories for Google Chromium project. We define activities such as bug reporting, bug fixing, bug verification, patch submission, patch review, and source code commit and create an event log of the bug resolution process. The extracted event log contains audit trail data such as caseID, timestamp, activity name and performer. We discover runtime process model for bug resolution process spanning three repositories using process mining tool, Disco, and conduct process performance and efficiency analysis. We identify bottlenecks, define and detect basic and composite anti-patterns. In addition to control flow analysis, we mine event log to perform organizational analysis and discover metrics such as handover of work, subcontracting, joint cases and joint activities.

References

  1. Burcu Akman and O Demirors. Applicability of process discovery algorithms for software organizations. In Software Engineering and Advanced Applications, 2009. SEAA’09. 35th Euromicro Conference on, pages 195–202. IEEE, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Andrew Dittrich, Mehmet Hadi Gunes, and Sergiu Dascalu. Network analysis of software repositories: Identifying subject matter experts. In Complex Networks, pages 187–198. Springer, 2013.Google ScholarGoogle Scholar
  3. Rami-Habib Eid-Sabbagh, Remco Dijkman, and Mathias Weske. Business process architecture: use and correctness. In Business Process Management, pages 65–81. Springer, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Monika Gupta and Ashish Sureka. Nirikshan: Mining bug report history for discovering process maps, inefficiencies and inconsistencies. In Proceedings of the 7th India Software Engineering Conference. ACM, 2014.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Kazuki Hamasaki, Raula Gaikovina Kula, Norihiro Yoshida, AE Cruz, Kenji Fujiwara, and Hajimu Iida. Who does what during a code review? datasets of oss peer review repositories. In Proceedings of the Tenth International Workshop on Mining Software Repositories, pages 49–52. IEEE Press, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Kwanghoon Pio Kim. Mining workflow processes from distributed workflow enactment event logs. Knowledge Management & E-Learning: An International Journal (KM&EL), 4(4):528–&EL), 4(4):528––553, 2013.Google ScholarGoogle Scholar
  7. Ekkart Kindler, Vladimir Rubin, and Wilhelm Schäfer. Activity mining for discovering software process models. Software Engineering, 79:175–180, 2006.Google ScholarGoogle Scholar
  8. Patrick Knab, Martin Pinzger, and Harald C Gall. Visual patterns in issue tracking data. In New Modeling Concepts for Today’s Software Processes, pages 222–233. Springer, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Andrew Meneely, Mackenzie Corcoran, and Laurie Williams. Improving developer activity metrics with issue tracking annotations. In Proceedings of the 2010 ICSE Workshop on Emerging Trends in Software Metrics, pages 75–80. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Wouter Poncin, Alexander Serebrenik, and Mark van den Brand. Process mining software repositories. In Software Maintenance and Reengineering (CSMR), 2011 15th European Conference on, pages 5–14. IEEE, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Anita Sarma, Larry Maccherone, Patrick Wagstrom, and James Herbsleb. Tesseract: Interactive visual exploration of socio-technical relationships in software development. In Software Engineering, 2009. ICSE 2009. IEEE 31st International Conference on, pages 23–33. IEEE, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Jinliang Song, Tiejian Luo, and Su Chen. Behavior pattern mining: Apply process mining technology to common event logs of information systems. In Networking, Sensing and Control, 2008. ICNSC 2008. IEEE International Conference on, pages 1800–1805. IEEE, 2008.Google ScholarGoogle Scholar
  13. Wikan Sunindyo, Thomas Moser, Dietmar Winkler, and Deepak Dhungana. Improving open source software process quality based on defect data mining. In Software Quality. Process Automation in Software Development, pages 84–102. Springer, 2012.Google ScholarGoogle Scholar
  14. Ashish Sureka, Atul Goyal, and Ayushi Rastogi. Using social network analysis for mining collaboration data in a defect tracking system for risk and vulnerability analysis. In Proceedings of the 4th India Software Engineering Conference, pages 195–204. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Wil MP Van Der Aalst, Hajo A Reijers, and Minseok Song. Discovering social networks from event logs. Computer Supported Cooperative Work (CSCW), 14(6):549–593, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Wil MP van der Aalst, Hajo A Reijers, Anton JMM Weijters, Boudewijn F van Dongen, AK Alves de Medeiros, Minseok Song, and HMW Verbeek. Business process mining: An industrial application. Information Systems, 32(5):713–732, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Process mining multiple repositories for software defect resolution from control and organizational perspective

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader