research-article

Effective Recommendation of Cross-Project Correlated Issues based on Issue Metrics

Authors:

Changhai NieAuthors Info & Claims

Internetware '23: Proceedings of the 14th Asia-Pacific Symposium on Internetware

Page 1

https://doi.org/10.1145/3609437.3609462

Published: 05 October 2023 Publication History

Abstract

The calling relationship between projects becomes complicated as the number of open-source projects increases. Different issues across projects can also be related, referred to as cross-project correlated issues (CPCIs), and bring new challenges for developers to fix these issues. When solving these CPCIs, developers have to accurately locate the source code that causes it in the current project and also needs to know the related issues in other projects. However, few studies have proposed specific methods to help developers effectively address these CPCIs, i.e., find related issues for CPCIs.

This paper proposes a novel issue recommendation model for CPCIs. When developers fix a CPCI, they can find its associated issues based on our model. We first extract 26 issue metrics on CPCIs from four aspects: text similarity, cooperative relationship between developers, developers’ familiarity with the project, and developers’ fixing experience. Then, we utilize three classifiers (SVM, Logistic Regression, and Random Forest) to build CPCI recommendation models. To evaluate the model’s performance, we construct three baseline models based on text features and build experiments in the Python scientific computing software ecosystem, which mainly includes seven open-source software libraries. Moreover, we employ three indicators to measure the experimental results, i.e., MAP, MRR, and Recall-rate@k. The CPCI recommendation models built based on issue features have significantly better experimental results than the baseline models in most cases, which indicates that these issue metrics help recommend CPCIs.

References

[1]

R. Abreu, P. Zoeteweij, and A. J. c. Van Gemund. 2006. An Evaluation of Similarity Coefficients for Software Fault Localization. In 2006 12th Pacific Rim International Symposium on Dependable Computing (PRDC’06). 39–46.

Digital Library

[2]

Rafi Almhana and Marouane Kessentini. 2021. Considering dependencies between bug reports to improve bugs triage. Autom. Softw. Eng. 28, 1 (2021), 1. https://doi.org/10.1007/s10515-020-00279-2

Digital Library

[3]

John Anvik, Lyndon Hiew, and Gail C. Murphy. 2005. Coping with an open bug repository. In OOPSLA Workshop on Eclipse Technology Exchange, Etx 2005, San Diego, California, Usa, October. 35–39.

Digital Library

[4]

Thazin Win Win Aung, Yao Wan, Huan Huo, and Yulei Sui. 2022. Multi-triage: A multi-task learning framework for bug triage. Journal of Systems and Software 184 (2022), 111133. https://doi.org/10.1016/j.jss.2021.111133

Digital Library

[5]

Thazin Win Win Aung, Yao Wan, Huan Huo, and Yulei Sui. 2022. Multi-triage: A multi-task learning framework for bug triage. J. Syst. Softw. 184 (2022), 111133. https://doi.org/10.1016/j.jss.2021.111133

Digital Library

[6]

Umamaheswara Sharma Bhutamapuram and Ravichandra Sadam. 2023. How far does the predictive decision impact the software project? The cost, service time, and failure analysis from a cross-project defect prediction model. J. Syst. Softw. 195 (2023), 111522. https://doi.org/10.1016/j.jss.2022.111522

Digital Library

[7]

Sarah Boslaugh and Paul Watters. 2008. Statistics in a nutshell: A desktop quick reference.

[8]

Yan Cai, Hao Yun, Jinqiu Wang, Lei Qiao, and Jens Palsberg. 2021. Sound and Efficient Concurrency Bug Prediction. In Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (Athens, Greece) (ESEC/FSE 2021). Association for Computing Machinery, New York, NY, USA, 255–267. https://doi.org/10.1145/3468264.3468549

Digital Library

[9]

Alexandre Decan, Tom Mens, and Philippe Grosjean. 2019. An empirical comparison of dependency network evolution in seven software packaging ecosystems. Empirical Software Engineering 24 (2019), 381–416.

Digital Library

[10]

Hui Ding, Wanwangying Ma, Lin Chen, Yuming Zhou, and Baowen Xu. 2017. An Empirical Study on Downstream Workarounds for Cross-Project Bugs. In 24th Asia-Pacific Software Engineering Conference, APSEC 2017, Nanjing, China, December 4-8, 2017, Jian Lv, He Jason Zhang, Mike Hinchey, and Xiao Liu (Eds.). IEEE Computer Society, 318–327. https://doi.org/10.1109/APSEC.2017.38

[11]

Ross B. Girshick. 2015. Fast R-CNN. In 2015 IEEE International Conference on Computer Vision, ICCV 2015, Santiago, Chile, December 7-13, 2015. IEEE Computer Society, 1440–1448. https://doi.org/10.1109/ICCV.2015.169

Digital Library

[12]

Ross B. Girshick, Jeff Donahue, Trevor Darrell, and Jitendra Malik. 2014. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. In 2014 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2014, Columbus, OH, USA, June 23-28, 2014. IEEE Computer Society, 580–587. https://doi.org/10.1109/CVPR.2014.81

Digital Library

[13]

Luiz Gomes, Ricardo da Silva Torres, and Mario Lúcio Côrtes. 2023. BERT- and TF-IDF-based feature extraction for long-lived bug prediction in FLOSS: A comparative study. Information and Software Technology 160 (2023), 107217. https://doi.org/10.1016/j.infsof.2023.107217

Digital Library

[14]

D. Han, C. Zhang, X. Fan, A. Hindle, K. Wong, and E. Stroulia. 2012. Understanding Android Fragmentation with Topic Analysis of Vendor-Specific Bugs. In 2012 19th Working Conference on Reverse Engineering. 83–92.

[15]

Hadi Jahanshahi and Mucahit Cevik. 2022. S-DABT: Schedule and Dependency-aware Bug Triage in open-source bug tracking systems. Inf. Softw. Technol. 151 (2022), 107025. https://doi.org/10.1016/j.infsof.2022.107025

Digital Library

[16]

Lisha Li, Zhilei Ren, Xiaochen Li, Weiqin Zou, and He Jiang. 2018. How Are Issue Units Linked? Empirical Study on the Linking Behavior in GitHub. In 2018 25th Asia-Pacific Software Engineering Conference (APSEC). 386–395. https://doi.org/10.1109/APSEC.2018.00053

[17]

Xinyu Liu, Qi Zhou, Joy Arulraj, and Alessandro Orso. 2022. Automatic Detection of Performance Bugs in Database Systems using Equivalent Queries. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022. ACM, 225–236. https://doi.org/10.1145/3510003.3510093

Digital Library

[18]

Wanwangying Ma, Lin Chen, Xiangyu Zhang, Yang Feng, Zhaogui Xu, Zhifei Chen, Yuming Zhou, and Baowen Xu. 2020. Impact analysis of cross-project bugs on software ecosystems. In ICSE ’20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June - 19 July, 2020, Gregg Rothermel and Doo-Hwan Bae (Eds.). ACM, 100–111. https://doi.org/10.1145/3377811.3380442

Digital Library

[19]

Wanwangying Ma, Lin Chen, Xiangyu Zhang, Yuming Zhou, and Baowen Xu. 2017. How Do Developers Fix Cross-Project Correlated Bugs? A Case Study on the GitHub Scientific Python Ecosystem. In Ieee/acm International Conference on Software Engineering. 381–392.

[20]

Xiangxin Meng, Xu Wang, Hongyu Zhang, Hailong Sun, and Xudong Liu. 2022. Improving Fault Localization and Program Repair with Deep Semantic Features and Transferred Knowledge. In Proceedings of the 44th International Conference on Software Engineering (Pittsburgh, Pennsylvania) (ICSE ’22). Association for Computing Machinery, New York, NY, USA, 1169–1180. https://doi.org/10.1145/3510003.3510147

Digital Library

[21]

Md Nadim, Debajyoti Mondal, and Chanchal K. Roy. 2022. Leveraging Structural Properties of Source Code Graphs for Just-in-Time Bug Prediction. Automated Software Engg. 29, 1 (may 2022), 30 pages. https://doi.org/10.1007/s10515-022-00326-0

Digital Library

[22]

Takamune Onishi and Hiromitsu Shiina. 2020. Distributed Representation Computation Using CBOW Model and Skip-gram Model. In 9th International Congress on Advanced Applied Informatics, IIAI-AAI 2020, Kitakyushu, Japan, September 1-15, 2020, Tokuro Matsuo, Kunihiko Takamatsu, Yuichi Ono, and Sachio Hirokawa (Eds.). IEEE, 845–846. https://doi.org/10.1109/IIAI-AAI50415.2020.00179

[23]

Weifeng Pan, Ming Hua, Zijiang Yang, and Tian Wang. 2022. Comments on "Using $k$k-Core Decomposition on Class Dependency Networks to Improve Bug Prediction Model’s Practical Performance". IEEE Trans. Software Eng. 48, 12 (2022), 5176–5187. https://doi.org/10.1109/TSE.2022.3140599

[24]

Jevgenija Pantiuchina, Fiorella Zampetti, Simone Scalabrino, Valentina Piantadosi, Rocco Oliveto, Gabriele Bavota, and Massimiliano Di Penta. 2020. Why Developers Refactor Source Code: A Mining-based Study. ACM Trans. Softw. Eng. Methodol. 29, 4 (2020), 29:1–29:30. https://doi.org/10.1145/3408302

Digital Library

[25]

Fayola Peters, Thein Than Tun, Yijun Yu, and Bashar Nuseibeh. 2019. Text Filtering and Ranking for Security Bug Report Prediction. IEEE Trans. Software Eng. 45, 6 (2019), 615–631. https://doi.org/10.1109/TSE.2017.2787653

[26]

Yu Qu, Qinghua Zheng, Jianlei Chi, Yangxu Jin, Ancheng He, Di Cui, Hengshan Zhang, and Ting Liu. 2021. Using K-core Decomposition on Class Dependency Networks to Improve Bug Prediction Model’s Practical Performance. IEEE Trans. Software Eng. 47, 2 (2021), 348–366. https://doi.org/10.1109/TSE.2019.2892959

Digital Library

[27]

Stephen Robertson, Hugo Zaragoza, and Michael Taylor. 2004. Simple BM25 Extension to Multiple Weighted Fields. In Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management (Washington, D.C., USA) (CIKM ’04). ACM, New York, NY, USA, 42–49.

Digital Library

[28]

Henrique Rocha, Marco Tulio Valente, Humberto Marques-Neto, and Gail C. Murphy. 2016. An Empirical Study on Recommendations of Similar Bugs. In IEEE International Conference on Software Analysis, Evolution, and Reengineering. 46–56.

[29]

Korosh Koochekian Sabor, Mohammad Hamdaqa, and Abdelwahab Hamou-Lhadj. 2020. Automatic prediction of the severity of bugs using stack traces and categorical features. Inf. Softw. Technol. 123 (2020), 106205. https://doi.org/10.1016/j.infsof.2019.106205

[30]

Chengnian Sun, David Lo, Siau-Cheng Khoo, and Jing Jiang. 2011. Towards more accurate retrieval of duplicate bug reports. In 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011), Lawrence, KS, USA, November 6-10, 2011, Perry Alexander, Corina S. Pasareanu, and John G. Hosking (Eds.). IEEE Computer Society, 253–262. https://doi.org/10.1109/ASE.2011.6100061

Digital Library

[31]

Sadia Tabassum, Leandro L. Minku, and Danyi Feng. 2023. Cross-Project Online Just-In-Time Software Defect Prediction. IEEE Trans. Software Eng. 49, 1 (2023), 268–287. https://doi.org/10.1109/TSE.2022.3150153

[32]

Shin Hwei Tan and Ziqiang Li. 2020. Collaborative bug finding for Android apps. In ICSE ’20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June - 19 July, 2020, Gregg Rothermel and Doo-Hwan Bae (Eds.). ACM, 1335–1347. https://doi.org/10.1145/3377811.3380349

Digital Library

[33]

Dimitrios Tsoukalas, Nikolaos Mittas, Alexandros Chatzigeorgiou, Dionisis D. Kehagias, Apostolos Ampatzoglou, Theodoros Amanatidis, and Lefteris Angelis. 2021. Machine Learning for Technical Debt Identification. IEEE Transactions on Software Engineering (2021), 1–1. https://doi.org/10.1109/TSE.2021.3129355

[34]

Junjie Wang, Ye Yang, Song Wang, Jun Hu, and Qing Wang. 2022. Context- and Fairness-Aware In-Process Crowdworker Recommendation. ACM Trans. Softw. Eng. Methodol. 31, 3, Article 35 (mar 2022), 31 pages. https://doi.org/10.1145/3487571

Digital Library

[35]

Mohammad Wardat, Breno Dantas Cruz, Wei Le, and Hridesh Rajan. 2022. DeepDiagnosis: Automatically Diagnosing Faults and Recommending Actionable Fixes in Deep Learning Programs. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022. ACM, 561–572. https://doi.org/10.1145/3510003.3510071

Digital Library

[36]

Emily Winter, David Bowes, Steve Counsell, Tracy Hall, Saemundur O. Haraldsson, Vesna Nowack, and John R. Woodward. 2023. How do Developers Really Feel About Bug Fixing? Directions for Automatic Program Repair. IEEE Trans. Software Eng. 49, 4 (2023), 1823–1841. https://doi.org/10.1109/TSE.2022.3194188

Digital Library

[37]

Xinli Yang, David Lo, Xin Xia, Lingfeng Bao, and Jianling Sun. 2016. Combining Word Embedding with Information Retrieval to Recommend Similar Bug Reports. In IEEE International Symposium on Software Reliability Engineering. 127–137.

[38]

Yibiao Yang, Yuming Zhou, Hongmin Lu, Lin Chen, Zhenyu Chen, Baowen Xu, Hareton K. N. Leung, and Zhenyu Zhang. 2015. Are Slice-Based Cohesion Metrics Actually Useful in Effort-Aware Post-Release Fault-Proneness Prediction? An Empirical Study. IEEE Trans. Software Eng. 41, 4 (2015), 331–357. https://doi.org/10.1109/TSE.2014.2370048

Digital Library

[39]

Xin Ye, Hui Shen, Xiao Ma, Razvan C. Bunescu, and Chang Liu. 2016. From word embeddings to document similarities for improved information retrieval in software engineering. In Proceedings of the 38th International Conference on Software Engineering, ICSE 2016, Austin, TX, USA, May 14-22, 2016. 404–415. https://doi.org/10.1145/2884781.2884862

Digital Library

[40]

Wen Zhang, Jiangpeng Zhao, Rui Peng, Song Wang, and Ye Yang. 2023. SusRec: An Approach to Sustainable Developer Recommendation for Bug Resolution Using Multimodal Ensemble Learning. IEEE Trans. Reliab. 72, 1 (2023), 61–78. https://doi.org/10.1109/TR.2022.3176733

[41]

Guoliang Zhao, Safwat Hassan, Ying Zou, Derek Truong, and Toby Corbin. 2021. Predicting Performance Anomalies in Software Systems at Run-time. ACM Trans. Softw. Eng. Methodol. 30, 3 (2021), 33:1–33:33. https://doi.org/10.1145/3440757

Digital Library

[42]

J. Zhou, H. Zhang, and D. Lo. 2012. Where should the bugs be fixed? More accurate information retrieval-based bug localization based on bug reports. In 2012 34th International Conference on Software Engineering (ICSE). 14–24. https://doi.org/10.1109/ICSE.2012.6227210

[43]

Yuming Zhou, Baowen Xu, and Hareton Leung. 2010. On the ability of complexity metrics to predict fault-prone classes in object-oriented systems. J. Syst. Softw. 83, 4 (2010), 660–674. https://doi.org/10.1016/j.jss.2009.11.704

Digital Library

[44]

T. Zimmermann, A. Zeller, P. Weissgerber, and S. Diehl. 2005. Mining version histories to guide software changes. IEEE Transactions on Software Engineering 31, 6 (June 2005), 429–445. https://doi.org/10.1109/TSE.2005.72

Digital Library

Index Terms

Effective Recommendation of Cross-Project Correlated Issues based on Issue Metrics
1. Social and professional topics
  1. Professional topics
    1. Management of computing and information systems
      1. Software management
2. Software and its engineering
  1. Software creation and management
    1. Collaboration in software development
      1. Programming teams
  2. Software notations and tools
    1. Software configuration management and version control systems

Index terms have been assigned to the content through auto-classification.

Recommendations

Are Slice-Based Cohesion Metrics Actually Useful in Effort-Aware Post-Release Fault-Proneness Prediction? An Empirical Study
Background. Slice-based cohesion metrics leverage program slices with respect to the output variables of a module to quantify the strength of functional relatedness of the elements within the module. Although slice-based cohesion metrics have been ...
Just‐in‐time identification for cross‐project correlated issues
Abstract
Issue tracking systems are now prevalent in software development, which would help developers submit and discuss issues to solve development problems on software projects. Most previous studies have been conducted to analyze issue relations ...
Empirical analysis of change metrics for software fault prediction
Abstract
A quality assurance activity, known as software fault prediction, can reduce development costs and improve software quality. The objective of this study is to investigate change metrics in conjunction with code metrics to improve the ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

Internetware '23: Proceedings of the 14th Asia-Pacific Symposium on Internetware

August 2023

332 pages

ISBN:9798400708947

DOI:10.1145/3609437

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 October 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

National Natural Science Foundation of China

Conference

Internetware 2023

Internetware 2023: 14th Asia-Pacific Symposium on Internetware

August 4 - 6, 2023

Hangzhou, China

Acceptance Rates

Overall Acceptance Rate 55 of 111 submissions, 50%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
77
Total Downloads

Downloads (Last 12 months)35
Downloads (Last 6 weeks)2

Reflects downloads up to 20 Feb 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten