Topic modeling and intuitionistic fuzzy set-based approach for efficient software bug triaging

Panda, Rama Ranjan; Nagwani, Naresh Kumar

doi:10.1007/s10115-022-01735-z

Topic modeling and intuitionistic fuzzy set-based approach for efficient software bug triaging

Regular Paper
Published: 20 August 2022

Volume 64, pages 3081–3111, (2022)
Cite this article

Knowledge and Information Systems Aims and scope Submit manuscript

352 Accesses
7 Citations
1 Altmetric
Explore all metrics

Abstract

Modern software development involves multiple developers working remotely in a distributed manner around the world. Software bugs are continuously generated for multiple reasons across various modules. It is possible that one software bug can affect multiple modules, and there can be multiple developers associated with it. Furthermore, many software bug reports are unlabeled, vague, and noisy. The triager faces significant challenges in identifying multiple causes of software bugs and finding expert developers for bug fixing. In this paper, the fuzzy set is extended to Intuitionistic Fuzzy Sets (IFS), and a novel bug triaging approach based on Intuitionistic Fuzzy Similarity (IFSim) measures is presented to overcome the aforementioned problems. The topic model is used to discover multiple relationships between developers and software bugs. IFS is used to separate developers based on their degree of membership and non-membership in a particular software category, with a degree of hesitation for some developers. For a new bug, 15 different IFSim measure techniques are investigated to compute the similarity with the existing software bugs. Finally, a fuzzy \(\alpha \)-cut is applied to find expert developers to repair it. The best results are obtained by considering the number of topics of 15 and 12 taxonomic terms for each topic. Among all the IFSim measure techniques, the similarity techniques proposed by Ye outperform other techniques. Experiments are carried out on available benchmark data sets, and the results are compared to traditional machine learning algorithms and the fuzzy logic-based Bugzie model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An Improved Software Bug Triaging Approach Based on Topic Modeling and Fuzzy Logic

Software bug priority prediction technique based on intuitionistic fuzzy representation and class imbalance learning

Article 10 October 2023

Rama Ranjan Panda & Naresh Kumar Nagwani

Semantic Categorization of Software Bug Repositories for Severity Assignment Automation

Notes

References

Alazzam I, Aleroud A, Al Latifah Z, Karabatis G (2020) Automatic bug triage in software systems using graph neighborhood relations for feature augmentation. IEEE Trans Comput Soc Syst 7(5):1288–1303
Article Google Scholar
Alkhazi B, DiStasi A, Aljedaani W, Alrubaye H, Ye X, Mkaouer MW (2020) Learning to rank developers for bug report assignment. Appl Soft Comput 95:106667
Article Google Scholar
Almhana R, Kessentini M (2021) Considering dependencies between bug reports to improve bugs triage. Autom Softw Eng 28(1):1–26
Article Google Scholar
Almhana R, Kessentini M, Mkaouer W (2021) Method-level bug localization using hybrid multi-objective search. Inf Softw Technol 131:106474
Article Google Scholar
Aung TWW, Wan Y, Huo H, Sui Y (2022) Multi-triage: a multi-task learning framework for bug triage. J Syst Softw 184:111133
Article Google Scholar
Bouchet A, Montes S, Ballarin V, Diaz I (2020) Intuitionistic fuzzy set and fuzzy mathematical morphology applied to color leukocytes segmentation. SIViP 14(3):557–564
Article Google Scholar
Chen SM (1995) Measures of similarity between vague sets. Fuzzy Sets Syst 74(2):217–223
Article MathSciNet MATH Google Scholar
Chen SM, Cheng SH, Lan TC (2016) A novel similarity measure between intuitionistic fuzzy sets based on the centroid points of transformed fuzzy numbers with applications to pattern recognition. Inf Sci 343:15–40
Article MathSciNet MATH Google Scholar
Chen TH, Thomas SW, Hassan AE (2016) A survey on the use of topic models when mining software repositories. Empir Softw Eng 21(5):1843–1919
Article Google Scholar
Cheng Y, Li Y, Yang J (2021) Multi-attribute decision-making method based on a novel distance measure of linguistic intuitionistic fuzzy sets. J Intell Fuzzy Syst 40(1):1147–1160
Article Google Scholar
Corley CS, Damevski K, Kraft NA (2018) Changeset-based topic modeling of software repositories. IEEE Trans Softw Eng 46(10):1068–1080
Article Google Scholar
Falessi D, Huang J, Narayana L, Thai JF, Turhan B (2020) On the need of preserving order of data when validating within-project defect classifiers. Empir Softw Eng 25(6):4805–4830
Article Google Scholar
Fan L, Zhangyan X (2001) Measures of similarity between vague sets. J Softw 12(6):922–927
Google Scholar
Garg H, Kumar K (2018) Distance measures for connection number sets based on set pair analysis and its applications to decision-making process. Appl Intell 48(10):3346–3359
Article Google Scholar
Ge X, Zheng S, Wang J, Li H (2020) High-dimensional hybrid data reduction for effective bug triage. Math Probl Eng 2020:1–20
Google Scholar
Goguen J (1973) La zadeh. fuzzy sets. information and control, vol. 8 (1965), pp. 338–353.-la zadeh. similarity relations and fuzzy orderings. information sciences, vol. 3 (1971), pp. 177–200. J Symb. Logic 38(4):656–657
Article Google Scholar
Guo S, Chen R, Wei M, Li H, Liu Y (2018) Ensemble data reduction techniques and multi-RSMOTE via fuzzy integral for bug report classification. IEEE Access 6:45934–45950
Article Google Scholar
Guo S, Zhang X, Yang X, Chen R, Guo C, Li H, Li T (2020) Developer activity motivated bug triaging: via convolutional neural network. Neural Process Lett 51(3):2589–2606
Article Google Scholar
Gupta C, Freire MM (2021) A decentralized blockchain oriented framework for automated bug assignment. Inf Softw Technol 134:106540
Article Google Scholar
Hamdy A, Ezzat G (2020) Deep mining of open source software bug repositories. Int J Comput Appl 44(7):614–622
Google Scholar
Herbold S, Trautsch A, Trautsch F (2020) On the feasibility of automated prediction of bug and non-bug issues. Empir Softw Eng 25(6):5333–5369
Article Google Scholar
Hong DH, Kim C (1999) A note on similarity measures between vague sets and between elements. Inf Sci 115(1–4):83–96
Article MathSciNet MATH Google Scholar
Hung WL, Yang MS (2008) On similarity measures between intuitionistic fuzzy sets. Int J Intell Syst 23(3):364–383
Article MATH Google Scholar
Jahanshahi H, Chhabra K, Cevik M, Baar A (2021) DABT: a dependency-aware bug triaging method. In: Evaluation and assessment in software engineering. ACM, pp 221–230
Jiang Q, Jin X, Lee SJ, Yao S (2019) A new similarity/distance measure between intuitionistic fuzzy sets based on the transformed isosceles triangles and its applications to pattern recognition. Expert Syst Appl 116:439–453
Article Google Scholar
Kashiwa Y, Ohira M (2020) A release-aware bug triaging method considering developers’ bug-fixing loads. IEICE Trans Inf Syst 103(2):348–362
Article Google Scholar
Kaushal M, Lohani QD (2021) Generalized intuitionistic fuzzy c-means clustering algorithm using an adaptive intuitionistic fuzzification technique. Granul Comput 7:183–195
Article Google Scholar
Krassimir TA, Parvathi R (1986) Intuitionistic fuzzy sets. Fuzzy Sets Syst 20(1):87–96
Article MATH Google Scholar
Lee DG, Seo YS (2020) Improving bug report triage performance using artificial intelligence based document generation model. HCIS 10(1):1–22
Google Scholar
Li Y, Olson DL, Qin Z (2007) Similarity measures between intuitionistic fuzzy (vague) sets: a comparative analysis. Pattern Recognit Lett 28(2):278–285
Article Google Scholar
Liu HW (2005) New similarity measures between intuitionistic fuzzy sets and between elements. Math Comput Model 42(1–2):61–70
Article MathSciNet MATH Google Scholar
Liu Q, Huang H, Xuan J, Zhang G, Gao Y, Lu J (2020) A fuzzy word similarity measure for selecting top-k similar words in query expansion. IEEE Trans Fuzzy Syst 29(8):2132–2144
Article Google Scholar
Maheshan M, Harish B (2021) A modified intuitionistic fuzzy clustering approach for sclera segmentation. SN Comput Sci 2(4):1–8
Article Google Scholar
Ngan RT, Cuong BC, Ali M et al (2018) H-max distance measure of intuitionistic fuzzy sets in decision making. Appl Soft Comput 69:393–425
Article Google Scholar
Panda RR, Nagwani NK (2019) Software bug categorization technique based on fuzzy similarity. In: 2019 IEEE 9th international conference on advanced computing (IACC). IEEE, pp 1–6
Panda RR, Nagwani NK (2021) Multi-label software bug categorisation based on fuzzy similarity. Int J Comput Sci Eng 24(3):244–258
Google Scholar
Pandolfo G, D’Ambrosio A, Cannavacciuolo L, Siciliano R (2020) Fuzzy logic aggregation of crisp data partitions as learning analytics in triage decisions. Expert Syst Appl 158:113512
Article Google Scholar
Panichella S, Zaugg N (2020) An empirical investigation of relevant changes and automation needs in modern code review. Empir Softw Eng 25(6):4833–4872
Article Google Scholar
Raji-Lawal HY, Akinwale AT, Folorunsho O, Mustapha AO (2020) Decision support system for dementia patients using intuitionistic fuzzy similarity measure. Soft Comput Lett 2:100005
Article Google Scholar
Rodríguez-Pérez G, Robles G, Serebrenik A, Zaidman A, Germán DM, Gonzalez-Barahona JM (2020) How bugs are born: a model to identify how bugs are introduced in software components. Empir Softw Eng 5(2):1294–1340
Article Google Scholar
Soltani M, Hermans F, Bäck T (2020) The significance of bug report elements. Empir Softw Eng 25(6):5255–5294
Article Google Scholar
Song Y, Wang X, Lei L, Xue A (2014) A new similarity measure between intuitionistic fuzzy sets and its application to pattern recognition. In: Abstract and applied analysis, vol 2014. Hindawi
Su Y, Xing Z, Peng X, Xia X, Wang C, Xu X, Zhu L (2021) Reducing bug triaging confusion by learning from mistakes with a bug tossing knowledge graph. In: 2021 36th IEEE/ACM international conference on automated software engineering (ASE). IEEE, pp 191–202
Sugeno M, Terano T (1977) A model of learning based on fuzzy information. Kybernetes 6(3):157–166
Article MATH Google Scholar
Tamrawi A, Nguyen TT, Al-Kofahi J, Nguyen TN (2011) Fuzzy set-based automatic bug triaging (NIER track). In: Proceedings of the 33rd international conference on software engineering, pp 884–887
Tamrawi A, Nguyen TT, Al-Kofahi JM, Nguyen TN (2011) Fuzzy set and cache-based approach for bug triaging. In: Proceedings of the 19th ACM SIGSOFT symposium and the 13th European conference on Foundations of software engineering, pp 365–375
Thao NX (2020) Similarity measures of picture fuzzy sets based on entropy and their application in MCDM. Pattern Anal Appl 23(3):1203–1213
Article MathSciNet Google Scholar
Tran HM, Le ST, Van Nguyen S, Ho PT (2020) An analysis of software bug reports using machine learning techniques. SN Comput Sci 1(1):4
Article Google Scholar
Wang Y, Yao Y, Tong H, Huo X, Li M, Xu F, Lu J (2020) Enhancing supervised bug localization with metadata and stack-trace. Knowl Inf Syst 62(6):2461–2484
Article Google Scholar
Wu X, Zheng W, Pu M, Chen J, Mu D (2020) Invalid bug reports complicate the software aging situation. Softw Qual J 28(1):195–220
Article Google Scholar
Xi SQ, Yao Y, Xiao XS, Xu F, Lv J (2019) Bug triaging based on tossing sequence modeling. J Comput Sci Technol 34(5):942–956
Article Google Scholar
Yager RR (1979) On the measure of fuzziness and negation part I: membership in the unit interval. Int J Gen Syst 5:221–229
Article MATH Google Scholar
Yang K, Cai Y, Leung HF, Lau RY, Li Q (2019) ITWF: a framework to apply term weighting schemes in topic model. Neurocomputing 350:248–260
Article Google Scholar
Ye J (2011) Cosine similarity measures for intuitionistic fuzzy sets and their applications. Math Comput Model 53(1–2):91–97
Article MathSciNet MATH Google Scholar
Zaidi SFA, Lee CG (2021) Learning graph representation of bug reports to triage bugs using graph convolution network. In: 2021 international conference on information networking (ICOIN). IEEE, pp 504–507

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, National Institute of Technology, Raipur, India
Rama Ranjan Panda & Naresh Kumar Nagwani

Authors

Rama Ranjan Panda
View author publications
You can also search for this author in PubMed Google Scholar
Naresh Kumar Nagwani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Rama Ranjan Panda or Naresh Kumar Nagwani.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Illustrative example

A sample of the Eclipse data set^{Footnote 7} is used to illustrate the proposed approach. Initially, a fixed number of \(S_\mathrm{b}\) in a given range is considered for conducting the experiment. Here, all the developers having bug counts between 45 and 70 are selected, and it is illustrated in Table 8.

Table 8 A sample bug distribution between developers for Eclipse data set

Topic modeling and intuitionistic fuzzy set-based approach for efficient software bug triaging

Abstract

Access this article

Similar content being viewed by others

An Improved Software Bug Triaging Approach Based on Topic Modeling and Fuzzy Logic

Software bug priority prediction technique based on intuitionistic fuzzy representation and class imbalance learning

Semantic Categorization of Software Bug Repositories for Severity Assignment Automation

Notes

References

Author information

Authors and Affiliations

Corresponding authors

Additional information

Publisher's Note

Appendices

Illustrative example

Experimental results

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation