Abstract
Users of open source software (OSS) platforms frequently join issue discussions in various repositories and establish numerous co-talk (i.e., cross-issue reference and discussion) relationships between issues, both within and across repositories. In this work, we collect and analyze issue discussion data from GitHub to study the unique features of co-talk relationships, and discover that many participants play a versatile role in issue discussions across repositories. Based on this discovery, we enhance existing bug triaging technologies with the Relation and Heterogeneousness Enhanced (RHE) method, which includes potential participants with cross-repository co-talk histories. RHE integrates co-talk relationship embedding and heterogeneous graph embedding for complex OSS communities. We conduct experiments with real-world data collected from GitHub to show the effectiveness and usefulness of RHE. The results suggest that RHE achieves improved performance compared with the baseline approaches.
Acknowledgement
This work is supported by the National Key R&D Program of China under Grant No. 2018AAA0102302, and the Collaborative Innovation Center of Novel Software Technology and Industrialization.
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Jiang, H., Wang, L., Tao, X., Hu, H. (2021). RHE: Relation and Heterogeneousness Enhanced Issue Participants Recommendation. In: Xing, C., Fu, X., Zhang, Y., Zhang, G., Borjigin, C. (eds) Web Information Systems and Applications. WISA 2021. Lecture Notes in Computer Science, vol 12999. Springer, Cham. https://doi.org/10.1007/978-3-030-87571-8_52
DOI: https://doi.org/10.1007/978-3-030-87571-8_52
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87570-1
Online ISBN: 978-3-030-87571-8
eBook Packages: Computer Science, Computer Science (R0)