Reference Hub1
Automatically Labelled Software Topic Model

Automatically Labelled Software Topic Model

Youcef Bouziane, Mustapha Kamel Abdi, Salah Sadou
Copyright: © 2020 |Volume: 11 |Issue: 1 |Pages: 22
ISSN: 1942-3926|EISSN: 1942-3934|EISBN13: 9781799806059|DOI: 10.4018/IJOSSP.2020010104
Cite Article Cite Article

MLA

Bouziane, Youcef, et al. "Automatically Labelled Software Topic Model." IJOSSP vol.11, no.1 2020: pp.57-78. http://doi.org/10.4018/IJOSSP.2020010104

APA

Bouziane, Y., Abdi, M. K., & Sadou, S. (2020). Automatically Labelled Software Topic Model. International Journal of Open Source Software and Processes (IJOSSP), 11(1), 57-78. http://doi.org/10.4018/IJOSSP.2020010104

Chicago

Bouziane, Youcef, Mustapha Kamel Abdi, and Salah Sadou. "Automatically Labelled Software Topic Model," International Journal of Open Source Software and Processes (IJOSSP) 11, no.1: 57-78. http://doi.org/10.4018/IJOSSP.2020010104

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

Public software repositories (SR) maintain a massive amount of valuable data offering opportunities to support software engineering (SE) tasks. Researchers have applied information retrieval techniques in mining software repositories. Topic models are one of these techniques. However, this technique does not give an interpretation nor labels to the extracted topics and it requires manual analysis to identify them. Some approaches were proposed to automatically label the topics using tags in SR, but they do not consider the existence of spam-tags and they have difficulties to scale to large tag space. This article introduces a novel approach called automatically labelled software topic model (AL-STM) that labels the topics based on observed tags in SR. It mitigates the shortcomings of manual and automatic labelling of topics in SE. AL-STM is implemented using 22K GitHub projects and evaluated in a SE task (tag recommending) against the currently used techniques. The empirical results suggest that AL-STM is more robust in terms of MAP and nDCG, and more scalable to large tag space.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.