research-article

Mining and recommending software features across multiple web repositories

Authors:
Yue Yu

National University of Defense Technology, Changsha, China

National University of Defense Technology, Changsha, China
View Profile

,
Huaimin Wang

National University of Defense Technology, Changsha, China

National University of Defense Technology, Changsha, China
View Profile

,
Gang Yin

National University of Defense Technology, Changsha, China

National University of Defense Technology, Changsha, China
View Profile

,
Bo Liu

National University of Defense Technology, Changsha, China

National University of Defense Technology, Changsha, China
View Profile

Internetware '13: Proceedings of the 5th Asia-Pacific Symposium on InternetwareOctober 2013Article No.: 9Pages 1–9https://doi.org/10.1145/2532443.2532453

Published:23 October 2013Publication History

Internetware '13: Proceedings of the 5th Asia-Pacific Symposium on Internetware

Pages 1–9

ABSTRACT

The "Internetware" paradigm is fundamentally changing the traditional way of software development. More and more software projects are developed, maintained and shared on the Internet. However, a large quantity of heterogeneous software resources have not been organized in a reasonable and efficient way. Software feature is an ideal material to characterize software resources. The effectiveness of feature-related tasks will be greatly improved, if a multi-grained feature repository is available. In this paper, we propose a novel approach for organizing, analyzing and recommending software features. Firstly, we construct a Hierarchical rEpository of Software feAture (HESA). Then, we mine the hidden affinities among the features and recommend relevant and high-quality features to stakeholders based on HESA. Finally, we conduct a user study to evaluate our approach quantitatively. The results show that HESA can organize software features in a more reasonable way compared to the traditional and the state-of-the-art approaches. The result of feature recommendation is effective and interesting.

References

M. Acher, A. Cleve, G. Perrouin, P. Heymans, C. Vanbeneden, P. Collet, and P. Lahire. On extracting feature models from product descriptions. In VaMoS, pages 45--54, 2012. Google ScholarDigital Library
V. Alves, C. Schwanninger, L. Barbosa, A. Rashid, P. Sawyer, P. Rayson, C. Pohl, and A. Rummler. An exploratory study of information retrieval techniques in domain analysis. In SPLC, pages 67--76, 2008. Google ScholarDigital Library
S. Apel and C. Kastner. An overview of feature-oriented software development. pages 49--84, 2009.Google Scholar
H. U. Asuncion, A. U. Asuncion, and R. N. Taylor. Software traceability with topic modeling. In ICSE (1), pages 95--104, 2010. Google ScholarDigital Library
E. Bagheri, F. Ensan, and D. Gasevic. Decision support for the software product line domain engineering lifecycle. pages 335--377, 2012. Google ScholarDigital Library
D. Blei, A. Ng, and M. Jordan. Latent Dirichlet Allocation. Journal of Machine Learning Research, 3: 993--1022, 2003. Google ScholarDigital Library
J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence, pages 43--52. Morgan Kaufmann Publishers Inc., 1998. Google ScholarDigital Library
H. Dumitru, M. Gibiec, N. Hariri, J. Cleland-Huang, B. Mobasher, C. Castro-Herrera, and M. Mirakhorli. On-demand feature recommendations derived from mining public product descriptions. In ICSE, pages 181--190, 2011. Google ScholarDigital Library
W. B. Frakes, R. P. Dĺłaz, and C. J. Fox. Dare: Domain analysis and reuse environment. pages 125--141, 1998. Google ScholarDigital Library
T. Griffiths. Gibbs sampling in the generative model of Latent Dirichlet Allocation. Technical report, Stanford University, 2002.Google Scholar
J. Han, M. Kamber, and J. Pei. Data mining: concepts and techniques. Morgan kaufmann, 2006. Google ScholarDigital Library
A. E. Hassan. The road ahead for mining software repositories. 2008.Google Scholar
A. E. Hassan and T. Xie. Mining software engineering data. In ICSE (2), pages 503--504, 2010. Google ScholarDigital Library
K. C. Kang, S. G. Cohen, J. A. Hess, W. E. Novak, and A. S. Peterson. Feature-oriented domain analysis (foda) feasibility study. technical report. 1990.Google Scholar
K. Lee, K. C. Kang, and J. Lee. Concepts and guidelines of feature modeling for product line software engineering. In ICSR, pages 62--77, 2002. Google ScholarDigital Library
X. Li, H. Wang, G. Yin, T. Wang, C. Yang, Y. Yu, and D. Tang. Inducing taxonomy from tags: An agglomerative hierarchical clustering framework. In Advanced Data Mining and Applications, volume 7713, pages 64--77. Springer Berlin Heidelberg, 2012.Google ScholarCross Ref
H. Ma, D. Zhou, C. Liu, M. R. Lyu, and I. King. Recommender systems with social regularization. In Proceedings of the fourth ACM international conference on Web search and data mining, pages 287--296. ACM, 2011. Google ScholarDigital Library
A. Maedche and S. Staab. Learning ontologies for the semantic web. In SemWeb, 2001.Google ScholarDigital Library
A. K. McCallum. Mallet: A machine learning for language toolkit. http://mallet.cs.umass.edu, 2002.Google Scholar
C. McMillan, N. Hariri, D. Poshyvanyk, J. Cleland-Huang, and B. Mobasher. Recommending source code for use in rapid software prototypes. In ICSE, pages 848--858, 2012. Google ScholarDigital Library
H. Mei, G. Huang, and T. Xie. Internetware: A software paradigm for internet computing. Computer, 45(6): 26--31, June 2012. Google ScholarDigital Library
H. Mei, G. Huang, H. Zhao, and W. Jiao. A software architecture centric engineering approach for internetware. Science in China Series F: Information Sciences, 49(6): 702--730, 2006.Google ScholarCross Ref
H. Mei and X. Liu. Internetware: An emerging software paradigm for internet computing. J. Comput. Sci. Technol., 26(4): 588--599, 2011.Google ScholarCross Ref
N. Niu and S. M. Easterbrook. On-demand cluster analysis for product line functional requirements. In SPLC, pages 87--96, 2008. Google ScholarDigital Library
S. Park, M. Kim, and V. Sugumaran. A scenario, goal and feature-oriented domain analysis approach for developing software product lines. pages 296--308, 2004.Google Scholar
J. D. Rennie and N. Srebro. Fast maximum margin matrix factorization for collaborative prediction. In Proceedings of the 22nd international conference on Machine learning, pages 713--719. ACM, 2005. Google ScholarDigital Library
J. Tang, H. fung Leung, Q. Luo, D. Chen, and J. Gong. Towards ontology learning from folksonomies. In IJCAI, pages 2089--2094, 2009. Google ScholarDigital Library
K. Tian, M. Revelle, and D. Poshyvanyk. Using latent dirichlet allocation for automatic categorization of software. In MSR, pages 163--166, 2009. Google ScholarDigital Library
X. Wu, V. Kumar, J. R. Quinlan, J. Ghosh, Q. Yang, H. Motoda, G. J. McLachlan, A. Ng, B. Liu, S. Y. Philip, et al. Top 10 algorithms in data mining. Knowledge and Information Systems, 14(1): 1--37, 2008. Google ScholarDigital Library
Y. Yu, H. Wang, G. Yin, X. Li, and C. Yang. Hesa: The construction and evaluation of hierarchical software feature repository. In SEKE, pages 624--631, 2013.Google Scholar

Index Terms

Mining and recommending software features across multiple web repositories

Recommendations

Accuracy in Rating and Recommending Item Features
AH '08: Proceedings of the 5th international conference on Adaptive Hypermedia and Adaptive Web-Based Systems

This paper discusses accuracy in processing ratings of and recommendations for item features. Such processing facilitates feature-based user navigation in recommender system interfaces. Item features, often in the form of tags, categories or meta-data, ...
Read More
Recommending Serendipitous Items using Transfer Learning
CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management

Most recommender algorithms are designed to suggest relevant items, but suggesting these items does not always result in user satisfaction. Therefore, the efforts in recommender systems recently shifted towards serendipity, but generating serendipitous ...
Read More
Recommending items to group of users using Matrix Factorization based Collaborative Filtering

Group recommender systems are becoming very popular in the social web owing to their ability to provide a set of recommendations to a group of users. Several group recommender systems have been proposed by extending traditional KNN based Collaborative ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
Internetware '13: Proceedings of the 5th Asia-Pacific Symposium on Internetware
October 2013
211 pages
ISBN:9781450323697
DOI:10.1145/2532443
Conference Chairs:
Hong Mei,
Jian Lv,
Program Chair:
Xiaoguang Mao
Copyright © 2013 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 23 October 2013
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
domain analysis
feature ontology
mining software repository
recommender system
Qualifiers
- research-article
Conference

Acceptance Rates
Internetware '13 Paper Acceptance Rate15of50submissions,30%Overall Acceptance Rate55of111submissions,50%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 20
  Total Citations
  View Citations
- 177
  Total Downloads
- Downloads (Last 12 months)7
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Mining and recommending software features across multiple web repositories

Internetware '13: Proceedings of the 5th Asia-Pacific Symposium on Internetware

ABSTRACT

References

Cited By

Index Terms

Recommendations

Accuracy in Rating and Recommending Item Features

Recommending Serendipitous Items using Transfer Learning

Recommending items to group of users using Matrix Factorization based Collaborative Filtering

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Mining and recommending software features across multiple web repositories

Internetware '13: Proceedings of the 5th Asia-Pacific Symposium on Internetware

ABSTRACT

References

Cited By

Index Terms

Recommendations

Accuracy in Rating and Recommending Item Features

Recommending Serendipitous Items using Transfer Learning

Recommending items to group of users using Matrix Factorization based Collaborative Filtering

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media