ABSTRACT
The "Internetware" paradigm is fundamentally changing the traditional way of software development. More and more software projects are developed, maintained and shared on the Internet. However, a large quantity of heterogeneous software resources have not been organized in a reasonable and efficient way. Software feature is an ideal material to characterize software resources. The effectiveness of feature-related tasks will be greatly improved, if a multi-grained feature repository is available. In this paper, we propose a novel approach for organizing, analyzing and recommending software features. Firstly, we construct a Hierarchical rEpository of Software feAture (HESA). Then, we mine the hidden affinities among the features and recommend relevant and high-quality features to stakeholders based on HESA. Finally, we conduct a user study to evaluate our approach quantitatively. The results show that HESA can organize software features in a more reasonable way compared to the traditional and the state-of-the-art approaches. The result of feature recommendation is effective and interesting.
- M. Acher, A. Cleve, G. Perrouin, P. Heymans, C. Vanbeneden, P. Collet, and P. Lahire. On extracting feature models from product descriptions. In VaMoS, pages 45--54, 2012. Google ScholarDigital Library
- V. Alves, C. Schwanninger, L. Barbosa, A. Rashid, P. Sawyer, P. Rayson, C. Pohl, and A. Rummler. An exploratory study of information retrieval techniques in domain analysis. In SPLC, pages 67--76, 2008. Google ScholarDigital Library
- S. Apel and C. Kastner. An overview of feature-oriented software development. pages 49--84, 2009.Google Scholar
- H. U. Asuncion, A. U. Asuncion, and R. N. Taylor. Software traceability with topic modeling. In ICSE (1), pages 95--104, 2010. Google ScholarDigital Library
- E. Bagheri, F. Ensan, and D. Gasevic. Decision support for the software product line domain engineering lifecycle. pages 335--377, 2012. Google ScholarDigital Library
- D. Blei, A. Ng, and M. Jordan. Latent Dirichlet Allocation. Journal of Machine Learning Research, 3: 993--1022, 2003. Google ScholarDigital Library
- J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence, pages 43--52. Morgan Kaufmann Publishers Inc., 1998. Google ScholarDigital Library
- H. Dumitru, M. Gibiec, N. Hariri, J. Cleland-Huang, B. Mobasher, C. Castro-Herrera, and M. Mirakhorli. On-demand feature recommendations derived from mining public product descriptions. In ICSE, pages 181--190, 2011. Google ScholarDigital Library
- W. B. Frakes, R. P. Dĺłaz, and C. J. Fox. Dare: Domain analysis and reuse environment. pages 125--141, 1998. Google ScholarDigital Library
- T. Griffiths. Gibbs sampling in the generative model of Latent Dirichlet Allocation. Technical report, Stanford University, 2002.Google Scholar
- J. Han, M. Kamber, and J. Pei. Data mining: concepts and techniques. Morgan kaufmann, 2006. Google ScholarDigital Library
- A. E. Hassan. The road ahead for mining software repositories. 2008.Google Scholar
- A. E. Hassan and T. Xie. Mining software engineering data. In ICSE (2), pages 503--504, 2010. Google ScholarDigital Library
- K. C. Kang, S. G. Cohen, J. A. Hess, W. E. Novak, and A. S. Peterson. Feature-oriented domain analysis (foda) feasibility study. technical report. 1990.Google Scholar
- K. Lee, K. C. Kang, and J. Lee. Concepts and guidelines of feature modeling for product line software engineering. In ICSR, pages 62--77, 2002. Google ScholarDigital Library
- X. Li, H. Wang, G. Yin, T. Wang, C. Yang, Y. Yu, and D. Tang. Inducing taxonomy from tags: An agglomerative hierarchical clustering framework. In Advanced Data Mining and Applications, volume 7713, pages 64--77. Springer Berlin Heidelberg, 2012.Google ScholarCross Ref
- H. Ma, D. Zhou, C. Liu, M. R. Lyu, and I. King. Recommender systems with social regularization. In Proceedings of the fourth ACM international conference on Web search and data mining, pages 287--296. ACM, 2011. Google ScholarDigital Library
- A. Maedche and S. Staab. Learning ontologies for the semantic web. In SemWeb, 2001.Google ScholarDigital Library
- A. K. McCallum. Mallet: A machine learning for language toolkit. http://mallet.cs.umass.edu, 2002.Google Scholar
- C. McMillan, N. Hariri, D. Poshyvanyk, J. Cleland-Huang, and B. Mobasher. Recommending source code for use in rapid software prototypes. In ICSE, pages 848--858, 2012. Google ScholarDigital Library
- H. Mei, G. Huang, and T. Xie. Internetware: A software paradigm for internet computing. Computer, 45(6): 26--31, June 2012. Google ScholarDigital Library
- H. Mei, G. Huang, H. Zhao, and W. Jiao. A software architecture centric engineering approach for internetware. Science in China Series F: Information Sciences, 49(6): 702--730, 2006.Google ScholarCross Ref
- H. Mei and X. Liu. Internetware: An emerging software paradigm for internet computing. J. Comput. Sci. Technol., 26(4): 588--599, 2011.Google ScholarCross Ref
- N. Niu and S. M. Easterbrook. On-demand cluster analysis for product line functional requirements. In SPLC, pages 87--96, 2008. Google ScholarDigital Library
- S. Park, M. Kim, and V. Sugumaran. A scenario, goal and feature-oriented domain analysis approach for developing software product lines. pages 296--308, 2004.Google Scholar
- J. D. Rennie and N. Srebro. Fast maximum margin matrix factorization for collaborative prediction. In Proceedings of the 22nd international conference on Machine learning, pages 713--719. ACM, 2005. Google ScholarDigital Library
- J. Tang, H. fung Leung, Q. Luo, D. Chen, and J. Gong. Towards ontology learning from folksonomies. In IJCAI, pages 2089--2094, 2009. Google ScholarDigital Library
- K. Tian, M. Revelle, and D. Poshyvanyk. Using latent dirichlet allocation for automatic categorization of software. In MSR, pages 163--166, 2009. Google ScholarDigital Library
- X. Wu, V. Kumar, J. R. Quinlan, J. Ghosh, Q. Yang, H. Motoda, G. J. McLachlan, A. Ng, B. Liu, S. Y. Philip, et al. Top 10 algorithms in data mining. Knowledge and Information Systems, 14(1): 1--37, 2008. Google ScholarDigital Library
- Y. Yu, H. Wang, G. Yin, X. Li, and C. Yang. Hesa: The construction and evaluation of hierarchical software feature repository. In SEKE, pages 624--631, 2013.Google Scholar
Index Terms
- Mining and recommending software features across multiple web repositories
Recommendations
Accuracy in Rating and Recommending Item Features
AH '08: Proceedings of the 5th international conference on Adaptive Hypermedia and Adaptive Web-Based SystemsThis paper discusses accuracy in processing ratings of and recommendations for item features. Such processing facilitates feature-based user navigation in recommender system interfaces. Item features, often in the form of tags, categories or meta-data, ...
Recommending Serendipitous Items using Transfer Learning
CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge ManagementMost recommender algorithms are designed to suggest relevant items, but suggesting these items does not always result in user satisfaction. Therefore, the efforts in recommender systems recently shifted towards serendipity, but generating serendipitous ...
Recommending items to group of users using Matrix Factorization based Collaborative Filtering
Group recommender systems are becoming very popular in the social web owing to their ability to provide a set of recommendations to a group of users. Several group recommender systems have been proposed by extending traditional KNN based Collaborative ...
Comments