skip to main content
10.1145/2532443.2532453acmotherconferencesArticle/Chapter ViewAbstractPublication PagesinternetwareConference Proceedingsconference-collections
research-article

Mining and recommending software features across multiple web repositories

Authors Info & Claims
Published:23 October 2013Publication History

ABSTRACT

The "Internetware" paradigm is fundamentally changing the traditional way of software development. More and more software projects are developed, maintained and shared on the Internet. However, a large quantity of heterogeneous software resources have not been organized in a reasonable and efficient way. Software feature is an ideal material to characterize software resources. The effectiveness of feature-related tasks will be greatly improved, if a multi-grained feature repository is available. In this paper, we propose a novel approach for organizing, analyzing and recommending software features. Firstly, we construct a Hierarchical rEpository of Software feAture (HESA). Then, we mine the hidden affinities among the features and recommend relevant and high-quality features to stakeholders based on HESA. Finally, we conduct a user study to evaluate our approach quantitatively. The results show that HESA can organize software features in a more reasonable way compared to the traditional and the state-of-the-art approaches. The result of feature recommendation is effective and interesting.

References

  1. M. Acher, A. Cleve, G. Perrouin, P. Heymans, C. Vanbeneden, P. Collet, and P. Lahire. On extracting feature models from product descriptions. In VaMoS, pages 45--54, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. V. Alves, C. Schwanninger, L. Barbosa, A. Rashid, P. Sawyer, P. Rayson, C. Pohl, and A. Rummler. An exploratory study of information retrieval techniques in domain analysis. In SPLC, pages 67--76, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Apel and C. Kastner. An overview of feature-oriented software development. pages 49--84, 2009.Google ScholarGoogle Scholar
  4. H. U. Asuncion, A. U. Asuncion, and R. N. Taylor. Software traceability with topic modeling. In ICSE (1), pages 95--104, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. E. Bagheri, F. Ensan, and D. Gasevic. Decision support for the software product line domain engineering lifecycle. pages 335--377, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. Blei, A. Ng, and M. Jordan. Latent Dirichlet Allocation. Journal of Machine Learning Research, 3: 993--1022, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence, pages 43--52. Morgan Kaufmann Publishers Inc., 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. H. Dumitru, M. Gibiec, N. Hariri, J. Cleland-Huang, B. Mobasher, C. Castro-Herrera, and M. Mirakhorli. On-demand feature recommendations derived from mining public product descriptions. In ICSE, pages 181--190, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. W. B. Frakes, R. P. Dĺłaz, and C. J. Fox. Dare: Domain analysis and reuse environment. pages 125--141, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. T. Griffiths. Gibbs sampling in the generative model of Latent Dirichlet Allocation. Technical report, Stanford University, 2002.Google ScholarGoogle Scholar
  11. J. Han, M. Kamber, and J. Pei. Data mining: concepts and techniques. Morgan kaufmann, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. E. Hassan. The road ahead for mining software repositories. 2008.Google ScholarGoogle Scholar
  13. A. E. Hassan and T. Xie. Mining software engineering data. In ICSE (2), pages 503--504, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. K. C. Kang, S. G. Cohen, J. A. Hess, W. E. Novak, and A. S. Peterson. Feature-oriented domain analysis (foda) feasibility study. technical report. 1990.Google ScholarGoogle Scholar
  15. K. Lee, K. C. Kang, and J. Lee. Concepts and guidelines of feature modeling for product line software engineering. In ICSR, pages 62--77, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. X. Li, H. Wang, G. Yin, T. Wang, C. Yang, Y. Yu, and D. Tang. Inducing taxonomy from tags: An agglomerative hierarchical clustering framework. In Advanced Data Mining and Applications, volume 7713, pages 64--77. Springer Berlin Heidelberg, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  17. H. Ma, D. Zhou, C. Liu, M. R. Lyu, and I. King. Recommender systems with social regularization. In Proceedings of the fourth ACM international conference on Web search and data mining, pages 287--296. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. A. Maedche and S. Staab. Learning ontologies for the semantic web. In SemWeb, 2001.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. K. McCallum. Mallet: A machine learning for language toolkit. http://mallet.cs.umass.edu, 2002.Google ScholarGoogle Scholar
  20. C. McMillan, N. Hariri, D. Poshyvanyk, J. Cleland-Huang, and B. Mobasher. Recommending source code for use in rapid software prototypes. In ICSE, pages 848--858, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. H. Mei, G. Huang, and T. Xie. Internetware: A software paradigm for internet computing. Computer, 45(6): 26--31, June 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. H. Mei, G. Huang, H. Zhao, and W. Jiao. A software architecture centric engineering approach for internetware. Science in China Series F: Information Sciences, 49(6): 702--730, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  23. H. Mei and X. Liu. Internetware: An emerging software paradigm for internet computing. J. Comput. Sci. Technol., 26(4): 588--599, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  24. N. Niu and S. M. Easterbrook. On-demand cluster analysis for product line functional requirements. In SPLC, pages 87--96, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. S. Park, M. Kim, and V. Sugumaran. A scenario, goal and feature-oriented domain analysis approach for developing software product lines. pages 296--308, 2004.Google ScholarGoogle Scholar
  26. J. D. Rennie and N. Srebro. Fast maximum margin matrix factorization for collaborative prediction. In Proceedings of the 22nd international conference on Machine learning, pages 713--719. ACM, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. J. Tang, H. fung Leung, Q. Luo, D. Chen, and J. Gong. Towards ontology learning from folksonomies. In IJCAI, pages 2089--2094, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. K. Tian, M. Revelle, and D. Poshyvanyk. Using latent dirichlet allocation for automatic categorization of software. In MSR, pages 163--166, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. X. Wu, V. Kumar, J. R. Quinlan, J. Ghosh, Q. Yang, H. Motoda, G. J. McLachlan, A. Ng, B. Liu, S. Y. Philip, et al. Top 10 algorithms in data mining. Knowledge and Information Systems, 14(1): 1--37, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Y. Yu, H. Wang, G. Yin, X. Li, and C. Yang. Hesa: The construction and evaluation of hierarchical software feature repository. In SEKE, pages 624--631, 2013.Google ScholarGoogle Scholar

Index Terms

  1. Mining and recommending software features across multiple web repositories

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            Internetware '13: Proceedings of the 5th Asia-Pacific Symposium on Internetware
            October 2013
            211 pages
            ISBN:9781450323697
            DOI:10.1145/2532443

            Copyright © 2013 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 23 October 2013

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Internetware '13 Paper Acceptance Rate15of50submissions,30%Overall Acceptance Rate55of111submissions,50%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader