Abstract
In this paper, we present an approach to software component ranking intended for use in searching for such components on the internet. The method used introduces a novel method of weighting keywords that takes account of where within the structure of a component the keyword is found. This hierarchical weighting scheme is used in two ranking algorithms: one using summed weights, the other using a vector space model. Experimental comparisons with algorithms using TF-IDF weighting that ignore component structure are described. The results demonstrate consistent superiority of the hierarchical weighting approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Andritsos, P., Tzerpos, V.: Information-Theoretic Software Clustering. IEEE Transactions on Software Engineering 31(2), 150–165 (2005)
Anquetil, N., Lethbridge, T.C.: Recovering Software Architecture from the Names of Source Files. J. Software Maintenance: Research and Practice 11, 201–221 (1999)
Baeza, R., Neto, B.: Modern Information Retrieval. ACM Press, Addison Wesley, New York (1999)
Belew, R.K.: Finding Out About A Cognitive Perspective on Search Engine Technology and the WWW. Cambridge University Press, Cambridge (2000)
Bernstein, A., Klein, M.: Towards High-Precision Service Retrieval. In: Horrocks, I., Hendler, J. (eds.) ISWC 2002. LNCS, vol. 2342, p. 84. Springer, Heidelberg (2002)
Braga, R.M.M., Mattoso, M., Werner, C.M.L.: The use of mediation and ontology technologies for software component information retrieval. In: Symposium on Software Reuse (SSR 2001), Toronto, Canada (2001)
Fischer, B.: Specification-Based Browsing of Software Component Libraries. Journal of Automated Software Engineering 7(2), 179–200 (2000)
Inoue, K., Yokomori, R., Fujiwara, H., Yamamoto, T., Matsushita, M., Kusumoto, S.: Component Rank: Relative Significance Rank for Software Component Search. In: Proc. International Conf. on Software Engineering (ICSE 2003), Portland, OR, pp. 14–24 (2003)
Lindig, C.: Concept-based component retrieval. In: Kohler, J., Giunchiglia, F., Green, C., Walther, C. (eds.) Working Notes of the ZJCAI-1995 Workshop: Formal Approaches to the Reuse of Plans, Proofs, and Programs, pp. 21–25 (1995)
Penix, J., Alexander, P.: Efficient Specification-Based Component Retrieval. In: Automated Software Engineering, vol. 6, pp. 139–170. Kluwer Academic Publishers, Dordrecht (1999)
Seacord, R., Hissan, S., Wallnau, K.: Agora: A Search Engine for Software Components. IEEE Internet Computing 2(6) (1998)
Sourceforge.: Sourceforge.net. (August 4, 2005), http://sourceforge.net/ Accessed
Sugumaran, V., Storey, V.C.: A Semantic-Based Approach to Component Retrieval. The DATA BASE for Advances in Information Systems, ACM SIG Management Information Systems  34(3), 8–24 (2003)
Washizaki, H., Fukazawa, Y.: Component-Extraction-based Search System for Object Oriented Programs. In: Bosch, J., Krueger, C. (eds.) ICOIN 2004 and ICSR 2004. LNCS, vol. 3107, pp. 254–263. Springer, Heidelberg (2004)
Zhang, Z., Svensson, L., Snis, U., Srensen, C., Fgerlind, H., Lindroth, T., Magnusson, M., Stlund, C.: Enhancing Component Reuse Using Search Techniques. In: Proceedings of IRIS 23, Laboratorium for Interaction Technology, University of Trollhttan Uddevalla (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gui, G., Scott, P.D. (2005). Vector Space Based on Hierarchical Weighting: A Component Ranking Approach to Component Retrieval. In: Cao, J., Nejdl, W., Xu, M. (eds) Advanced Parallel Processing Technologies. APPT 2005. Lecture Notes in Computer Science, vol 3756. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573937_21
Download citation
DOI: https://doi.org/10.1007/11573937_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29639-3
Online ISBN: 978-3-540-32107-1
eBook Packages: Computer ScienceComputer Science (R0)