skip to main content
10.1145/2016039.2016138acmconferencesArticle/Chapter ViewAbstractPublication Pagesacm-seConference Proceedingsconference-collections
poster

Toward an understanding of the relationship between the identifier and comment lexicons

Published:24 March 2011Publication History

ABSTRACT

Source code retrieval techniques show efficacy in the automation of software understanding activities, but the literature provides no guidance regarding the impact of comments on the performance of these techniques. In this paper we present an initial investigation of the effects of using comments in the source code retrieval process. We address our research question using a case study of six open source Java projects. The results indicate that the inclusion of comments significantly affects the average keyword density for a project. Future work includes analyzing the extent to which comments affect the average keyword density of domain terms and non-domain terms.

References

  1. S. Abebe, S. Haiduc, A. Marcus, P. Tonella, and G. Antoniol. Analyzing the evolution of the source code vocabulary. In Proceedings of the 13th European Conference on Software Maintenance and Reengineering, pages 189--198, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. B. Boehm. Software Engineering Economics. Prentice Hall, 1981. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. B. Fluri, M. Wursch, and H. Gall. Do code and comments co-evolve? on the relation between source code and comment changes. In Proceedings of the 14th Working Conference on Reverse Engineering, pages 70--79, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. Haiduc and A. Marcus. On the use of domain terms in source code. In Proceedings of the 16th International Conference on Program Comprehension, pages 113--122, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. H. Müller, J. Jahnke, D. Smith, M.-A. Storey, S. Tilley, and K. Wong. Reverse engineering: A roadmap. In Proceedings of the Future of Software Engineering, pages 47--60, June 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Toward an understanding of the relationship between the identifier and comment lexicons

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader