skip to main content
article

Linear predictive coding and cepstrum coefficients for mining time variant information from software repositories

Published:17 May 2005Publication History
Skip Abstract Section

Abstract

This paper presents an approach to recover time variant information from software repositories. It is widely accepted that software evolves due to factors such as defect removal, market opportunity or adding new features. Software evolution details are stored in software repositories which often contain the changes history. On the other hand there is a lack of approaches, technologies and methods to efficiently extract and represent time dependent information. Disciplines such as signal and image processing or speech recognition adopt frequency domain representations to mitigate differences of signals evolving in time. Inspired by time-frequency duality, this paper proposes the use of Linear Predictive Coding (LPC) and Cepstrum coefficients to model time varying software artifact histories. LPC or Cepstrum allow obtaining very compact representations with linear complexity. These representations can be used to highlight components and artifacts evolved in the same way or with very similar evolution patterns. To assess the proposed approach we applied LPC and Cepstral analysis to 211 Linux kernel releases (i.e., from 1.0 to 1.3.100), to identify files with very similar size histories. The approach, the preliminary results and the lesson learned are presented in this paper.

References

  1. Boehm, B. W. Software Engineering Echonomics. Prentice-Hall Inc., Englewood Cliffs, N.J., 1981. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Boehm, B., Clark, B., Horowitz, E., Westland, C., Madachy, R., and Selby, R. Cost Models for Future Software Life Cycle Processes: COCOMO 2.0. Annals of Software Engineering. vol. 1, 1987, 57--94.Google ScholarGoogle ScholarCross RefCross Ref
  3. Hastings, T. E., and Sajeev, A. S. M. A Vector-Based Approach to Software Size Measurement and Effort Estimation. IEEE Transactions on Software Enginnering, vol. 27, no. 4, 2001, 337--350. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Itakura F., Minimum prediction residual principle applied to speech recognition, IEEE Trans. Acoustics, Speech, and Signal Processing. vol. 23, pp.67--72, Feb. 1975Google ScholarGoogle ScholarCross RefCross Ref
  5. Kalpakis K., Gada D., and Puttagunta V., "Distance Measures for Effective Clustering of ARIMA Time-Series". In Proc. of the 2001 IEEE International Conference on Data Mining (ICDM'01), San Jose, CA, November 29-December 2, 2001, pp. 273--280. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Lindvall, M. Monitoring and Measuring the Change-Prediction Process at Different Granularity Levels: An Empirical Study. Software Process Improvement and Practice, no. 4, 1998, 3--10.Google ScholarGoogle ScholarCross RefCross Ref
  7. Markel, J. D. and Gray Jr, A. H. Linear Prediction of Speech. Springer-Verlag, New York, 1976. Google ScholarGoogle ScholarCross RefCross Ref
  8. Myers C. S. and Rabiner L. R. A comparative study of several dynamic time-warping algorithms for connected word recognition. The Bell System Technical Journal, 60(7):1389--1409, September 1981Google ScholarGoogle ScholarCross RefCross Ref
  9. Mockus A., Weiss D. M., Zhang P. Understanding and Predicting effort In Software Projects. Proc. of the 25th International Conference On Software Engineering, 2003, 274--284 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Nesi, P. Managing Object Oriented Projects Better, IEEE Software, vol. 15, no.4. 1998, 50--60. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Oppenheim A. V. and Schafer R. W., "From Frequency to Quefrency: A History of the Cepstrum", IEEE Signal Processing Magazine, September 2004.Google ScholarGoogle ScholarCross RefCross Ref
  12. Papamichalis, P. E. Practical Approaches to Speech Coding. Prentice Hall, Englewood Cliffs, NJ, 1987 Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Rabiner, L. R. and Juang B. H. Fundamentals of Speech Recognition. Prentice Hall, Englewood Cliffs, NJ, 1993 Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Ramil, J. F. Algorithmic Cost Estimation Software Evolution. Proceding of Int. Conference on Software Engineeringr, Limerick, Ireland, IEEE CS Press, 2000, 701--703. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Wu, Q. Z., Jou, I. C., Lee, S. Y., Online Signature Verification Using LPC Cepstrum and Neural Networks, IEEE Transactions on Systems, Man, and Cybernetics (27), No. 1, February 1997, pp. 148--153. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Linear predictive coding and cepstrum coefficients for mining time variant information from software repositories

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM SIGSOFT Software Engineering Notes
        ACM SIGSOFT Software Engineering Notes  Volume 30, Issue 4
        July 2005
        1514 pages
        ISSN:0163-5948
        DOI:10.1145/1082983
        Issue’s Table of Contents
        • cover image ACM Other conferences
          MSR '05: Proceedings of the 2005 international workshop on Mining software repositories
          May 2005
          109 pages
          ISBN:1595931236
          DOI:10.1145/1083142

        Copyright © 2005 Copyright is held by the owner/author(s)

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 17 May 2005

        Check for updates

        Qualifiers

        • article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader