Research Article
DOI: 10.1145/3609437.3609449

Drift: Fine-Grained Prediction of the Co-Evolution of Production and Test Code via Machine Learning

Published: 05 October 2023

Abstract

As production code evolves, test code can quickly become outdated. Outdated test code may fail to catch errors in the programs under test, allowing serious software bugs to slip through and cause significant losses for both developers and users. To ensure high software quality, it is therefore crucial to update test code promptly after changing the production code, so that the two evolve together and the software remains reliable. However, maintaining test code is challenging and time-consuming. To automate the identification of outdated test code, recent research has proposed Sitar, a machine learning-based method. Despite its usefulness, Sitar has major limitations: its prediction granularity is coarse (at the class level), it relies on naming conventions to discover test code, and it depends on manually summarized features to construct its machine learning models.
In this paper, we address the limitations of Sitar and propose a new machine learning-based approach, Drift. Drift predicts outdated test cases at the method level. It leverages method-calling relationships to accurately infer the links between production and test code, and it automatically learns features via code analysis. We evaluate Drift on 40 open-source Java projects in both within-project and cross-project scenarios and find that it achieves satisfactory prediction performance in both. We also compare Drift with existing methods for outdated test code prediction and find that it significantly outperforms them. For example, compared with Sitar, Drift improves accuracy by about 8.5% and F1-score by about 8.3%, and, more importantly, it reduces the number of test cases that developers need to check by about 75%. Drift can therefore predict outdated test cases more accurately and at a finer granularity, better facilitating the co-evolution of production and test code.
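To make the call-based linking concrete, here is a minimal, hypothetical Java sketch (the Calculator and CalculatorTest names are illustrative and not taken from the paper) of how a test method is tied to a production method through the call it makes, rather than through a naming convention:

    import org.junit.Assert;
    import org.junit.Test;

    // Production code under test.
    class Calculator {
        int add(int a, int b) {
            return a + b;
        }
    }

    public class CalculatorTest {
        // The test name deliberately avoids a "testAdd"-style convention,
        // which a convention-based linker could miss.
        @Test
        public void verifiesSum() {
            Calculator c = new Calculator();
            // This invocation is the method-calling relationship that links
            // CalculatorTest.verifiesSum to Calculator.add, regardless of
            // how either method is named.
            Assert.assertEquals(5, c.add(2, 3));
        }
    }

If Calculator.add were modified in a commit while verifiesSum was left unchanged, an approach like Drift could flag verifiesSum as potentially outdated.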



Published In

Internetware '23: Proceedings of the 14th Asia-Pacific Symposium on Internetware
August 2023, 332 pages
ISBN: 9798400708947
DOI: 10.1145/3609437

Publisher

Association for Computing Machinery, New York, NY, United States

Author Tags

1. Machine Learning
2. Outdated Test Code
3. Software Evolution

Conference

Internetware 2023

Acceptance Rates

Overall Acceptance Rate: 55 of 111 submissions, 50%
