Unit Test Data Generation for C Using Rule-Directed Symbolic Execution

Zhang, Ming-Zhe; Gong, Yun-Zhan; Wang, Ya-Wen; Jin, Da-Hai

doi:10.1007/s11390-019-1935-7

Unit Test Data Generation for C Using Rule-Directed Symbolic Execution

Regular Paper
Published: 10 May 2019

Volume 34, pages 670–689, (2019)
Cite this article

Journal of Computer Science and Technology Aims and scope Submit manuscript

Ming-Zhe Zhang¹,
Yun-Zhan Gong¹,
Ya-Wen Wang¹ &
…
Da-Hai Jin¹

122 Accesses
4 Citations
Explore all metrics

Abstract

Unit testing is widely used in software development. One important activity in unit testing is automatic test data generation. Constraint-based test data generation is a technique for automatic generation of test data, which uses symbolic execution to generate constraints. Unit testing only tests functions instead of the whole program, where individual functions typically have preconditions imposed on their inputs. Conventional symbolic execution cannot detect these preconditions, let alone converting these preconditions into constraints. To overcome these limitations, we propose a novel unit test data generation approach using rule-directed symbolic execution for dealing with functions with missing input preconditions. Rule-directed symbolic execution uses predefined rules to detect preconditions in the individual function, and generates constraints for inputs based on preconditions. We introduce implicit constraints to represent preconditions, and unify implicit constraints and program constraints into integrated constraints. Test data generated based on integrated constraints can explore previously unreachable code and help developers find more functional faults and logical faults. We have implemented our approach in a tool called CTS-IC, and applied it to real-world projects. The experimental results show that rule-directed symbolic execution can find preconditions (implicit constraints) automatically from an individual function. Moreover, the unit test data generated by our approach achieves higher coverage than similar tools and efficiently mitigates missing input preconditions problems in unit testing for individual functions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Generating Test Suites with Augmented Dynamic Symbolic Execution

Towards Efficient Data-Flow Test Data Generation

A Framework for Guided Test Case Generation in Constraint Logic Programming

References

Li B, Vendome C, Linares-Vásquez M, Poshyvanyk D, Kraft N A. Automatically documenting unit test cases. In Proc. IEEE International Conference on the Software Testing, Verification and Validation, April 2016, pp.341-352.
Lam W, Srisakaokul S, Bassett B, Mahdian P, Xie T, Tillmann N, de Halleux J. Parameterized unit testing in the open source wild. Technical Report, IDEALS, 2015. http://hdl.handle.net/2142/88374, Dec. 2018.
Zhang B, Hill E, Clause J. Towards automatically generating descriptive names for unit tests. In Proc. the 31st IEEE/ACM International Conference on Automated Software Engineering, September 2016, pp.625-636.
Yoshida H, Tokumoto S, Prasad M R, Ghosh I, Uehara T. FSX: Fine-grained incremental unit test generation for C/C++ programs. In Proc. the 25th International Symposium on Software Testing and Analysis, July 2016, pp.106-117.
DeMilli R A, Offutt A J. Constraint-based automatic test data generation. IEEE Transactions on Software Engineering, 1991, 17(9): 900-910.
Article Google Scholar
Boonstoppel P, Cadar C, Engler D. RWset: Attacking path explosion in constraint-based test generation. In Proc. the 14th International Conference on Tools and Algorithms for the Construction and Analysis of Systems, March 2008, pp.351-366.
Boyer R S, Elspas B, Levitt K N. SELECT — A formal system for testing and debugging programs by symbolic execution. ACM SIGPLAN Notices, 1975, 10(6): 234-245.
Article Google Scholar
Cadar C, Sen K. Symbolic execution for software testing: Three decades later. Communications of the ACM, 2013, 56(2): 82-90.
Article Google Scholar
Cadar C, Dunbar D, Engler D R. KLEE: Unassisted and automatic generation of high-coverage tests for complex systems programs. In Proc. the 8th USENIX Symposium on Operating Systems Design and Implementation, December 2008, pp.209-224.
Ganesh V, Dill D L. A decision procedure for bit-vectors and arrays. In Proc. the 19th International Conference on Computer Aided Verification, July 2007, pp.519-531.
de Moura L, Bjørner N. Z3: An efficient SMT solver. In Proc. the 14th International Conference on Tools and Algorithms for the Construction and Analysis of Systems, March 2008, pp.337-340.
Tillmann N, de Halleux J. Pex-white box test generation for .NET. In Proc. the 2nd International Conference on Tests and Proofs, April 2008, pp.134-153.
Cadar C, Ganesh V, Pawlowski P M, Dill D L, Engler D R. EXE: Automatically generating inputs of death. ACM Transactions on Information and System Security, 2008, 12(2): Article No. 10.
Engler D R, Dunbar D. Under-constrained execution: Making automatic code destruction easy and scalable. In Proc. the 2007 ACM/SIGSOFT International Symposium on Software Testing and Analysis, July 2007, pp.1-4.
Burnim J, Sen K. Heuristics for scalable dynamic test generation. In Proc. the 23rd IEEE/ACM International Conference on Automated Software Engineering, September 2008, pp.443-446.
Hutchins M, Foster H, Goradia T, Ostrand T. Experiments of the effectiveness of dataflow- and controlflow-based test adequacy criteria. In Proc. the 16th International Conference on Software Engineering, May 1994, pp.191-200.
Xing Y, Gong Y Z, Wang Y W, Zhang X Z. Branch and bound framework for automatic test case generation. SCIENTIA SINICA Informationis, 2014, 44(10): 1345-1360.
Google Scholar
Kernighan B W, Ritchie D M. The C Programming Language (2nd edition). Prentice hall, 1988.
Zhang X Z, Gong Y Z, Wang YW, Xing Y, Zhang M Z. Automated string constraints solving for programs containing string manipulation functions. Journal of Computer Science and Technology, 2017, 32(6): 1125-1135.
Article MathSciNet Google Scholar
Aho A V, Sethi R, Ullman J D. Compilers: Principles, Techniques, and Tools. Addison-Wesley, 1986.
Lin M X, Chen Y L, Yu K, Wu G S. Lazy symbolic execution for test data generation. IET Software, 2011, 5(2): 132-141.
Article Google Scholar
Li G, Ghosh I. Lazy symbolic execution through abstraction and sub-space search. In Proc. the 9th International Haifa Verification Conference on Hardware and Software: Verification and Testing, November 2013, pp.295-310.
Brack-Bernsen L, Hunger H. On the “Atypical Astronomical Cuneiform Text E”: A mean-value scheme for predicting lunar latitude. Archiv fur Orientforschung, 2005, 51: 96-107.
Google Scholar
Arcuri A, Iqbal M Z, Briand L. Formal analysis of the effectiveness and predictability of random testing. In Proc. the 19th International Symposium on Software Testing and Analysis, July 2010, pp.219-230.
Sen K, Marinov D, Agha G. Cute: A concolic unit testing engine for C. In Proc. the 10th European Software Engineering Conference, September 2005, pp.263-272.
Godefroid P, Levin M Y, Molnar D. SAGE: Whitebox fuzzing for security testing. Communications of the ACM, 2012, 55(3): 40-44.
Article Google Scholar
Yoshida H, Li G, Kamiya T, Ghosh I, Rajan S, Tokumoto S, Munakata K, Uehara T. KLOVER: Automatic test generation for C and C++ programs, using symbolic execution. IEEE Software, 2017, 34(5): 30-37.
Article Google Scholar
Ramos D A, Engler D R. Under-constrained symbolic execution: Correctness checking for real code. In Proc. the 24th USENIX Security Symposium, August 2015, pp.49-64.
Nori A V, Rajamani S K. An empirical study of optimizations in YOGI. In Proc. the 32nd ACM/IEEE International Conference on Software Engineering, Volume 1, May 2010, pp.355-364.
Zhang D, Liu D, Lei Y, Kung D, Csallner C, Wang W. Detecting vulnerabilities in C programs using trace-based testing. In Proc. the 2010 IEEE/IFIP International Conference on Dependable Systems Networks, June 2010, pp.241-250.
Li H, Kim T, Bat-Erdene M, Lee H. Software vulnerability detection using backward trace analysis and symbolic execution. In Proc. the 2013 International Conference on Availability, Reliability and Security, September 2013, pp.446-454.
Kim Y, Kim Y, Kim T, Lee G, Jang Y, Kim M. Automated unit testing of large industrial embedded software using concolic testing. In Proc. the 28th IEEE/ACM International Conference on Automated Software Engineering, November 2013, pp.519-528.

Download references

Author information

Authors and Affiliations

State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, 100876, China
Ming-Zhe Zhang, Yun-Zhan Gong, Ya-Wen Wang & Da-Hai Jin

Authors

Ming-Zhe Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yun-Zhan Gong
View author publications
You can also search for this author in PubMed Google Scholar
Ya-Wen Wang
View author publications
You can also search for this author in PubMed Google Scholar
Da-Hai Jin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ya-Wen Wang.

Electronic supplementary material

ESM 1

(PDF 748 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, MZ., Gong, YZ., Wang, YW. et al. Unit Test Data Generation for C Using Rule-Directed Symbolic Execution. J. Comput. Sci. Technol. 34, 670–689 (2019). https://doi.org/10.1007/s11390-019-1935-7

Download citation

Received: 06 July 2018
Revised: 19 March 2019
Published: 10 May 2019
Issue Date: May 2019
DOI: https://doi.org/10.1007/s11390-019-1935-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Unit Test Data Generation for C Using Rule-Directed Symbolic Execution

Abstract

Access this article

Similar content being viewed by others

Generating Test Suites with Augmented Dynamic Symbolic Execution

Towards Efficient Data-Flow Test Data Generation

A Framework for Guided Test Case Generation in Constraint Logic Programming

References

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

ESM 1

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Unit Test Data Generation for C Using Rule-Directed Symbolic Execution

Abstract

Access this article

Similar content being viewed by others

Generating Test Suites with Augmented Dynamic Symbolic Execution

Towards Efficient Data-Flow Test Data Generation

A Framework for Guided Test Case Generation in Constraint Logic Programming

References

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

ESM 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation