Abstract
Benchmarks and evaluation are essential to the development of techniques and tools. However, few studies have evaluated model checkers on large-scale benchmarks, mainly because existing model checkers accept different input languages and building models requires intensive labor. In this study, we present a large-scale benchmark for evaluating model checkers that take concurrent models as input. The benchmark consists of 2318 models generated automatically from real-world message passing interface (MPI) programs. Inspection shows that the complexities of the models are well distributed, making the benchmark suitable for evaluating model checkers. Based on the benchmark, we evaluated five state-of-the-art model checkers, i.e., PAT, FDR, Spin, PRISM, and NuSMV, by verifying the deadlock freedom property. The evaluation results demonstrate the differences in capability and performance of these model checkers when verifying message passing programs.
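To illustrate the kind of check the benchmark exercises, the sketch below is a minimal explicit-state search for deadlock freedom over a synchronous message-passing model. It is an assumption-laden toy, not the authors' tool chain or any of the evaluated checkers: processes are hypothetical lists of send/recv actions that synchronize CSP-style on matching channels, a state is a tuple of program counters, and a deadlock is a non-terminated state with no enabled rendezvous.

```python
def enabled_moves(procs, state):
    """Return successor states reachable by one synchronous rendezvous:
    a process at ('send', ch) pairs with another at ('recv', ch)."""
    moves = []
    for i in range(len(procs)):
        for j in range(len(procs)):
            if i == j:
                continue
            pi, pj = state[i], state[j]
            if pi < len(procs[i]) and pj < len(procs[j]):
                ai, aj = procs[i][pi], procs[j][pj]
                if ai[0] == 'send' and aj[0] == 'recv' and ai[1] == aj[1]:
                    nxt = list(state)
                    nxt[i] += 1
                    nxt[j] += 1
                    moves.append(tuple(nxt))
    return moves

def find_deadlock(procs):
    """Exhaustive search of the state space; return a deadlocked
    state (stuck before all processes finished) or None."""
    init = tuple(0 for _ in procs)
    seen, frontier = {init}, [init]
    while frontier:
        state = frontier.pop()
        succs = enabled_moves(procs, state)
        terminated = all(pc == len(p) for pc, p in zip(state, procs))
        if not succs and not terminated:
            return state  # deadlock found
        for s in succs:
            if s not in seen:
                seen.add(s)
                frontier.append(s)
    return None  # deadlock-free

# Two processes that each wait to receive before sending: circular wait.
deadlocked = [
    [('recv', 'a'), ('send', 'b')],
    [('recv', 'b'), ('send', 'a')],
]
# Reordering one process breaks the cycle.
safe = [
    [('send', 'a'), ('recv', 'b')],
    [('recv', 'a'), ('send', 'b')],
]
```

Real checkers such as Spin or FDR apply the same reachability idea at scale, with partial-order reduction or symbolic encodings to tame the state explosion that a naive search like this one suffers.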
Acknowledgements
This work was supported by the National Key R&D Program of China (Grant No. 2017YFB1001802) and the National Natural Science Foundation of China (Grant Nos. 61472440, 61632015, 61690203, 61532007).
Cite this article
Hong, W., Chen, Z., Yu, H. et al. Evaluation of model checkers by verifying message passing programs. Sci. China Inf. Sci. 62, 200101 (2019). https://doi.org/10.1007/s11432-018-9825-3