Evolutionary Grammar-Based Fuzzing

Eberlein, Martin; Noller, Yannic; Vogel, Thomas; Grunske, Lars

doi:10.1007/978-3-030-59762-7_8

Part of the book series: Lecture Notes in Computer Science (LNPSE, volume 12420)

Included in the following conference series:

International Symposium on Search Based Software Engineering

Abstract

A fuzzer provides randomly generated inputs to a target program to expose erroneous behavior. To detect defects efficiently, generated inputs should conform to the structure of the input format; grammars can therefore be used to generate syntactically correct inputs. In this context, fuzzing can be guided by probabilities attached to competing rules in the grammar, leading to the idea of probabilistic grammar-based fuzzing. However, how to assign probabilities to individual grammar rules so as to effectively expose erroneous behavior in a particular system under test is an open research question. In this paper, we present EvoGFuzz, an evolutionary grammar-based fuzzing approach that optimizes these probabilities to generate test inputs that may be more likely to trigger exceptional behavior. The evaluation shows the effectiveness of EvoGFuzz in detecting defects compared to probabilistic grammar-based fuzzing (the baseline). Applied to ten real-world applications with common input formats (JSON, JavaScript, or CSS3), EvoGFuzz achieved a significantly larger median line coverage than the baseline for all subjects, by up to 48%. Moreover, EvoGFuzz exposed 11 unique defects, five of which were not detected by the baseline.
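The general idea of attaching probabilities to competing grammar rules and adjusting them over generations can be illustrated with a minimal sketch. This is not the EvoGFuzz implementation: the toy grammar, the `toy_target` program, the fitness function, the add-one smoothing, and every function name below are assumptions chosen only to make the loop concrete.

```python
import random

# Hypothetical toy grammar: nonterminal -> list of alternatives (symbol lists).
# Assumption: the first alternative of every nonterminal is non-recursive.
GRAMMAR = {
    "<value>": [["<number>"], ["[", "<value>", "]"],
                ["[", "<value>", ",", "<value>", "]"]],
    "<number>": [["0"], ["1"], ["42"]],
}

def uniform_probs(grammar):
    """Start with equal probability for all competing alternatives."""
    return {nt: [1.0 / len(alts)] * len(alts) for nt, alts in grammar.items()}

def generate(grammar, probs, rng, symbol="<value>", depth=0, max_depth=8, used=None):
    """Expand `symbol`; return (text, list of (nonterminal, alternative index))."""
    if used is None:
        used = []
    if symbol not in grammar:          # terminal symbol: emit as-is
        return symbol, used
    alts = grammar[symbol]
    if depth >= max_depth:             # force the non-recursive alternative
        idx = 0
    else:
        idx = rng.choices(range(len(alts)), weights=probs[symbol])[0]
    used.append((symbol, idx))
    parts = [generate(grammar, probs, rng, s, depth + 1, max_depth, used)[0]
             for s in alts[idx]]
    return "".join(parts), used

def toy_target(text):
    """Stand-in system under test: fails on bracket nesting deeper than 3."""
    depth = 0
    for ch in text:
        if ch == "[":
            depth += 1
            if depth > 3:
                raise ValueError("nesting too deep")
        elif ch == "]":
            depth -= 1

def fitness(text):
    """Assumed fitness: reward exceptions strongly, otherwise prefer longer inputs."""
    try:
        toy_target(text)
    except ValueError:
        return 1000 + len(text)
    return len(text)

def relearn(grammar, selected_uses):
    """Re-estimate probabilities from rule usage in the selected inputs."""
    counts = {nt: [1] * len(alts) for nt, alts in grammar.items()}  # add-one smoothing
    for used in selected_uses:
        for nt, idx in used:
            counts[nt][idx] += 1
    return {nt: [c / sum(cs) for c in cs] for nt, cs in counts.items()}

def mutate(probs, rng):
    """Replace one probability with a random value and renormalize that rule set."""
    new = {nt: list(ps) for nt, ps in probs.items()}
    nt = rng.choice(sorted(new))
    new[nt][rng.randrange(len(new[nt]))] = rng.uniform(0.05, 1.0)
    total = sum(new[nt])
    new[nt] = [p / total for p in new[nt]]
    return new

def evolve(grammar, generations=5, population=50, keep=10, seed=0):
    """Generate, score, select the fittest inputs, relearn, mutate; repeat."""
    rng = random.Random(seed)
    probs = uniform_probs(grammar)
    for _ in range(generations):
        samples = [generate(grammar, probs, rng) for _ in range(population)]
        samples.sort(key=lambda s: fitness(s[0]), reverse=True)
        probs = mutate(relearn(grammar, [u for _, u in samples[:keep]]), rng)
    return probs
```

Calling `evolve(GRAMMAR)` returns one probability list per nonterminal. In this toy setup the selection step favors inputs that make `toy_target` raise, so rule usage from those inputs feeds back into the next generation's probabilities; the mutation step keeps the search from settling on a single distribution.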

Notes

1. Data and code artifacts are available here: https://doi.org/10.5281/zenodo.3961374.

Author information

Authors and Affiliations

Software Engineering Group, Humboldt-Universität zu Berlin, Berlin, Germany
Martin Eberlein, Yannic Noller, Thomas Vogel & Lars Grunske

Corresponding author

Correspondence to Thomas Vogel.

Editor information

Editors and Affiliations

Monash University, Melbourne, VIC, Australia
Aldeida Aleti
Delft University of Technology, Delft, The Netherlands
Annibale Panichella

Cite this paper

Eberlein, M., Noller, Y., Vogel, T., Grunske, L. (2020). Evolutionary Grammar-Based Fuzzing. In: Aleti, A., Panichella, A. (eds) Search-Based Software Engineering. SSBSE 2020. Lecture Notes in Computer Science, vol 12420. Springer, Cham. https://doi.org/10.1007/978-3-030-59762-7_8

DOI: https://doi.org/10.1007/978-3-030-59762-7_8
Published: 30 September 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-59761-0
Online ISBN: 978-3-030-59762-7
eBook Packages: Computer Science, Computer Science (R0)
