SoK: A Tale of Reduction, Security, and Correctness - Evaluating Program Debloating Paradigms and Their Compositions

Ali, Muaz; Muzammil, Muhammad; Karim, Faraz; Naeem, Ayesha; Haroon, Rukhshan; Haris, Muhammad; Nadeem, Huzaifah; Sabir, Waseem; Shaon, Fahad; Zaffar, Fareed; Yegneswaran, Vinod; Gehani, Ashish; Rahaman, Sazzadur

doi:10.1007/978-3-031-51482-1_12

Muaz Ali¹¹,
Muhammad Muzammil¹²,
Faraz Karim¹³,
Ayesha Naeem¹⁴,
Rukhshan Haroon¹⁵,
Muhammad Haris¹⁶,
Huzaifah Nadeem¹⁷,
Waseem Sabir¹⁶,
Fahad Shaon¹⁸,
Fareed Zaffar¹⁶,
Vinod Yegneswaran¹⁹,
Ashish Gehani¹⁹ &
…
Sazzadur Rahaman¹¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14347))

Included in the following conference series:

European Symposium on Research in Computer Security

762 Accesses

Abstract

Automated software debloating of program source or binary code has tremendous potential to improve both application performance and security. Unfortunately, measuring and comparing the effectiveness of various debloating methods is challenging due to the absence of a universal benchmarking platform that can accommodate diverse approaches. In this paper, we first present ${\textsc {DebloatBench}}_{\textrm{A}}$ (Debloating benchmark for applications), an extensible and sustainable benchmarking platform that enables comparison of different research techniques. Then, we perform a holistic comparison of the techniques to assess the current progress.

In the current version, we integrated four software debloating research tools: Chisel, Occam, Razor, and Piece-wise. Each tool is representative of a different class of debloaters: program source, compiler intermediate representation, executable binary, and external library. Our evaluation revealed interesting insights (i.e., hidden and explicit tradeoffs) about existing techniques, which might inspire future research. For example, all the binaries produced by Occam and Piece-Wise were correct, while Chisel significantly outperformed others in binary size and Gadget class reductions. In a first-of-its-kind composition, we also combined multiple debloaters to debloat a single binary. Our performance evaluation showed that, in both ASLR-proof and Turing-complete gadget expressively cases, several compositions (e.g., Chisel-Occam, Chisel-Occam-Razor) significantly outperformed the best-performing single tool (i.e., Chisel).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

BinTrimmer: Towards Static Binary Debloating Through Abstract Interpretation

Soufflé: On Synthesis of Program Analyzers

Analyzing the Gadgets

Notes

1.
Some of the test cases are taken from Razor Benchmarks [37].

References

Busy box. https://busybox.net/
Depclean. https://github.com/castor-software/depclean
Ropgadget tool. https://github.com/JonathanSalwan/ROPgadget
Abubakar, M., Ahmad, A., Fonseca, P., Xu, D.: Shard: fine-grained kernel specialization with context-aware hardening. In: 28th USENIX Security Symposium (2019)
Google Scholar
Agadakos, I., Jin, D., Williams-King, D., Kemerlis, V.P., Portokalidis, G.: Nibbler: debloating binary shared libraries. In: ACSAC, pp. 70–83 (2019)
Google Scholar
Ahmad, A., Anwar, M., Sharif, H., Gehani, A., Zaffar, F.: Trimmer: context-specific code reduction. In: 37th IEEE/ACM Conference on Automated Software Engineering (ASE) (2022)
Google Scholar
Ahmad, A., et al.: Trimmer: an automated system for configuration-based software debloating. IEEE Trans. Softw. Eng. (TSE) 48(9) (2022)
Google Scholar
Alhanahnah, M., Jain, R., Rastogi, V., Jha, S., Reps, T.: Lightweight, multi-stage, compiler-assisted application specialization. In: 7th European Symposium on Security and Privacy. IEEE (2022)
Google Scholar
Azad, B.A., Laperdrix, P., Nikiforakis, N.: Less is more: quantifying the security benefits of debloating web applications. In: 28th USENIX Security Symposium (2019)
Google Scholar
Bessey, A., et al.: A few billion lines of code later: using static analysis to find bugs in the real world. Commun. ACM 53(2), 66–75 (2010)
Article Google Scholar
Bhattacharya, S., Rajamani, K., Gopinath, K., Gupta, M.: The interplay of software bloat, hardware energy proportionality and system bottlenecks. In: HotPower’11, pp. 1–5 (2011)
Google Scholar
Bierbaumer, B., Kirsch, J., Kittel, T., Francillon, A., Zarras, A.: Smashing the stack protector for fun and profit. In: Janczewski, L.J., Kutyłowski, M. (eds.) SEC 2018. IAICT, vol. 529, pp. 293–306. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-99828-2_21
Chapter Google Scholar
Biswas, P., Burow, N., Payer, M.: Code specialization through dynamic feature observation. In: Joshi, A., Carminati, B., Verma, R.M. (eds.) CODASPY ’21, pp. 257–268 (2021)
Google Scholar
Brown, M.D., Pande, S.: Is less really more? Towards better metrics for measuring security improvements realized through software debloating. In: 12th USENIX Workshop (CSET 19) (2019)
Google Scholar
Brown, M.D., Pruett, M., Bigelow, R., Mururu, G., Pande, S.: Not so fast: understanding and mitigating negative impacts of compiler optimizations on code reuse gadget sets. Proc. ACM Program. Lang. 5(OOPSLA) (2021)
Google Scholar
Bruce, B.R., Zhang, T., Arora, J., Xu, G.H., Kim, M.: JShrink: in-depth investigation into debloating modern Java applications. In: Devanbu, P., Cohen, M.B., Zimmermann, T. (eds.) ESEC/FSE, pp. 135–146. ACM (2020)
Google Scholar
Chaqfeh, M., Zaki, Y., Hu, J., Subramanian, L.: JScleaner: de-cluttering mobile webpages through Javascript cleanup. In: Huang, Y., King, I., Liu, T., van Steen, M. (eds.) WWW, pp. 763–773. ACM/IW3C2 (2020)
Google Scholar
Dewan, A., Rao, P., Sodhi, B., Kapur, R.: BloatLibD: detecting bloat libraries in Java applications. In: 16th Conference on the Evaluation of Novel Approaches to Software Engineering (2021)
Google Scholar
GuardSquare: Proguard. https://github.com/Guardsquare/proguard
Guo, P.J., Engler, D.R.: CDE: using system call interposition to automatically create portable software packages. In: Nieh, J., Waldspurger, C.A. (eds.) USENIX ATC (2011)
Google Scholar
Hassan, M., et al.: Evaluating container debloaters. In: IEEE Secure Development Conference, SecDev 2023, Atlanta, GA, USA, 18–20 October 2023. IEEE (2023)
Google Scholar
Heo, K., Lee, W., Pashakhanloo, P., Naik, M.: Effective program debloating via reinforcement learning. In: 2018 ACM CCS, pp. 380–394 (2018)
Google Scholar
Holzmann, G.J.: Code inflation. IEEE Softw. 32(2), 10–13 (2015)
Article MathSciNet Google Scholar
Homescu, A., Stewart, M., Larsen, P., Brunthaler, S., Franz, M.: Microgadgets: size does matter in Turing-Complete Return-Oriented programming. In: USENIX WOOT ’12 (2012)
Google Scholar
Javed, F., Afzal, M.K., Sharif, M., Kim, B.S.: Internet of things (IoT) operating systems support, networking technologies, applications, and challenges: a comparative review. IEEE CS &T 20(3), 2062–2100 (2018)
Google Scholar
Jiang, Y., Wu, D., Liu, P.: JRed: program customization and bloatware mitigation based on static analysis. In: IEEE COMPSAC, pp. 12–21 (2016)
Google Scholar
Jones, N.D.: An introduction to partial evaluation. ACM Comput. Surv. 28(3), 480–503 (1996)
Article Google Scholar
Kalhauge, C.G., Palsberg, J.: Logical bytecode reduction. In: ACM SIGPLAN PLDI, pp. 1003–1016. ACM (2021)
Google Scholar
Kuo, H., et al.: Multik: a framework for orchestrating multiple specialized kernels. CoRR abs/1903.06889 (2019)
Google Scholar
Kupoluyi, T., Chaqfeh, M., Varvello, M., Hashmi, W., Subramanian, L., Zaki, Y.: Muzeel: a dynamic Javascript analyzer for dead code elimination in today’s web. arXiv preprint arXiv:2106.08948 (2021)
Malecha, G., Gehani, A., Shankar, N.: Automated software winnowing. In: 30th ACM Symposium on Applied Computing (SAC) (2015)
Google Scholar
Martin, R.C.: The open-closed principle. More C++ Gems 19(96) (1996)
Google Scholar
Navas, J., Gehani, A.: OCCAMv2: combining static and dynamic analysis for effective and efficient whole program specialization. Commun. ACM 66(4) (2023)
Google Scholar
Necula, G.C., McPeak, S., Rahul, S.P., Weimer, W.: CIL: intermediate language and tools for analysis and transformation of C programs. In: Horspool, R.N. (ed.) Conference on Compiler Construction (2002)
Google Scholar
Obbink, N.G., Malavolta, I., Scoccia, G.L., Lago, P.: An extensible approach for taming the challenges of Javascript dead code elimination. In: Oliveto, R., Penta, M.D., Shepherd, D.C. (eds.) Conference on Software Analysis, Evolution and Reengineering (2018)
Google Scholar
Porter, C., Mururu, G., Barua, P., Pande, S.: Blankit library debloating: getting what you want instead of cutting what you don’t. In: ACM SIGPLAN PLDI, pp. 164–180 (2020)
Google Scholar
Qian, C., Hu, H., Alharthi, M., Chung, P.H., Kim, T., Lee, W.: Razor: a framework for post-deployment software debloating. In: USENIX Security (2019)
Google Scholar
Quach, A., Erinfolami, R., Demicco, D., Prakash, A.: A multi-OS cross-layer study of bloating in user programs, kernel and managed execution environments. In: Kim, T., Wang, C., Wu, D. (eds.) Workshop on Forming an Ecosystem Around Software Transformation (2017)
Google Scholar
Quach, A., Prakash, A., Yan, L.: Debloating software through piece-wise compilation and loading. In: USENIX Security, pp. 869–886 (2018)
Google Scholar
Ramanathan, M.K., Clapp, L., Barik, R., Sridharan, M.: Piranha: reducing feature flag debt at UBER. In: Rothermel, G., Bae, D. (eds.) ICSE-SEIP, pp. 221–230. ACM (2020)
Google Scholar
Rastogi, V., Davidson, D., Carli, L.D., Jha, S., McDaniel, P.D.: Cimplifier: automatically debloating containers. In: Bodden, E., Schäfer, W., van Deursen, A., Zisman, A. (eds.) European Software Engineering Conference/Foundations of Software Engineering (2017)
Google Scholar
Regehr, J., Chen, Y., Cuoq, P., Eide, E., Ellison, C., Yang, X.: Test-case reduction for C compiler bugs. In: ACM PLDI, pp. 335–346 (2012)
Google Scholar
Shacham, H.: The geometry of innocent flesh on the bone: return-into-libc without function calls (on the x86). In: Ning, P., di Vimercati, S.D.C., Syverson, P.F. (eds.) ACM CCS 2007, pp. 552–561. ACM (2007)
Google Scholar
Sharif, H., Abubakar, M., Gehani, A., Zaffar, F.: Trimmer: application specialization for code debloating. In: 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE) (2018)
Google Scholar
Smowton, C.S.: I/O Optimisation and elimination via partial evaluation. Technical report, UC, CL, December 2014
Google Scholar
Sun, C., Li, Y., Zhang, Q., Gu, T., Su, Z.: Perses: syntax-guided program reduction. In: ICSE 2018, pp. 361–371 (2018)
Google Scholar
Tip, F., Laffra, C., Sweeney, P.F., Streeter, D.: Practical experience with an application extractor for Java. SIGPLAN Not. 34(10), 292–305 (1999)
Google Scholar
Turcotte, A., Arteca, E., Mishra, A., Alimadadi, S., Tip, F.: Stubbifier: debloating dynamic server-side Javascript applications. CoRR abs/2110.14162 (2021)
Google Scholar
Vázquez, H.C., Bergel, A., Vidal, S.A., Pace, J.A.D., Marcos, C.A.: Slimming Javascript applications: an approach for removing unused functions from Javascript libraries. Inf. Softw. Technol. 107, 18–29 (2019)
Article Google Scholar
Wu, J., et al.: LightBlue: automatic profile-aware debloating of Bluetooth stacks. In: 30th USENIX Security Symposium (2021)
Google Scholar
Xin, Q., Kim, M., Zhang, Q., Orso, A.: Program debloating via stochastic optimization. In: ICSE-NIER ’20, pp. 65–68 (2020)
Google Scholar
Xin, Q., Kim, M., Zhang, Q., Orso, A.: Subdomain-based generality-aware debloating. In: 35th IEEE/ACM ASE (2020)
Google Scholar
Xu, G., Mitchell, N., Arnold, M., Rountev, A., Sevitsky, G.: Software bloat analysis: finding, removing, and preventing performance problems in modern large-scale object-oriented applications. In: FSE/SDP, pp. 421–426 (2010)
Google Scholar

Download references

Acknowledgements

This material is based upon work supported by the National Science Foundation (NSF) under Grant ACI-1440800 and the Office of Naval Research (ONR) under Contracts N68335-17-C-0558 and N00014-18-1-2660. Any opinions, findings, conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of NSF or ONR. We thank Muhammad Hassan, Abdullah Naveed, Talha Tahir, Muhammad Farrukh, and Ahsan Amin for their help in preparing and testing the large application suite.

Author information

Authors and Affiliations

University of Arizona, Tucson, USA
Muaz Ali & Sazzadur Rahaman
Stony Brook University, Stony Brook, USA
Muhammad Muzammil
Georgia Institute of Technology, Atlanta, USA
Faraz Karim
Boston University, Boston, USA
Ayesha Naeem
Tufts University, Medford, USA
Rukhshan Haroon
LUMS, Lahore, Pakistan
Muhammad Haris, Waseem Sabir & Fareed Zaffar
University of Pittsburgh, Pittsburgh, USA
Huzaifah Nadeem
Data Security Technologies, Richardson, USA
Fahad Shaon
SRI International, Menlo Park, USA
Vinod Yegneswaran & Ashish Gehani

Authors

Muaz Ali
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Muzammil
View author publications
You can also search for this author in PubMed Google Scholar
Faraz Karim
View author publications
You can also search for this author in PubMed Google Scholar
Ayesha Naeem
View author publications
You can also search for this author in PubMed Google Scholar
Rukhshan Haroon
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Haris
View author publications
You can also search for this author in PubMed Google Scholar
Huzaifah Nadeem
View author publications
You can also search for this author in PubMed Google Scholar
Waseem Sabir
View author publications
You can also search for this author in PubMed Google Scholar
Fahad Shaon
View author publications
You can also search for this author in PubMed Google Scholar
Fareed Zaffar
View author publications
You can also search for this author in PubMed Google Scholar
Vinod Yegneswaran
View author publications
You can also search for this author in PubMed Google Scholar
Ashish Gehani
View author publications
You can also search for this author in PubMed Google Scholar
Sazzadur Rahaman
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Muaz Ali .

Editor information

Editors and Affiliations

University of California, Irvine, CA, USA
Gene Tsudik
University of Padua, Padua, Italy
Mauro Conti
Delft University of Technology, Delft, The Netherlands
Kaitai Liang
Delft University of Technology, Delft, The Netherlands
Georgios Smaragdakis

Appendix

Table 3. The number of train and correctness testcases we used during our experiment for each of the input programs.

Full size table

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ali, M. et al. (2024). SoK: A Tale of Reduction, Security, and Correctness - Evaluating Program Debloating Paradigms and Their Compositions. In: Tsudik, G., Conti, M., Liang, K., Smaragdakis, G. (eds) Computer Security – ESORICS 2023. ESORICS 2023. Lecture Notes in Computer Science, vol 14347. Springer, Cham. https://doi.org/10.1007/978-3-031-51482-1_12

Download citation

DOI: https://doi.org/10.1007/978-3-031-51482-1_12
Published: 11 January 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-51481-4
Online ISBN: 978-3-031-51482-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

SoK: A Tale of Reduction, Security, and Correctness - Evaluating Program Debloating Paradigms and Their Compositions

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

BinTrimmer: Towards Static Binary Debloating Through Abstract Interpretation

Soufflé: On Synthesis of Program Analyzers

Analyzing the Gadgets

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Appendix

Appendix

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us