short-paper

MUBench: a benchmark for API-misuse detectors

Authors:

Mira MeziniAuthors Info & Claims

MSR '16: Proceedings of the 13th International Conference on Mining Software Repositories

Pages 464 - 467

https://doi.org/10.1145/2901739.2903506

Published: 14 May 2016 Publication History

Get Access

Abstract

Over the last few years, researchers proposed a multitude of automated bug-detection approaches that mine a class of bugs that we call API misuses. Evaluations on a variety of software products show both the omnipresence of such misuses and the ability of the approaches to detect them.

This work presents MuBench, a dataset of 89 API misuses that we collected from 33 real-world projects and a survey. With the dataset we empirically analyze the prevalence of API misuses compared to other types of bugs, finding that they are rare, but almost always cause crashes. Furthermore, we discuss how to use it to benchmark and compare API-misuse detectors.

References

[1]

C. Cifuentes, C. Hoermann, N. Keynes, L. Li, S. Long, E. Mealy, M. Mounteney, and B. Scholz. BegBunch: Benchmarking for C Bug Detection Tools. DEFECTS'09, pages 16--20. ACM, 2009.

Digital Library

Google Scholar

[2]

V. Dallmeier and T. Zimmermann. Extraction of Bug Localization Benchmarks from History. ASE'07, pages 433--436. ACM, 2007.

Digital Library

Google Scholar

[3]

M. Egele, D. Brumley, Y. Fratantonio, and C. Kruegel. An Empirical Study of Cryptographic Misuse in Android Applications. CCS'13, pages 73--84. ACM, 2013.

Digital Library

Google Scholar

[4]

Q. Gao, H. Zhang, J. Wang, Y. Xiong, L. Zhang, and H. Mei. Fixing Recurring Crash Bugs via Analyzing Q&A Sites. ASE'15, pages 307--318, 2015.

Digital Library

Google Scholar

[5]

M. Georgiev, S. Iyengar, S. Jana, R. Anubhai, D. Boneh, and V. Shmatikov. The Most Dangerous Code in the World: Validating SSL Certificates in Non-browser Software. CCS'12, pages 38--49. ACM, 2012.

Digital Library

Google Scholar

[6]

K. Herzig, S. Just, and A. Zeller. It's Not a Bug, It's a Feature: How Misclassification Impacts Bug Prediction. ICSE'13, pages 392--401. IEEE Press, 2013.

Digital Library

Google Scholar

[7]

R. Just, D. Jalali, and M. D. Ernst. Defects4J: A Database of Existing Faults to Enable Controlled Testing Studies for Java Programs. ISSTA'14, pages 437--440. ACM, 2014.

Digital Library

Google Scholar

[8]

D. Lazar, H. Chen, X. Wang, and N. Zeldovich. Why Does Cryptographic Software Fail?: A Case Study and Open Problems. APSys'14, pages 7:1--7:7. ACM, 2014.

Digital Library

Google Scholar

[9]

Z. Li and Y. Zhou. PR-Miner: Automatically Extracting Implicit Programming Rules and Detecting Violations in Large Software Code. ESEC/FSE'13, pages 306--315. ACM, 2005.

Digital Library

Google Scholar

[10]

S. Nadi, S. Krüger, M. Mezini, and E. Bodden. "Jumping Through Hoops": Why do Developers Struggle with Cryptography APIs? ICSE'16, 2016.

Digital Library

Google Scholar

[11]

H. A. Nguyen, T. T. Nguyen, N. H. Pham, J. Al-Kofahi, and T. N. Nguyen. Clone Management for Evolving Software. IEEE Trans. Softw. Eng., 38(5): 1008--1026, 2012.

Digital Library

Google Scholar

[12]

H. A. Nguyen, T. T. Nguyen, G. Wilson, Jr., A. T. Nguyen, M. Kim, and T. N. Nguyen. A Graph-based Approach to API Usage Adaptation. OOPSLA'10, pages 302--321. ACM, 2010.

Digital Library

Google Scholar

[13]

M. P. Robillard, E. Bodden, D. Kawrykow, M. Mezini, and T. Ratchford. Automated API Property Inference Techniques. IEEE Trans. Soft. Eng., 39:613--637, 2013.

Digital Library

Google Scholar

[14]

A. Wasylkowski and A. Zeller. Mining Temporal Specifications from Object Usage. ASE, 18(3):263--292, 2011.

Digital Library

Google Scholar

Cited By

View all

Firouzi EGhafari MEbrahimi M(2024)ChatGPT’s Potential in Cryptography Misuse Detection: A Comparative Analysis with Static Analysis ToolsProceedings of the 18th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement10.1145/3674805.3695408(582-588)Online publication date: 24-Oct-2024
https://dl.acm.org/doi/10.1145/3674805.3695408
Galappaththi ANadi STreude C(2024)An Empirical Study of API Misuses of Data-Centric LibrariesProceedings of the 18th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement10.1145/3674805.3686685(245-256)Online publication date: 24-Oct-2024
https://dl.acm.org/doi/10.1145/3674805.3686685
Ma YTian WGao XSun HLi LChristakis MPradel M(2024)API Misuse Detection via Probabilistic Graphical ModelProceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis10.1145/3650212.3652112(88-99)Online publication date: 11-Sep-2024
https://dl.acm.org/doi/10.1145/3650212.3652112
Show More Cited By

Index Terms

MUBench: a benchmark for API-misuse detectors
1. Software and its engineering
  1. Software creation and management
    1. Software post-development issues
    2. Software verification and validation
      1. Software defect analysis

Recommendations

Detecting API-Misuse Based on Pattern Mining via API Usage Graph with Parameters
Theoretical Aspects of Software Engineering
Abstract
API misuse is a common issue that can trigger software crashes, bugs, and vulnerabilities. To address this problem, researchers have proposed pattern-based violation detectors that automatically extract patterns from code. However, these detectors ...
Are Neural Bug Detectors Comparable to Software Developers on Variable Misuse Bugs?
ASE '22: Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering

Debugging, that is, identifying and fixing bugs in software, is a central part of software development. Developers are therefore often confronted with the task of deciding whether a given code snippet contains a bug, and if yes, where. Recently, data-...
Detect Related Bugs from Source Code Using Bug Information
COMPSAC '10: Proceedings of the 2010 IEEE 34th Annual Computer Software and Applications Conference

Open source projects often maintain open bug repositories during development and maintenance, and the reporters often point out straightly or implicitly the reasons why bugs occur when they submit them. The comments about a bug are very valuable for ...

Comments

Information & Contributors

Information

Published In

MSR '16: Proceedings of the 13th International Conference on Mining Software Repositories

May 2016

544 pages

ISBN:9781450341868

DOI:10.1145/2901739

General Chair:
Miryung Kim
University of California, Los Angeles
,
Program Chairs:
Romain Robbes
University of Chile, Chile
,
Christian Bird
Microsoft Research

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 May 2016

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Funding Sources

Conference

ICSE '16

Sponsor:

ACM
SIGSOFT
IEEE-CS\DATC
TCSE

ICSE '16: 38th International Conference on Software Engineering

May 14 - 22, 2016

Texas, Austin

Upcoming Conference

ICSE 2025

2025 IEEE/ACM 46th International Conference on Software Engineering

April 26 - May 3, 2025

Ottawa , ON , Canada

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

60
Total Citations
View Citations
730
Total Downloads

Downloads (Last 12 months)58
Downloads (Last 6 weeks)6

Reflects downloads up to 20 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Firouzi EGhafari MEbrahimi M(2024)ChatGPT’s Potential in Cryptography Misuse Detection: A Comparative Analysis with Static Analysis ToolsProceedings of the 18th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement10.1145/3674805.3695408(582-588)Online publication date: 24-Oct-2024
https://dl.acm.org/doi/10.1145/3674805.3695408
Galappaththi ANadi STreude C(2024)An Empirical Study of API Misuses of Data-Centric LibrariesProceedings of the 18th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement10.1145/3674805.3686685(245-256)Online publication date: 24-Oct-2024
https://dl.acm.org/doi/10.1145/3674805.3686685
Ma YTian WGao XSun HLi LChristakis MPradel M(2024)API Misuse Detection via Probabilistic Graphical ModelProceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis10.1145/3650212.3652112(88-99)Online publication date: 11-Sep-2024
https://dl.acm.org/doi/10.1145/3650212.3652112
Li CZhang JTang YLi ZSun TSpinellis DConstantinou EBacchelli A(2024)Boosting API Misuse Detection via Integrating API Constraints from Multiple SourcesProceedings of the 21st International Conference on Mining Software Repositories10.1145/3643991.3644904(14-26)Online publication date: 15-Apr-2024
https://dl.acm.org/doi/10.1145/3643991.3644904
Nielebock SBlockhaus PKruger JOrtmeier FHuyen PTan SMechtaev SKhurshid S(2024)ASAP-Repair: API-Specific Automated Program Repair Based on API Usage GraphsProceedings of the 5th ACM/IEEE International Workshop on Automated Program Repair10.1145/3643788.3648011(1-4)Online publication date: 20-Apr-2024
https://dl.acm.org/doi/10.1145/3643788.3648011
Bockisch CEren DLehmann SNeufeld DTaentzer GEgyed AWimmer MChechik MCombemale B(2024)Mutation Testing of Java Bytecode: A Model-Driven ApproachProceedings of the ACM/IEEE 27th International Conference on Model Driven Engineering Languages and Systems10.1145/3640310.3674103(237-248)Online publication date: 22-Sep-2024
https://dl.acm.org/doi/10.1145/3640310.3674103
Cai YYadavally AMishra AMontejo GNguyen TRoychoudhury APaiva AAbreu RStorey M(2024)Programming Assistant for Exception Handling with CodeBERTProceedings of the IEEE/ACM 46th International Conference on Software Engineering10.1145/3597503.3639188(1-13)Online publication date: 20-May-2024
https://dl.acm.org/doi/10.1145/3597503.3639188
Wei MHarzevili NHuang YYang JWang JWang SRoychoudhury APaiva AAbreu RStorey M(2024)Demystifying and Detecting Misuses of Deep Learning APIsProceedings of the IEEE/ACM 46th International Conference on Software Engineering10.1145/3597503.3639177(1-12)Online publication date: 20-May-2024
https://dl.acm.org/doi/10.1145/3597503.3639177
Yang DLiu KLei YLi LXie HLiu CWang ZMao XBissyandé T(2024)Demystifying API misuses in deep learning applicationsEmpirical Software Engineering10.1007/s10664-023-10413-929:2Online publication date: 16-Feb-2024
https://doi.org/10.1007/s10664-023-10413-9
Baek HLee MKim H(2024)CryptoLLM: Harnessing the Power of LLMs to Detect Cryptographic API MisuseComputer Security – ESORICS 202410.1007/978-3-031-70879-4_18(353-373)Online publication date: 5-Sep-2024
https://doi.org/10.1007/978-3-031-70879-4_18
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Cited By

Index Terms

Recommendations

Detecting API-Misuse Based on Pattern Mining via API Usage Graph with Parameters

Are Neural Bug Detectors Comparable to Software Developers on Variable Misuse Bugs?

Detect Related Bugs from Source Code Using Bug Information