Method for Reducing Overhead of Shared Memory Access Instrumentation

Authors:
Qianyu Liu, Naijie Gu, Junjie Su

Department of Computer Science, University of Science and Technology of China, Hefei, Anhui, China

CSAE '19: Proceedings of the 3rd International Conference on Computer Science and Application Engineering, October 2019, Article No. 9, Pages 1–6. https://doi.org/10.1145/3331453.3361323

Published: 22 October 2019

ABSTRACT

Memory monitoring is crucial for understanding the memory access behavior of applications. In multithreaded programs especially, diagnosing concurrency bugs relies on tracking and analyzing accesses to shared memory. Instrumentation is widely used to obtain diagnostic information for runtime checks. However, instrumenting all memory accesses incurs a high performance overhead, slowing down a program's execution by an order of magnitude. This paper proposes a simple but novel method to address the performance degradation caused by instrumentation. It is based on the following key insight: thread-local stack accesses do not need to be tracked. Recognizing this redundancy in memory access instrumentation and runtime checks, the paper presents the IIMA (Is Interesting Memory Access) algorithm for instrumentation pruning. The algorithm is implemented on the LLVM infrastructure and evaluated on a range of well-designed test cases and open source benchmarks. The results show that the method aggressively reduces the number of instrumented memory accesses, especially at low compiler optimization levels, and in turn reduces the runtime overhead.
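
The key insight can be illustrated with a toy model. The sketch below is not the paper's IIMA algorithm or its LLVM implementation; it is a minimal illustration, assuming a simplified instruction list in which a stack slot counts as thread-local unless its address is stored to memory or passed to a call. The names `Inst`, `escaping_allocas`, `is_interesting` and `select_accesses` are hypothetical, and pointer arithmetic and aliasing are deliberately ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Inst:
    """Toy instruction (hypothetical, loosely modeled on LLVM IR)."""
    op: str           # "alloca", "load", "store" or "call"
    dest: str = ""    # name defined by the instruction, if any
    args: tuple = ()  # load: (ptr,)  store: (value, ptr)  call: arguments


def escaping_allocas(insts):
    """Stack slots whose address is stored somewhere or passed to a call."""
    allocas = {i.dest for i in insts if i.op == "alloca"}
    escaped = set()
    for i in insts:
        if i.op == "store" and i.args[0] in allocas:
            escaped.add(i.args[0])  # the address itself is written to memory
        elif i.op == "call":
            escaped.update(a for a in i.args if a in allocas)
    return escaped


def is_interesting(inst, allocas, escaped):
    """True if the access may touch shared memory and should be instrumented."""
    if inst.op not in ("load", "store"):
        return False
    ptr = inst.args[-1]  # the pointer operand is last for both load and store
    return not (ptr in allocas and ptr not in escaped)


def select_accesses(insts):
    """Indices of the memory accesses that remain after pruning."""
    allocas = {i.dest for i in insts if i.op == "alloca"}
    escaped = escaping_allocas(insts)
    return [n for n, i in enumerate(insts) if is_interesting(i, allocas, escaped)]


program = [
    Inst("alloca", "x"),
    Inst("alloca", "y"),
    Inst("store", args=("1", "x")),   # private stack slot: pruned
    Inst("load", "t0", ("x",)),       # private stack slot: pruned
    Inst("call", args=("y",)),        # address of y leaves the function
    Inst("store", args=("t0", "y")),  # y escaped: instrumented
    Inst("load", "t1", ("g",)),       # g is not a stack slot: instrumented
]

print(select_accesses(program))  # [5, 6]
```

In this toy program four memory accesses exist and two are pruned, which mirrors why unoptimized code, where most locals live in stack slots, would be expected to benefit most from this kind of filtering.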

Index Terms

1. General and reference
  1. Cross-computing tools and techniques
    1. Performance
2. Software and its engineering
  1. Software notations and tools
    1. Compilers
      1. Runtime environments

Published in
CSAE '19: Proceedings of the 3rd International Conference on Computer Science and Application Engineering
October 2019
942 pages
ISBN: 9781450362948
DOI: 10.1145/3331453
Conference Chair: Ali Emrouznejad
Program Chair: Zeshui Xu
Copyright © 2019 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
Instrumentation
LLVM
Memory access
Runtime overhead
Shared memory
multithreaded programs
Qualifiers
- research-article
- Research
- Refereed limited
Acceptance Rates
Overall Acceptance Rate: 368 of 770 submissions, 48%