Precise Regression Benchmarking with Random Effects: Improving Mono Benchmark Results

Kalibera, Tomas; Tuma, Petr

doi:10.1007/11777830_5

Tomas Kalibera¹⁸ &
Petr Tuma¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 4054))

Included in the following conference series:

European Performance Engineering Workshop

357 Accesses
14 Citations

Abstract

Benchmarking as a method of assessing software performance is known to suffer from random fluctuations that distort the observed performance. In this paper, we focus on the fluctuations caused by compilation. We show that the design of a benchmarking experiment must reflect the existence of the fluctuations if the performance observed during the experiment is to be representative of reality.

We present a new statistical model of a benchmark experiment that reflects the presence of the fluctuations in compilation, execution and measurement. The model describes the observed performance and makes it possible to calculate the optimum dimensions of the experiment that yield the best precision within a given amount of time.

Using a variety of benchmarks, we evaluate the model within the context of regression benchmarking. We show that the model significantly decreases the number of erroneously detected performance changes in regression benchmarking.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Following the Blind Seer – Creating Better Performance Models Using Less Information

Applying test case prioritization to software microbenchmarks

Article Open access 30 September 2021

Computer comparisons in the presence of performance variation

Article 29 December 2018

References

Smith, C.U., Williams, L.G.: Performance Solutions: A Practical Guide to Creating Responsive, Scalable Software. Addison–Wesley, Reading (2001)
Google Scholar
Kalibera, T., Bulej, L., Tuma, P.: Benchmark precision and random initial state. In: Proceedings of SPECTS 2005, SCS, pp. 853–862 (2005)
Google Scholar
Kalibera, T., Bulej, L., Tuma, P.: Automated detection of performance regressions: The Mono experience. In: MASCOTS, pp. 183–190. IEEE Computer Society, Los Alamitos (2005)
Google Scholar
Bulej, L., Kalibera, T., Tuma, P.: Repeated results analysis for middleware regression benchmarking. Performance Evaluation 60, 345–358 (2005)
Article Google Scholar
Lo, S.L., Grisby, D., Riddoch, D., Weatherall, J., Scott, D., Richardson, T., Carroll, E., Evers, D., Meerwald, C.: Free high performance orb. (2006), http://omniorb.sourceforge.net
Novell, Inc.: The Mono Project (2006), http://www.mono-project.com
ECMA: ECMA-335: Common Language Infrastructure (CLI). ECMA (2002)
Google Scholar
Distributed Systems Research Group: Mono regression benchmarking (2005), http://nenya.ms.mff.cuni.cz/projects/mono
Free Software Foundation: The gnu compiler collection (2006), http://gcc.gnu.org
Gu, D., Verbrugge, C., Gagnon, E.: Code layout as a source of noise in JVM performance. In: Component And Middleware Performance Workshop, OOPSLA 2004 (2004)
Google Scholar
Wasserman, L.: All of Statistics: A Concise Course in Statistical Inference. Springer, New York (2004)
MATH Google Scholar
Jain, R.: The Art of Computer Systems Performance Analysis: Techniques for Experimental Design, Measurement, Simulation, and Modeling. Wiley–Interscience, New York (1991)
MATH Google Scholar
Buble, A., Bulej, L., Tuma, P.: CORBA benchmarking: A course with hidden obstacles. In: IPDPS, p. 279. IEEE Computer Society, Los Alamitos (2003)
Google Scholar
DOC Group: TAO performance scoreboard (2006), http://www.dre.vanderbilt.edu/stats/performance.shtml
Prochazka, M., Madan, A., Vitek, J., Liu, W.: RTJBench: A Real-Time Java Benchmarking Framework. In: Component And Middleware Performance Workshop, OOPSLA 2004 (2004)
Google Scholar
Weisstein, E.W.: Mathworld–a wolfram web resource (2006), http://mathworld.wolfram.com

Download references

Author information

Authors and Affiliations

Distributed Systems Research Group, Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Malostranske nam. 25, 118 00, Prague, Czech Republic
Tomas Kalibera & Petr Tuma

Authors

Tomas Kalibera
View author publications
You can also search for this author in PubMed Google Scholar
Petr Tuma
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Informatica, Università di Torino, Corso Svizzera 185, 10149, Torino, Italy
András Horváth
Department of Telecommunications, Technical University of Budapest, 1521, Budapest, Hungary
Miklós Telek

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kalibera, T., Tuma, P. (2006). Precise Regression Benchmarking with Random Effects: Improving Mono Benchmark Results. In: Horváth, A., Telek, M. (eds) Formal Methods and Stochastic Models for Performance Evaluation. EPEW 2006. Lecture Notes in Computer Science, vol 4054. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11777830_5

Download citation

DOI: https://doi.org/10.1007/11777830_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-35362-1
Online ISBN: 978-3-540-35365-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics