Exploiting Intra-function Correlation with the Global History Stack

Gao, Fei; Sair, Suleyman

doi:10.1007/11512622_19

Fei Gao²⁰ &
Suleyman Sair²⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3553))

Included in the following conference series:

International Workshop on Embedded Computer Systems

Abstract

The demand for more computation power in high-end embedded systems has put embedded processors on parallel evolution track as the RISC processors. Caches and deeper pipelines are standard features on recent embedded microprocessors. As a result of this, some of the performance penalties associated with branch instructions in RISC processors are becoming more prevalent in these processors. As is the case in RISC architectures, designers have turned to dynamic branch prediction to alleviate this problem. Global correlating branch predictors take advantage of the influence past branches have on future ones. The conditional branch outcomes are recorded in a global history register (GHR). Based on the hypothesis that most correlation is among intra-function branches, we provide a detailed analysis of the Global History Stack (GHS) in this paper. The GHS saves the global history in the return address stack when a call instruction is executed. Following the subsequent return, the history is restored from the stack. In addition, to preserve the correlation between the callee branches and the caller branches following the call instruction, we save a few of the history bits coming from the end of the callee’s execution. We also investigate saving the GHR of a function in the Branch Target Buffer (BTB) when it returns so that it can be restored when that function is called again. Our results show that these techniques improve the accuracy of several global history based prediction schemes by 4% on average. Consequently, performance improvements as high as 13% are attained.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Implementation and comparison of bi-modal dynamic branch prediction with static branch prediction schemes

Article 11 March 2021

An efficient branch predictor for improved accuracy of instruction level parallelism

Article 06 April 2021

Hardware-Based Sequential Consistency Violation Detection Made Simpler

References

ARM Ltd.: ARM1136 Technical Reference Manual, Version r0p2 (2004), http://www.arm.com
Intel Corp.: The Intel(R) XScale(TM) Microarchitecture Technical Summary (2000), http://www.intel.com/design/intelxscale/
Analog Devices Inc.: Analog Devices Blackfin Processor Data Sheet (2005), http://www.analog.com/processors/processors/blackfin/
Seznec, A., Felix, S., Krishnan, V., Sazeides, Y.: Design tradeoffs for the Alpha EV8 conditional branch predictor. In: Proc. Ann. Int. Symp. Comput. Architecture, pp. 295–306 (2002)
Google Scholar
Michaud, P., Seznec, A., Uhlig, R.: Trading conflict and capacity aliasing in conditional branch predictors. In: Proc. Ann. Int. Symp. Comput. Architecture (1997)
Google Scholar
Seznec, A., Michaud, P.: De-aliased hybrid branch predictors. Technical Report RR-3618, Inria (1999)
Google Scholar
Sprangle, E., Chappell, R., Alsup, M., Patt, Y.: The agree predictor: A mechanism for reducing negative branch history interference. In: Proc. Ann. Int. Symp. Comput. Architecture, pp. 284–291 (1997)
Google Scholar
Eden, A.N., Mudge, T.: The YAGS branch prediction scheme. In: Proc. Ann. ACM/IEEE Int. Symp. Microarchitecture, pp. 69–77 (1998)
Google Scholar
Lee, C.C., Chen, I.C., Mudge, T.N.: The bi-mode branch predictor. In: Proc. Ann. ACM/IEEE Int. Symp. Microarchitecture, pp. 4–13. Research Triangle Park, NC (1997)
Google Scholar
Calder, B., Grunwald, D., Zorn, B.: Quantifying behavioral differences between C and C++ programs. Journal of Programming Languages 2 (1994)
Google Scholar
Yeh, T.Y., Patt, Y.: Two-level adaptive branch prediction. In: Proc. Ann. Int. Symp. Microarchitecture (1991)
Google Scholar
McFarling, S.: Combining branch predictors. Technical Report TN-36, Digital Equipment Corporation, Western Research Lab (1993)
Google Scholar
Sechrest, S., Lee, C.C., Mudge, T.: Correlation and aliasing in dynamic branch predictors. In: Proc. Ann. Int. Symp. Comput. Architecture, pp. 22–32 (1996)
Google Scholar
Evers, M., Patel, S.J., Chappell, R.S., Patt, Y.N.: An analysis of correlation and predictability: What makes two-level branch predictors work. In: Proc. Ann. Int. Symp. Comput. Architecture, Barcelona, Spain, pp. 52–61 (1998)
Google Scholar
Thomas, R., Franklin, M., Wilkerson, C., Stark, J.: Improving branch prediction by dynamic dataflow-based identification of correlated branches from a large global history. In: Ann. Int. Symp. Comput. Architecture, San Diego, CA, pp. 314–323 (2003)
Google Scholar
Yeh, T.Y., Patt, Y.: Alternative implementations of two-level adaptive branch prediction. In: Proc. Ann. Int. Symp. Comput. Architecture (1992)
Google Scholar
Nair, R.: Dynamic path-based branch correlation. In: Proc. Ann. Int. Symp. Microarchitecture, pp. 15–23 (1995)
Google Scholar
Jacobson, Q., Rotenberg, E., Smith, J.E.: Path-based next trace prediction. In: Proc. Int. Symp. Microarchitecture, pp. 14–23 (1997)
Google Scholar
Burger, D.C., Austin, T.M.: The SimpleScalar tool set, version 2.0. Technical Report CS-TR-97-1342, U. of Wisconsin, Madison, WI (1997)
Google Scholar
Shivakumar, P., Jouppi, N.P.: Cacti 3.0: An integrated cache timing, power, and area model. Technical Report (2001)
Google Scholar
Stark, J., Evers, M., Patt, Y.N.: Variable length path branch prediction. In: Proc. Int. Conf. Architectural Support for Programming Languages and Operating Systems, pp. 170–179 (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, NC State University, Raleigh, NC, 27695, USA
Fei Gao & Suleyman Sair

Authors

Fei Gao
View author publications
You can also search for this author in PubMed Google Scholar
Suleyman Sair
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Systems, Tampere University of Technology, P.O. Box 553, FI-33101, Tampere, Finland
Timo D. Hämäläinen
Computer Systems Architecture Group, University of Amsterdam, The Netherlands
Andy D. Pimentel
Tampere University of Technology, Korkeakoulunkatu 1, 33720, Tampere, Finland
Jarmo Takala
Computer Engineering Lab, TUDelft., Postbus 5031, 2600, Delft, GA, The Netherlands
Stamatis Vassiliadis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gao, F., Sair, S. (2005). Exploiting Intra-function Correlation with the Global History Stack. In: Hämäläinen, T.D., Pimentel, A.D., Takala, J., Vassiliadis, S. (eds) Embedded Computer Systems: Architectures, Modeling, and Simulation. SAMOS 2005. Lecture Notes in Computer Science, vol 3553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11512622_19

Download citation

DOI: https://doi.org/10.1007/11512622_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26969-4
Online ISBN: 978-3-540-31664-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Exploiting Intra-function Correlation with the Global History Stack

Abstract

Access this chapter

Preview

Similar content being viewed by others

Implementation and comparison of bi-modal dynamic branch prediction with static branch prediction schemes

An efficient branch predictor for improved accuracy of instruction level parallelism

Hardware-Based Sequential Consistency Violation Detection Made Simpler

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Exploiting Intra-function Correlation with the Global History Stack

Abstract

Access this chapter

Preview

Similar content being viewed by others

Implementation and comparison of bi-modal dynamic branch prediction with static branch prediction schemes

An efficient branch predictor for improved accuracy of instruction level parallelism

Hardware-Based Sequential Consistency Violation Detection Made Simpler

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation