Article

On the latency, energy and area of checkpointed, superscalar register alias tables

Authors:
Elham Safi

University of Toronto

University of Toronto
View Profile

,
Patrick Akl

University of Toronto

University of Toronto
View Profile

,
Andreas Moshovos

University of Toronto

University of Toronto
View Profile

,
Andreas Veneris

University of Toronto

University of Toronto
View Profile

,
Aggeliki Arapoyianni

University of Athens

University of Athens
View Profile

ISLPED '07: Proceedings of the 2007 international symposium on Low power electronics and designAugust 2007Pages 379–382https://doi.org/10.1145/1283780.1283863

Published:27 August 2007Publication History

ISLPED '07: Proceedings of the 2007 international symposium on Low power electronics and design

Pages 379–382

ABSTRACT

We present two full-custom implementations of the Register Alias Table (RAT) for a 4-way superscalar dynamically-scheduled processor in a commercial 130nm CMOS technology. The implementations differ in the way they organize the embedded global checkpoints (GCs) which support speculative execution. In the first implementation, representative of early designs, the GCs are organized as shift registers. In the second implementation, representative of more recent proposals, the GCs are organized as random access buffers. We measure the impact of increasing thenumber of GCs on the latency, energy, and area of the RAT. The results support the importance of recent techniques that reduce the number of GCs while maintaining performance.

References

H. Akkary, et al., "An Analysis of Resource Efficient Checkpoint Architecture", ACM Transaction on Architecture and Code Optimization, 1(4), Dec. 2004. Google ScholarDigital Library
B. Bishop, et al., "The Design of a Register Renaming unit", Great Lakes Symposium on VLSI, Mar. 1999 Google ScholarDigital Library
A. De Gloria, et al. , "An Application Specific Multi-Port RAM Cell Circuit for Register Renaming Units in High Speed Microprocessors", IEEE International Symposium on Circuits and Systems, 4:934--937, May 2001.Google Scholar
R. Heald et al., "A Third-Generation SPARC V9 64-b Microprocessor", IEEE Journal of Solid-State Circuits, 35(11) : 1526--1538, Nov. 2000.Google ScholarCross Ref
R. K. Krishnamurthy, et al., "130-nm 6-GHz 256 - 32 bit Leakage-Tolerant Register File", IEEE Journal of Solid-State Circuits,37(5): 624--632, May 2002Google ScholarCross Ref
G. Kuçuk, et.al," Reducing Power Dissipation of Register Alias Tables in High-Performance Processors, IEE Proceedings on Computers and Digital Techniques, 152(6): 739--746, Nov. 2005.Google ScholarCross Ref
A. Moshovos, "Checkpointing Alternatives for High Performance, Power-Aware Processors", IEEE International Symposium on Low Power Electronic and Design, 318--321, Aug. 2003 Google ScholarDigital Library
S. Palacharla, "Complexity-effective Superscalar Processors", Ph.D. Thesis, University of Wisconsin-Madison, 1998. Google ScholarDigital Library
R. Sangireddy, "Reducing Rename Logic Complexity for High-Speed and Low-Power Front-End Architectures", IEEE Transactions of Computers, 55(6):672--685, Jun. 2006. Google ScholarDigital Library
D. Tarjan, S. Thoziyoor and N. P. Jouppi, CACTI 4.0, HP Labs Technical Report HPL-2006-86, 2006.Google Scholar
K. C. Yeager, "The MIPS R10000 Superscalar Microprocessor", IEEE MICRO, 1996. Google ScholarDigital Library
V. Zyuban, "Inherently Lower-Power High-Performance Superscalar Architectures", PhD Thesis, University of Notre Dame, Jan. 2000. Google ScholarDigital Library

Index Terms

On the latency, energy and area of checkpointed, superscalar register alias tables
1. Computer systems organization
  1. Architectures
2. Hardware
  1. Integrated circuits
  2. Very large scale integration design

Recommendations

A physical level study and optimization of CAM-based checkpointed register alias table
ISLPED '08: Proceedings of the 2008 international symposium on Low Power Electronics & Design

Using full-custom layouts in 130 nm technology, this work studies how the latency and energy of a checkpointed, CAM-based Register Alias Table (cRAT) vary as a function of the window size, the issue width, and the number of embedded global checkpoints (...
Read More
On the latency and energy of checkpointed superscalar register alias tables

This paper investigates how the latency and energy of register alias tables (RATs) vary as a function of the number of global checkpoints (GCs), processor issue width, and window size. It improves upon previous RAT checkpointing work that ignored the ...
Read More
Virtual register renaming
ARCS'13: Proceedings of the 26th international conference on Architecture of Computing Systems

This paper presents a novel high performance substrate for building energy-efficient out-of-order superscalar cores. The architecture does not require a reorder buffer or physical registers for register renaming and instruction retirement. Instead, it ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ISLPED '07: Proceedings of the 2007 international symposium on Low power electronics and design
August 2007
432 pages
ISBN:9781595937094
DOI:10.1145/1283780
General Chairs:
Diana Marculescu
Carnegie Mellon University
,
Anand Raghunathan
NEC Laboratories America
,
Program Chairs:
Ali Keshavarzi
Intel Corporation
,
Vijaykrishnan Narayanan
Penn State University
Copyright © 2007 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 27 August 2007
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
checkpointing
energy
latency
register renaming
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate398of1,159submissions,34%
Upcoming Conference
ISLPED '24

Sponsor:

sigda

ACM/IEEE International Symposium on Low Power Electronics and Design

August 5 - 7, 2024

Newport Beach , CA , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 10
  Total Citations
  View Citations
- 133
  Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

On the latency, energy and area of checkpointed, superscalar register alias tables

ISLPED '07: Proceedings of the 2007 international symposium on Low power electronics and design

ABSTRACT

References

Cited By

Index Terms

Recommendations

A physical level study and optimization of CAM-based checkpointed register alias table

On the latency and energy of checkpointed superscalar register alias tables

Virtual register renaming