research-article

UCA: An Energy-efficient Hybrid Uncore Architecture in 3D Chip-Multiprocessors to minimize crosstalk

Authors:

Pooneh Safayenikoo,

Arghavan Asad,

Kaamran Raahemifar,

Mahmood FathyAuthors Info & Claims

NoCArc '16: Proceedings of the 9th International Workshop on Network on Chip Architectures

Pages 39 - 44

https://doi.org/10.1145/2994133.2994136

Published: 15 October 2016 Publication History

Get Access

Abstract

With technology scaling, the number of uncore components increases on a chip in Chip-Multiprocessors (CMPs). As the number of cores increases, power consumption becomes the main concern in Network on Chip (NoC) and Last Level Cache (LLC). Emerging technologies, such as three-dimensional integrated circuits (3D ICs) and non-volatile memories (NVMs) are among the newest solutions to the design of dark-silicon-aware multi/many-core systems. In on-chip interconnection networks, components must be activated for each access, consequently the energy of NoC increases. Although NVMs have many advantages like low leakage and high density, they suffer from shortcomings such as the limited number of write operations and long write operation latency and high energy. In this paper, we propose a new architecture called Uncore-Coding Architecture (UCA) to simultaneously target the short lifetime of NVM LLC and the crosstalk problem of Through-Silicon-Vias (TSVs). This architecture identifies frequent values at runtime in order to encode these values using limited weight codes and therefore reduce the number of bit flips to minimize energy and crosstalk in NoC. Furthermore, this encoding can also improve the life of NVMs integrated into the LLC. Experimental results show that the proposed method improves energy by about 30% on average under PARSEC workloads execution. Moreover, this technique provides Average Memory Access Time approximately, on average, equal to the conventional methods with SRAM cache technology under PARSEC workloads execution.

References

[1]

Esmaeilzadeh, H., Blem, E., Amant, R.S., Sankaralingam, K., and Burger, D. (2011). Dark silicon and the end of multicore scaling. In Computer Architecture (ISCA), 38th Annual International Symposium on (IEEE).

Digital Library

Google Scholar

[2]

Pavlidis, V.F., and Friedman, E.G. (2010). Three-dimensional integrated circuit design (Morgan Kaufmann).

Digital Library

Google Scholar

[3]

Wang, W., and Mishra, P. (2012). System-wide leakage-aware energy minimization using dynamic voltage scaling and cache reconfiguration in multitasking systems. IEEE Transactions on Very Large Scale Integration (VLSI) Systems 20, 902--910.

Digital Library

Google Scholar

[4]

Muthyam, M. (2004). A bus encoding technique for power and crosstalk minimization.

Google Scholar

[5]

Zou, Q., Niu, D., Cao, Y., and Xie, Y. (2014). 3dlat: Tsvbased 3d ICs crosstalk minimization utilizing less adjacent transition code. In 19th Asia and South Pacific Design Automation Conference (ASP-DAC) (IEEE), pp. 762--767.

Crossref

Google Scholar

[6]

Zhan, J., Poremba, M., Xu, Y., and Xie, Y. (2014). NoΔ: Leveraging delta compression for end-to-end memory access in NoC based multicores. In 19th Asia and South Pacific Design Automation Conference (ASP-DAC) (IEEE), pp. 586--591.

Google Scholar

[7]

Das, S., Aamodt, T.M., and Dally, W.J. (2015). SLIP: reducing wire energy in the memory hierarchy. In Proceedings of the 42nd Annual International Symposium on Computer Architecture (ACM), pp. 349--361.

Digital Library

Google Scholar

[8]

Chang, Y.-Y., Huang, Y.S.-C., Narayanan, V., and King, C.-T. (2013). ShieldUS: A novel design of dynamic shielding for eliminating 3D TSV crosstalk coupling noise. In Design Automation Conference (ASP-DAC), 18th Asia and South Pacific (IEEE), pp. 675--680.

Google Scholar

[9]

Dong, X., Xu, C., Jouppi, N., and Xie, Y. (2014). NVSim: A circuit-level performance, energy, and area model for emerging non-volatile memory. In Emerging Memory Technologies (Springer), pp. 15--50.

Google Scholar

[10]

Muralimanohar, N., Balasubramonian, R., and Jouppi, N.P. (2009). CACTI 6.0: A tool to model large caches. HP Laboratories, 22--31.

Google Scholar

[11]

Gebhart, M., Hestness, J., Fatehi, E., Gratz, P., and Keckler, S.W. (2009). Running PARSEC 2.1 on M5. The University of Texas at Austin, Department of Computer Science, Tech

Google Scholar

[12]

Yang, J., and Gupta, R. (2002). Energy efficient frequent value data cache design. In Microarchitecture, (MICRO-35). Proceedings. 35th Annual IEEE/ACM International Symposium on (IEEE), pp. 197--207.

Digital Library

Google Scholar

[13]

Binkert, N., Beckmann, B., Black, G., Reinhardt, S.K., Saidi, A., Basu, A., Hestness, J., Hower, D.R., Krishna, T., and Sardashti, S. (2011). The gem5 simulator. ACM SIGARCH Computer Architecture News 39, 1--7.

Digital Library

Google Scholar

[14]

Li, S., Ahn, J.H., Strong, R.D., Brockman, J.B., Tullsen, D.M., and Jouppi, N.P. (2009). McPAT: an integrated power, area, and timing modeling framework for multicore and manycore architectures. In Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture (ACM), pp. 469--480.

Digital Library

Google Scholar

Cited By

View all

Safayenikoo PAsad AFathy MMohammadi F(2018)NIZCache: Energy-efficient Non-uniform Cache Architecture for Chip-multiprocessors Based on Invalid and Zero Lines2018 IEEE International Symposium on Circuits and Systems (ISCAS)10.1109/ISCAS.2018.8351128(1-5)Online publication date: 2018
https://doi.org/10.1109/ISCAS.2018.8351128
Safayenikoo PAsad AMohammadi F(2018)An Energy-Efficient Cache Architecture for Chip-Multiprocessors Based on Non-Uniformity Accesses2018 IEEE Canadian Conference on Electrical & Computer Engineering (CCECE)10.1109/CCECE.2018.8447736(1-4)Online publication date: May-2018
https://doi.org/10.1109/CCECE.2018.8447736
Safayenikoo PAsad AFathy MMohammadi F(2017)Exploiting non-uniformity of write accesses for designing a high-endurance hybrid Last Level Cache in 3D CMPs2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE)10.1109/CCECE.2017.7946727(1-5)Online publication date: Apr-2017
https://doi.org/10.1109/CCECE.2017.7946727

Recommendations

Exploiting Heterogeneity in Cache Hierarchy in Dark-Silicon 3D Chip Multi-processors
DSD '15: Proceedings of the 2015 Euromicro Conference on Digital System Design

Technology scaling has enabled increasing number of cores on a chip in Chip-Multiprocessors (CMPs). As the number of cores increases, the overall system will need to provide more cache resources to feed all the cores. However, increasing the size of ...
Write activity reduction on non-volatile main memories for embedded chip multiprocessors

Recent advances in circuit and semiconductor technologies have pushed Non-Volatile Memory (NVM) technologies into a new era. These technologies exhibit appealing properties such as low power consumption, non-volatility, shock-resistivity, and high ...
Lighting the Dark-Silicon 3D Chip Multi-processors by Exploiting Heterogeneity in Cache Hierarchy
MCSOC '15: Proceedings of the 2015 IEEE 9th International Symposium on Embedded Multicore/Many-core Systems-on-Chip

This paper addresses a set of design paradigms by exploiting device and architectural heterogeneity to mitigate the dark silicon. We exploit Non-Volatile Memory (NVM) as potential replacements to conventional caches. Also, we study the problem of ...

Comments

Information & Contributors

Information

Published In

NoCArc '16: Proceedings of the 9th International Workshop on Network on Chip Architectures

October 2016

56 pages

ISBN:9781450347921

DOI:10.1145/2994133

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2016

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

NoCArc'16

NoCArc'16: 9th International Workshop on Network on Chip Architectures

October 15, 2016

Taipei, Taiwan

Acceptance Rates

NoCArc '16 Paper Acceptance Rate 8 of 20 submissions, 40%;

Overall Acceptance Rate 46 of 122 submissions, 38%

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
157
Total Downloads

Downloads (Last 12 months)3
Downloads (Last 6 weeks)0

Reflects downloads up to 20 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Safayenikoo PAsad AFathy MMohammadi F(2018)NIZCache: Energy-efficient Non-uniform Cache Architecture for Chip-multiprocessors Based on Invalid and Zero Lines2018 IEEE International Symposium on Circuits and Systems (ISCAS)10.1109/ISCAS.2018.8351128(1-5)Online publication date: 2018
https://doi.org/10.1109/ISCAS.2018.8351128
Safayenikoo PAsad AMohammadi F(2018)An Energy-Efficient Cache Architecture for Chip-Multiprocessors Based on Non-Uniformity Accesses2018 IEEE Canadian Conference on Electrical & Computer Engineering (CCECE)10.1109/CCECE.2018.8447736(1-4)Online publication date: May-2018
https://doi.org/10.1109/CCECE.2018.8447736
Safayenikoo PAsad AFathy MMohammadi F(2017)Exploiting non-uniformity of write accesses for designing a high-endurance hybrid Last Level Cache in 3D CMPs2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE)10.1109/CCECE.2017.7946727(1-5)Online publication date: Apr-2017
https://doi.org/10.1109/CCECE.2017.7946727

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Recommendations

Exploiting Heterogeneity in Cache Hierarchy in Dark-Silicon 3D Chip Multi-processors

Write activity reduction on non-volatile main memories for embedded chip multiprocessors

Lighting the Dark-Silicon 3D Chip Multi-processors by Exploiting Heterogeneity in Cache Hierarchy

Comments

Information

Published In

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations