skip to main content
10.1145/3299902.3309753acmconferencesArticle/Chapter ViewAbstractPublication PagesispdConference Proceedingsconference-collections
research-article

Graceful Register Clustering by Effective Mean Shift Algorithm for Power and Timing Balancing

Published: 04 April 2019 Publication History

Abstract

As the wide adoption of FinFET technology in mass production, dynamic power becomes the bottleneck to achieving low power. Therefore, clock power reduction is crucial in modern IC design. Register clustering can effectively save clock power because of significantly reducing the number of clock sinks and register pin capacitance, clock routed wirelength, and the number of clock buffers. In this paper, we propose effective mean shift to naturally form clusters according to register distribution without placement disruption. Effective mean shift fulfills the requirements to be a good register clustering algorithm because it needs no prespecified number of clusters, is insensitive to initializations, is robust to outliers, is tolerant of various register distributions, is efficient and scalable, and balances clock power reduction against timing degradation. Experimental results show that our approach outperforms state-of-the-art work on power and timing balancing, as well as efficiency and scalability.

References

[1]
A. Ranjan. 2015. Micro-architectural exploration for low power design. (November 2015). Semiconductor Engineering. Retrieved from https://semiengineering.com/micro-architectural-exploration-for-low-power-design/
[2]
K. Brock. 2016. Six ways to exploit the advantages of finFETs. (November 2016.). Tech Design Forum. Retrieved from http://www.techdesignforums.com/practice/technique/six-ways-to-exploit-the-advantages-of-finfets/
[3]
L. Rizzatti. 2015. Dynamic power estimation hits limits of SoC designs. (May 2015). Retrieved from https://www.eetimes.com/author.asp'section_id=36&doc_id=1326542
[4]
S. I. Ward, N. Viswanathan, N. Y. Zhou, C. C. N. Sze, Z. Li, C. J. Alpert, and D. Z. Pan. 2013. Clock power minimization using structured latch templates and decision tree induction. In Proc. Int'l Conf. on Computer-Aided Design (ICCAD '13). IEEE, Piscataway, NJ, USA, 599--606.
[5]
D. A. Papa, C. J. Alpert, C. C. N. Sze, Z. Li, N. Viswanathan, G.-J. Nam, I. L. Markov. 2011. Physical synthesis with clock-network optimization for large systems on chips. IEEE Micro 31, 4 (July 2011), 51--62.
[6]
M. P.-H. Lin, C. C. Hsu and Y.-T. Chang. 2011. Post-placement power optimization with multi-bit flip-flops. IEEE Trans. on CAD of Integrated Circuits and Systems (TCAD) 30, 12 (December 2011), 1870--1882.
[7]
I. H.-R. Jiang, C.-L. Chang, and Y.-M. Yang. 2012. INTEGRA: Fast multibit flip-flop clustering for clock power saving. IEEE Trans. on CAD of Integrated Circuits and Systems (TCAD) 31, 2 (February 2012), 192--204. Also see in Proc. Int'l Symp. on Physical Design (ISPD '11). ACM, New York, NY, 115--121.
[8]
S.-H. Wang, Y.-Y. Liang, T.-Y. Kuo, and W.-K. Mak. 2012. Power-driven flip-flop merging and relocation. IEEE Trans. on CAD of Integrated Circuits and Systems (TCAD) 31, 2 (February 2012), 180--191. Also see in Proc. Int'l Symp. on Physical Design (ISPD '11). ACM, New York, NY, 107--114.
[9]
S. S.-Y. Liu, W.-T. Lo, C.-J. Lee, and H.-M. Chen. 2013. Agglomerative-based flip-flop merging and relocation for signal wirelength and clock tree optimization. ACM Trans. Design Automation Electronic Systems (TODAES) 18, 3, Article 40 (July 2013), 20 pages.
[10]
C.-C. Tsai, Y. Shi, G. Luo, and I. H.-R. Jiang. 2013. FF-Bond: Multi-bit flip-flop bonding at placement. In Proc. Int'l Symp. on Physical Design (ISPD '13). ACM, New York, NY, 147--153.
[11]
G. Wu, Y. Xu, D. Wu, M. Ragupathy, Y.-Y. Mo, and C. Chu. 2016. Flip-flop clustering by weighted K-means algorithm. 2016. In Proc. Design Automation Conf. (DAC '16). ACM, New York, NY, Article 82, 6 pages.
[12]
A. B. Kahng, J. Li, and L. Wang. 2016. Improved flop tray-based design implementation for power reduction. In Proc. Int'l Conf. on Computer-Aided Design (ICCAD '16). ACM, New York, NY, Article 20, 8 pages.
[13]
I. Seitanidis, G. Dimitrakopoulos, P. M. Mattheakis, L. Masse-Navette, D. Chinnery. 2018. Timing-driven and placement-aware multi-bit register composition. IEEE Trans. on CAD of Integrated Circuits and Systems (TCAD), early access. Also see in Proc. Design Automation Conf. (DAC '17). ACM, New York, NY, Article 56, 6 pages.
[14]
S. P. Lloyd. 1982. Least square quantization in PCM. IEEE Trans. Information Theory 28, 2 (March 1982), 129--137.
[15]
K. Fukunaga and L.D. Hostetler. 1975. The estimation of the gradient of a density function, with applications in pattern recognition. IEEE Trans. Information Theory 21 (January 1975), 32--40.
[16]
Y. Cheng. 1995. Mean shift, mode seeking, and clustering. IEEE Trans. Pattern Analysis Machine Intelligence (TPAMI) 17, 8 (August 1995), 790--799.
[17]
D. Comaniciu and P. Meer. 2002. Mean shift: A robust approach toward feature space analysis. IEEE Trans. Pattern Analysis Machine Intelligence (TPAMI) 24, 5 (May 2002), 603--619.
[18]
D. Comaniciu, V. Ramesh, and P. Meer. 2001. The variable bandwidth mean shift and data-driven scale selection. In Proc. Int'l Conf. on Computer Vision (ICCV '01). IEEE, Piscataway, NJ, USA, 438--445.
[19]
D. Gale and L. S. Shapley. 1962. College admissions and the stability of marriage. American Mathematical Monthly 69 (1962), 9--14.
[20]
M.-C. Kim, J. Hu, J. Li, and N. Viswanathan. 2015. ICCAD-2015 CAD contest in incremental timing-driven placement and benchmark suite. In Proc. Int'l Conf. on Computer-Aided Design (ICCAD '15). IEEE, Piscataway, NJ, USA, 921--926.
[21]
Innovus, Cadence, Inc.

Cited By

View all
  • (2025)Design and Utilization of Multiskewed Multibit Flip-Flop Cells for Timing Optimization: Design and Technology Co-Optimization ApproachIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2024.345783444:3(1084-1097)Online publication date: Mar-2025
  • (2025)Graceful Register Clustering and Rebanking for Power and Timing BalancingIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2024.344823344:3(1070-1083)Online publication date: Mar-2025
  • (2025)DG-RePlAce: A Dataflow-Driven GPU-Accelerated Analytical Global Placement Framework for Machine Learning AcceleratorsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2024.343652144:2(696-708)Online publication date: Feb-2025
  • Show More Cited By

Index Terms

  1. Graceful Register Clustering by Effective Mean Shift Algorithm for Power and Timing Balancing

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ISPD '19: Proceedings of the 2019 International Symposium on Physical Design
    April 2019
    164 pages
    ISBN:9781450362535
    DOI:10.1145/3299902
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 April 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. clock power
    2. clustering
    3. mean shift
    4. register clustering
    5. timing

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    ISPD '19
    Sponsor:
    ISPD '19: International Symposium on Physical Design
    April 14 - 17, 2019
    CA, San Francisco, USA

    Acceptance Rates

    ISPD '19 Paper Acceptance Rate 12 of 25 submissions, 48%;
    Overall Acceptance Rate 62 of 172 submissions, 36%

    Upcoming Conference

    ISPD '25
    International Symposium on Physical Design
    March 16 - 19, 2025
    Austin , TX , USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)221
    • Downloads (Last 6 weeks)12
    Reflects downloads up to 28 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)Design and Utilization of Multiskewed Multibit Flip-Flop Cells for Timing Optimization: Design and Technology Co-Optimization ApproachIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2024.345783444:3(1084-1097)Online publication date: Mar-2025
    • (2025)Graceful Register Clustering and Rebanking for Power and Timing BalancingIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2024.344823344:3(1070-1083)Online publication date: Mar-2025
    • (2025)DG-RePlAce: A Dataflow-Driven GPU-Accelerated Analytical Global Placement Framework for Machine Learning AcceleratorsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2024.343652144:2(696-708)Online publication date: Feb-2025
    • (2024)Flip-Flop Centric Incremental Placement for Simultaneous Timing and Clock Network Power OptimizationProceedings of the 2024 ACM/IEEE International Symposium on Machine Learning for CAD10.1145/3670474.3685949(1-7)Online publication date: 9-Sep-2024
    • (2024)Slack Redistributed Register Clustering with Mixed-Driving Strength Multi-bit Flip-FlopsProceedings of the 2024 International Symposium on Physical Design10.1145/3626184.3633327(21-29)Online publication date: 12-Mar-2024
    • (2024)Flip-Flop Centric Incremental Placement for Simultaneous Timing and Clock Network Power Optimization2024 ACM/IEEE 6th Symposium on Machine Learning for CAD (MLCAD)10.1109/MLCAD62225.2024.10740200(1-7)Online publication date: 9-Sep-2024
    • (2022)Generation of Mixed-Driving Multi-Bit Flip-Flops for Power OptimizationProceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design10.1145/3508352.3549473(1-9)Online publication date: 30-Oct-2022
    • (2020)Latch clustering for timing-power co-optimizationProceedings of the 57th ACM/EDAC/IEEE Design Automation Conference10.5555/3437539.3437769(1-6)Online publication date: 20-Jul-2020
    • (2020)Latch Clustering for Timing-Power Co-Optimization2020 57th ACM/IEEE Design Automation Conference (DAC)10.1109/DAC18072.2020.9218617(1-6)Online publication date: Jul-2020

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media