skip to main content
10.1145/3566097.3568347acmconferencesArticle/Chapter ViewAbstractPublication PagesaspdacConference Proceedingsconference-collections
invited-talk
Public Access

Learning Based Spatial Power Characterization and Full-Chip Power Estimation for Commercial TPUs

Published:31 January 2023Publication History

ABSTRACT

In this paper, we propose a novel approach for the real-time estimation of chip-level spatial power maps for commercial Google Coral M.2 TPU chips based on a machine-learning technique for the first time. The new method can enable the development of more robust runtime power and thermal control schemes to take advantage of spatial power information such as hot spots that are otherwise not available. Different from the existing commercial multi-core processors in which real-time performance-related utilization information is available, the TPU from Google does not have such information. To mitigate this problem, we propose to use features that are related to the workloads of running different deep neural networks (DNN) such as the hyperparameters of DNN and TPU resource information generated by the TPU compiler. The new approach involves the offline acquisition of accurate spatial and temporal temperature maps captured from an external infrared thermal imaging camera under nominal working conditions of a chip. To build the dynamic power density map model, we apply generative adversarial networks (GAN) based on the workload-related features. Our study shows that the estimated total powers match the manufacturer's total power measurements extremely well. Experimental results further show that the predictions of power maps are quite accurate, with the RMSE of only 4.98mW/mm2, or 2.6% of the full-scale error. The speed of deploying the proposed approach on an Intel Core i7-10710U is as fast as 6.9ms, which is suitable for real-time estimation.

References

  1. "Critical Reliability Challenges for The International Technology Roadmap for Semiconductors (ITRS)," 2003. In International Sematech Technology Transfer Document 03024377A-TR, 2003.Google ScholarGoogle Scholar
  2. H. Esmaeilzadeh, E. Blem, R. St. Amant, K. Sankaralingam, and D. Burger, "Dark silicon and the end of multicore scaling," Micro, IEEE, vol. 32, pp. 122--134, May 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Taylor, "A landscape of the new dark silicon design regime," IEEE/ACM International Symposium on Microarchitecture, vol. 33, pp. 8--19, October 2013.Google ScholarGoogle Scholar
  4. K. Skadron, M. R. Stan, W. Huang, S. Velusamy, K. Sankaranarayanan, and D. Tarjan, "Temperature-aware microarchitecture," in Proc. Intl. Symp. on Computer Architecture, 2006.Google ScholarGoogle Scholar
  5. J. Kong, S. W. Chung, and K. Skadron, "Recent thermal management techniques for microprocessors," ACM Comput. Surv., vol. 44, pp. 13:1--13:42, jun 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Sadiqbatcha, J. Zhang, H. Zhao, H. Amrouch, J. Hankel, and S. X.-D. Tan, "Post-silicon heat-source identification and machine-learning-based thermal modeling using infrared thermal imaging," IEEE Trans. on Computer-Aided Design of Integrated Circuits and Systems, 2020.Google ScholarGoogle Scholar
  7. R. Joseph and M. Martonosi, "Run-time power estimation in high-performance microprocessors," in Proc. Int. Symp. on Low Power Electronics and Design (ISLPED), pp. 135--140, 2001.Google ScholarGoogle Scholar
  8. C. Isci and M. Martonosi, "Runtime power monitoring in high-end processors: Methodology and empirical data," in Proceedings of MICRO, 2003.Google ScholarGoogle Scholar
  9. W. Wu, L. Jin, J. Yang, P. Liu, and S. X.-D. Tan, "Efficient power modeling and software thermal sensing for runtime temperature monitoring," ACM Trans. on Design Automation of Electronics Systems, vol. 12, no. 3, pp. 1--29, 2007.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. K. Dev, A. N. Nowroz, and S. Reda, "Power mapping and modeling of multi-core processors," in International Symposium on Low Power Electronics and Design (ISLPED), pp. 39--44, Sept 2013.Google ScholarGoogle Scholar
  11. X. Wang, S. Farsiu, P. Milanfar, and A. Shakouri, "Power trace: An efficient method for extracting the power dissipation profile in an ic chip from its temperature map," IEEE Transactions on Components and Packaging Technologies, vol. 32, no. 2, pp. 309--316, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  12. R. Cochran, A. N. Nowroz, and S. Reda, "Post-silicon power characterization using thermal infrared emissions," in Proc. Int. Symp. on Low Power Electronics and Design (ISLPED), (New York, NY, USA), pp. 331--336, ACM, 2010.Google ScholarGoogle Scholar
  13. S. Paek, W. Shin, J. Sim, and L. Kim, "Powerfield: A probabilistic approach for temperature-to-power conversion based on markov random field theory," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 32, no. 10, pp. 1509--1519, 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Nowroz, G. Woods, and S. Reda, "Power mapping of integrated circuits using ac-based thermography," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 21, pp. 1398--1409, aug 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. F. Beneventi, A. Bartolini, P. Vivet, and L. Benini, "Thermal analysis and interpolation techniques for a logic+ wideio stacked dram test chip," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 35, no. 4, pp. 623--636, 2016.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. Reda, K. Dev, and A. Belouchrani, "Blind identification of thermal models and power sources from thermal measurements," IEEE Sensors Journal, vol. 18, pp. 680--691, Jan 2018.Google ScholarGoogle ScholarCross RefCross Ref
  17. J. Zhang, S. Sadiqbatcha, M. O'Dea, H. Amrouch, and S. X.-D. Tan, "Full-chip power density and thermal map characterization for commercial microprocessors under heat sink cooling," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, pp. 1--1, 2021.Google ScholarGoogle Scholar
  18. S. Sadiqbatcha, J. Zhang, H. Amrouch, and S. X.-D. Tan, "Real-time full-chip thermal tracking: A post-silicon, machine learning perspective," IEEE Transactions on Computers, 2021.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Intel, "Intel Performance Counter Monitor (PCM)." https://software.intel.com/en-us/articles/intel-performance-counter-monitor.Google ScholarGoogle Scholar
  20. J. Zhang, S. Sadiqbatcha, W. Jin, and S. X. . Tan, "Accurate power density map estimation for commercial multi-core microprocessors," in 2020 Design, Automation and Test in Europe Conference and Exhibition (DATE), pp. 1085--1090, 2020.Google ScholarGoogle Scholar
  21. H. Amrouch and J. Henkel, "Lucid infrared thermography of thermally-constrained processors," in 2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED), pp. 347--352, July 2015.Google ScholarGoogle Scholar
  22. S. Reda, K. Dev, and A. Belouchrani, "Blind identification of thermal models and power sources from thermal measurements," IEEE Sensors Journal, vol. 18, no. 2, pp. 680--691, 2018.Google ScholarGoogle ScholarCross RefCross Ref
  23. M. Abadi et al., "TensorFlow: Large-scale machine learning on heterogeneous systems," 2015. Software available from tensorflow.org.Google ScholarGoogle Scholar
  24. N. Ahmed, T. Natarajan, and K. R. Rao, "Discrete cosine transform," IEEE Transactions on Computers, vol. C-23, pp. 90--93, Jan 1974.Google ScholarGoogle Scholar
  25. "Edge TPU Compiler." Available from coral.ai/docs/edgetpu/compiler.Google ScholarGoogle Scholar
  26. I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, "Generative adversarial nets," in Advances in Neural Information Processing Systems 27 (Z. Ghahramani, M. Welling, C. Cortes, N. D. Lawrence, and K. Q. Weinberger, eds.), pp. 2672--2680, Curran Associates, Inc., 2014.Google ScholarGoogle Scholar
  27. M. Mirza and S. Osindero, "Conditional Generative Adversarial Nets," arXiv e-prints, p. arXiv:1411.1784, Nov. 2014.Google ScholarGoogle Scholar
  28. M. Arjovsky, S. Chintala, and L. Bottou, "Wasserstein GAN," arXiv e-prints, p. arXiv:1701.07875, Dec. 2017.Google ScholarGoogle Scholar

Index Terms

  1. Learning Based Spatial Power Characterization and Full-Chip Power Estimation for Commercial TPUs
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ASPDAC '23: Proceedings of the 28th Asia and South Pacific Design Automation Conference
          January 2023
          807 pages
          ISBN:9781450397834
          DOI:10.1145/3566097

          Copyright © 2023 Owner/Author

          Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 31 January 2023

          Check for updates

          Qualifiers

          • invited-talk

          Acceptance Rates

          ASPDAC '23 Paper Acceptance Rate102of328submissions,31%Overall Acceptance Rate466of1,454submissions,32%

          Upcoming Conference

          ASPDAC '25
        • Article Metrics

          • Downloads (Last 12 months)79
          • Downloads (Last 6 weeks)19

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader