skip to main content
10.1145/3297663.3310304acmconferencesArticle/Chapter ViewAbstractPublication PagesicpeConference Proceedingsconference-collections
research-article

A Study of Core Utilization and Residency in Heterogeneous Smart Phone Architectures

Published: 04 April 2019 Publication History

Abstract

In recent years, the smart phone platform has seen a rise in the number of cores and the use of heterogeneous clusters as in the Qualcomm Snapdragon, Apple A10 and the Samsung Exynos processors. This paper attempts to understand characteristics of mobile workloads, with measurements on heterogeneous multicore phone platforms with big and little cores. It answers questions such as the following: (i) Do smart phones need multiple cores of different types (eg: big or little)? (ii) Is it energy-efficient to operate with more cores (with less time) or fewer cores even if it might take longer? (iii)What are the best frequencies to operate the cores considering energy efficiency? (iv) Do mobile applications need out-of-order speculative execution cores with complex branch prediction? (v) Is IPC a good performance indicator for early design tradeoff evaluation while working on mobile processor design?
Using Geekbench and more than 3 dozen Android applications, and the Workload Automation tool from ARM, we measure core utilization, frequency residencies, and energy efficiency characteristics on two leading edge smart phones. Many characteristics of smartphone platforms are presented, and architectural implications of the observations as well as design considerations for future mobile processors are discussed. A key insight is that multiple big and complex cores are beneficial both from a performance as well as an energy point of view in certain scenarios. It is seen that 4 big cores are utilized during application launch and update phases of applications. Similarly, reboot using all 4 cores at maximum performance provides latency advantages. However, it consumes higher power and energy, and reboot with 2 cores was seen to be more energy efficient than reboot with 1 or 4 cores. Furthermore, inaccurate branch prediction is seen to result in up to 40% mis-speculated instructions in many applications, suggesting that it is important to improve the accuracy of branch predictors in mobile processors. While absolute IPCs are observed to be a poor predictor of benchmark scores, relative IPCs are useful for estimating the impact of microarchitectural changes on benchmark scores.

References

[1]
2019. Determining the TDP of Exynos 5 Dual. http://www.anandtech.com/show/6536/arm-vs-x86-the-real-showdown/13.
[2]
2019. Exynos. http://www.anandtech.com/show/9330/exynos-7420-deep-dive/2.
[3]
2019. GeekBench Suite. https://geekbench.com.
[4]
2019. In-depth with the Snapdragon 810's heat problems. https://arstechnica.com/gadgets/2015/04/in-depth-with-the-snapdragon-810s-heat-problems/.
[5]
2019. Odroid-XU3 Board. http://www.hardkernel.com.
[6]
2019. Workload Automation Tool Suite. https://github.com/ARM-software/workload-automation.
[7]
Rizwana Begum, David Werner, Mark Hempstead, Guru Prasad, and Geoffrey Challen. 2015. Energy-Performance Trade-offs on Energy-Constrained Devices with Multi-component DVFS. In IEEE Int. Symp. on Workload Characterization(IISWC).
[8]
Carole-Jean Wu Benjamin Gaudette and Sarma Vrudhula. 2016. Improving Smart-phone User Experience by Balancing Performance and Energy with Probabilistic QoS Guarantee. In International Symposium on High Performance Computer Architecture (HPCA).
[9]
Dileep Bhandarkar and Jason Ding. 1997. Performance characterization of the Pentium Pro processor. In Intl. Symp. on High-Performance Computer Architecture(HPCA).
[10]
Guilin Chen, Ozcan Ozturk, Guangyu Chen, and Mahmut Kandemir. 2006. Energy-aware code replication for improving reliability in embedded chip multiprocessors. In International SOC Conference.
[11]
G Chen, Liping Xue, Jungsub Kim, Kanwaldeep Sobti, Lanping Deng, Xiaobai Sun, Nikos Pitsianis, Chaitali Chakrabarti, M Kandemir, and Narayanan Vijaykrishnan.2006. Geometric tiling for reducing power consumption in structured matrix operations. In International SOC Conference.
[12]
Hari Cherupalli, Henry Duwe, Weidong Ye, Rakesh Kumar, and John Sartori. 2017. Determining Application-specific Peak Power and Energy Requirements for Ultra-low Power Processors. In International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS).
[13]
Cao Gao, Anthony Gutierrez, Madhav Rajan, Ronald G. Dreslinski, Trevor Mudge,and Carole-Jean Wu. 2015. A Study of Mobile Device Utilization. In Intl. Symp. Performance Analysis of Systems and Software (ISPASS).
[14]
Lorenzo Gomez, Iulian Neamtiu, Tanzirul Azim, and Todd Millstein. 2013. RERAN: Timing- and Touch-sensitive Record and Replay for Android. In Proc. of International Conference on Software Engineering (ICSE).
[15]
A. Gutierrez, R.G. Dreslinski, T.F. Wenisch, T. Mudge, A. Saidi, C. Emmons, and N.Paver. 2011. Full-System Analysis and Characterization of Interactive Smartphone Applications. In IEEE Int. Symp. on Workload Characterization (IISWC).
[16]
Alexey Kopytov. 2019. Sysbench CPU Benchmark Suite. https://launchpad.net/sysbench.
[17]
Dhinakaran Pandiyan, Shin-Ying Lee, and Carole-Jean Wu. 2013. Performance, energy characterizations and architectural implications of an emerging mobile platform benchmark suite - MobileBench. InIEEE Int. Symp. on Workload Characterization (IISWC).
[18]
Partha Ranganathan. 2017. ISCA 2017 Keynote Speech, Video shown during the keynote speech, Toronto, Canada, June 25 2017. ACM ISCA(2017).
[19]
Wonik Seo, Daegil Im, Jeongim Choi, and Jaehyuk Huh. 2015. Big or Little: A Study of Mobile Interactive Applications on an Asymmetric Multi-core Platform. In IEEE Intl. Symp. on Workload Characterization (IISWC).
[20]
Karthik Swaminathan, Emre Kultursay, Vinay Saripalli, Vijaykrishnan Narayanan, Mahmut Kandemir, and Suman Datta. 2011. Improving energy efficiency of multi-threaded applications using heterogeneous CMOS-TFET multicores. In International Symposium on Low Power Electronics and Design (ISLPED).
[21]
Yuhao Zhu, Matthew Halpern, and Vijay Janapa Reddi. 2015. The Role of the CPU in Energy-Efficient Mobile Web Browsing. IEEE Micro35, 1 (2015), 26--33

Cited By

View all
  • (2024)Performance Measurement on Heterogeneous Processors with PAPIProceedings of the SC '24 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis10.1109/SCW63240.2024.00195(1551-1561)Online publication date: 17-Nov-2024
  • (2021)Rules for Optimal Static Tasks Scheduling in Heterogeneous Systems2021 10th Mediterranean Conference on Embedded Computing (MECO)10.1109/MECO52532.2021.9460240(1-4)Online publication date: 7-Jun-2021
  • (2021)Performance optimization opportunities in the Android software stackBenchCouncil Transactions on Benchmarks, Standards and Evaluations10.1016/j.tbench.2021.100003(100003)Online publication date: Nov-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICPE '19: Proceedings of the 2019 ACM/SPEC International Conference on Performance Engineering
April 2019
348 pages
ISBN:9781450362399
DOI:10.1145/3297663
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 April 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. energy efficiency
  2. frequency residency
  3. measurement
  4. multicore utilization
  5. smart phone cpus
  6. workload characterization

Qualifiers

  • Research-article

Conference

ICPE '19

Acceptance Rates

ICPE '19 Paper Acceptance Rate 13 of 71 submissions, 18%;
Overall Acceptance Rate 252 of 851 submissions, 30%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)1
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Performance Measurement on Heterogeneous Processors with PAPIProceedings of the SC '24 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis10.1109/SCW63240.2024.00195(1551-1561)Online publication date: 17-Nov-2024
  • (2021)Rules for Optimal Static Tasks Scheduling in Heterogeneous Systems2021 10th Mediterranean Conference on Embedded Computing (MECO)10.1109/MECO52532.2021.9460240(1-4)Online publication date: 7-Jun-2021
  • (2021)Performance optimization opportunities in the Android software stackBenchCouncil Transactions on Benchmarks, Standards and Evaluations10.1016/j.tbench.2021.100003(100003)Online publication date: Nov-2021
  • (undefined)Constructing a Supplementary Benchmark Suite to Represent Android Applications with User Interactions by using Performance CountersACM Transactions on Architecture and Code Optimization10.1145/3701999

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media