Cross-Layer Profiling of IoTBench

Zhang, Fan; Wang, Chenxi; Luo, Chunjie; Wang, Lei

doi:10.1007/978-981-97-0316-6_5

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14521)

Included in the following conference series:

International Symposium on Benchmarking, Measuring and Optimization

Abstract

The rapid expansion of the Internet of Things (IoT) industry highlights the importance of workload characterization when evaluating microprocessors tailored for IoT applications. The streamlined yet comprehensive system stack of an IoT system is well suited to software and hardware co-design. This stack comprises several layers, including programming languages, frameworks, runtime environments, instruction set architectures (ISA), operating systems (OS), and microarchitecture. These layers can be grouped into three primary categories: the intermediate representation (IR) layer, the ISA layer, and the microarchitecture layer. Cross-layer workload characterization is therefore the first step in IoT design, especially in co-design. In this paper, we use a cross-layer profiling methodology to analyze IoTBench, an IoT workload benchmark. We examine each layer's key metrics, including instruction, data, and branch locality. Experimental evaluations were performed on both ARM and x86 architectures. Our findings reveal general patterns in how IoTBench's metrics fluctuate with different input data. We also observe that the same metric can show different characteristics at different layers, which suggests that analyzing a single layer in isolation may yield incomplete conclusions. In addition, our cross-layer profiling shows that the convolution task, characterized by deeply nested loops, significantly amplified the branch locality issue at the microarchitecture layer on the ARM platform. Interestingly, optimization with the GNU C++ compiler (G++), intended to boost performance, had a counterproductive effect: it exacerbated the branch locality issue and resulted in performance degradation.
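
The abstract names data and branch locality as key metrics but does not define them. As a minimal sketch, assuming two generic proxies that are not taken from the paper (LRU reuse distance for data locality, and Shannon entropy of taken/not-taken outcomes for branch behavior), such metrics could be computed from a recorded trace as follows. The function names and the toy traces are hypothetical.

```python
import math
from collections import Counter


def reuse_distances(addresses):
    """Return, for each access, the number of distinct addresses touched
    since the previous access to the same address (None on first touch).
    Assumption: a generic data-locality proxy, not the paper's metric."""
    lru = []  # least recently used first, most recently used last
    distances = []
    for addr in addresses:
        if addr in lru:
            idx = lru.index(addr)
            distances.append(len(lru) - 1 - idx)
            lru.pop(idx)
        else:
            distances.append(None)
        lru.append(addr)
    return distances


def branch_entropy(outcomes):
    """Shannon entropy, in bits, of one branch's taken/not-taken history.
    Assumption: a generic predictability proxy, not the paper's metric."""
    n = len(outcomes)
    if n == 0:
        return 0.0
    counts = Counter(outcomes)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


# Toy traces (hypothetical data, for illustration only).
data_trace = [1, 2, 3, 1, 2, 2]
loop_branch = [True, True, True, True]
mixed_branch = [True, False, True, False]

print(reuse_distances(data_trace))
print(branch_entropy(loop_branch), branch_entropy(mixed_branch))
```

Small reuse distances indicate accesses that a small cache can capture, and entropy near zero indicates a branch whose direction is almost always the same. Computing the same proxy on traces collected at the IR, ISA, and microarchitecture layers is one way to compare a metric across layers.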

Acknowledgments

This work is supported by the Strategic Priority Research Program of the Chinese Academy of Sciences, Grant Nos. XDA0320000 and XDA0320300.

Author information

Authors and Affiliations

Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
Fan Zhang, Chenxi Wang, Chunjie Luo & Lei Wang
School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing, 100049, China
Chenxi Wang & Lei Wang
International Open Benchmark Council (BenchCouncil), Beijing, China
Chunjie Luo & Lei Wang

Corresponding author

Correspondence to Lei Wang.

Editor information

Editors and Affiliations

TU Wien, Vienna, Austria
Sascha Hunold
Chinese Academy of Sciences, Beijing, China
Biwei Xie
Illinois Institute of Technology, Chicago, IL, USA
Kai Shu

About this paper

Cite this paper

Zhang, F., Wang, C., Luo, C., Wang, L. (2024). Cross-Layer Profiling of IoTBench. In: Hunold, S., Xie, B., Shu, K. (eds) Benchmarking, Measuring, and Optimizing. Bench 2023. Lecture Notes in Computer Science, vol 14521. Springer, Singapore. https://doi.org/10.1007/978-981-97-0316-6_5

DOI: https://doi.org/10.1007/978-981-97-0316-6_5
Published: 14 February 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0315-9
Online ISBN: 978-981-97-0316-6
eBook Packages: Computer Science (R0)
