ABSTRACT
In recent years, context-aware mobile sensing apps have emerged that apply machine learning algorithms to real-time sensor data to infer user behaviors and contexts. These apps are typically optimized for power and performance on the app processors of mobile platforms. However, modern mobile platforms are sophisticated systems-on-chip (SoCs) in which the main app processors are complemented by multiple co-processors. Chip vendors have recently begun making these previously hidden co-processors, such as digital signal processors (DSPs), programmable. In this paper, we explore the energy and performance implications of off-loading the computation associated with machine learning algorithms in context-aware apps to DSPs embedded in mobile SoCs. Our results show a 17% reduction in the energy usage of a TI OMAP4-based mobile platform from off-loading context classification computation to the DSP core, with indiscernible latency overhead. We also describe the design of a run-time system service for energy-efficient context inference on Android devices, which takes parameters from the app to instantiate the classification model and schedules execution on the DSP or the app processor, as specified by the app.
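The run-time service described in the abstract can be pictured with a minimal sketch. It is not the paper's implementation: the names `Processor`, `LinearModel`, and `ContextInferenceService` are hypothetical, the linear classifier is an assumed stand-in for whatever model an app supplies, and both targets run the same Python code here because real DSP off-loading requires vendor-specific tooling. The sketch only shows the interface shape: the app passes model parameters and names the processor it wants.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Sequence, Tuple


class Processor(Enum):
    """Execution targets an app can request (hypothetical names)."""
    APP = "app"
    DSP = "dsp"


@dataclass(frozen=True)
class LinearModel:
    """Model parameters supplied by the app (assumed linear classifier)."""
    weights: List[List[float]]  # one weight vector per class
    biases: List[float]         # one bias per class
    labels: List[str]           # one context label per class

    def classify(self, features: Sequence[float]) -> str:
        # Score each class as dot(weights, features) + bias, pick the largest.
        scores = [
            sum(w * x for w, x in zip(row, features)) + b
            for row, b in zip(self.weights, self.biases)
        ]
        return self.labels[scores.index(max(scores))]


class ContextInferenceService:
    """Dispatches classification to the processor the app specifies."""

    def __init__(self) -> None:
        # Records (target, label) pairs so callers can inspect dispatching.
        self.dispatch_log: List[Tuple[Processor, str]] = []

    def infer(self, model: LinearModel, features: Sequence[float],
              target: Processor) -> str:
        if not isinstance(target, Processor):
            raise ValueError("target must be a Processor value")
        # Assumption: both targets execute the same code in this sketch;
        # a real service would hand the work to a DSP-side kernel here.
        label = model.classify(features)
        self.dispatch_log.append((target, label))
        return label
```

An app would build a `LinearModel` from its trained parameters and call `infer(model, features, Processor.DSP)` for each feature vector. Keeping the target as an explicit argument mirrors the abstract's statement that scheduling follows the app's specification rather than an automatic policy.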