ABSTRACT
Parallel computing at all levels is becoming important in all devices and not least in mobile and embedded systems. Many wireless, mobile and deployable devices make use of the ARM CPU and its variants. We report on investigations into measuring instruction level parallelism on the ARM processor and on characterising the fine grained parallelism of four generations of ARM cores. We discuss the implications for future applications that use such devices with parallelism and for the future of mobile computing device architectures.
- http://rtcgroup.com/arm/2007/presentations/253%20-%20ARM_DevCon_2007_Snapdragon_FINAL_20071004.pdfGoogle Scholar
- Abbott, D.: Linux for Embedded and Real-Time Applcations. ISBN: 978-0-7506-7932-9, Newnes/Elsevier (2006) Google ScholarDigital Library
- Alameldeen, A. R.; Wood, D. A.;, "IPC Considered Harmful for Multiprocessor Workloads," Micro, IEEE, vol.26, no.4, pp.8--17, July-Aug. 2006 doi: 10.1109/MM.2006.73 Google ScholarDigital Library
- Bormann, C., Castellani, A., Shelby, Z.: Coap: An application protocol for billions of tiny internet nodes. IEEE Internet Computing 16, 62--67 (2012) Google ScholarDigital Library
- Butgereit, L., Coetzee, L., Smith, A. C.: Turn me on! using the "internet of things" to turn things on and off. In: Proc. 6th Int. Conf. on Pervasive Computing and Applications (ICPCA). pp. 4--10. Port Elizabeth, South Africa (26--28 October 2011)Google Scholar
- Cheng, Y., lin Liu, Y., Liang, Y., qin Wang, S.: Design of image acquisition and storage system based on arm and embedded. In: Proc. 2nd Int. Conf. on Consumer Electronics, Communications and networks (CECNet). pp. 981--984. Xianning, China (21--23 April 2012)Google ScholarCross Ref
- Engel, M., Freisleben, B., Smith, M., Hanemann, S.: Wireless ad-hoc network emulation using microkernel-based virtual linux systems. In: Proceedings of the 5th EUROSIM Congress on Modeling and Simulation, Marne la Vallee, France. pp. 198--203. EUROSIM Publishers (2004)Google Scholar
- Guinard, D., Trifa, V., Mattern, F., Wilde, E.: From the internet of things to the web of things: Resource oriented architecture and best practices1. In: Architecting the Internet of Things. Springer (2011)Google ScholarCross Ref
- Hawick, K., Leist, A., Playne, D., Johnson, M.: Comparing Intra- and Inter-Processor Parallelism on Multi-Core CellBE Processors for Scientific Simulations. In: Proc. Parallel and Distributed Computing and Systems (PDCS 2010) (2010)Google Scholar
- Kopetz, H.: Real-Time Systems: Design Principles for Distributed Embedded Applications, chap. Internet of Things, pp. 307--323. Springer (2011)Google Scholar
- Koranne, S.: Handbook of Open Source Tools. Springer (2011), iSBN 978-1-4419-7718-2 Google ScholarDigital Library
- Leist, A., Playne, D., Hawick, K.: Exploiting Graphical Processing Units for Data-Parallel Scientific Applications. Concurrency and Computation: Practice and Experience 21, 2400--2437 (December 2009), {CSTN-065} Google ScholarDigital Library
- Nbench documentation: http://www.tux.org/~mayer/linux/byte/bdoc.pdfGoogle Scholar
- Panda: Panda board development resources (March 2012), http://pandaboard.orgGoogle Scholar
- Perihelion Distributed Software: Helios-arm user guide (October 1995)Google Scholar
- Pi, R.: Raspberry pi - an arm gnu/linux box (march 2012), http://www.raspberrypi.orgGoogle Scholar
- Pratyusha, G., Ramesh, N.: Porting the linux kernel to an arm based development board. Int. J. Engineering Research and Applications 2(2), 1614--1618 (Mar-Apr 2012)Google Scholar
- Qualcomm Snapdragon Whitepapers: http://www.qualcomm.com/snapdragonGoogle Scholar
- Sloss, A. N.: {ARM System Developer's Guide: Designing and Optimizing System Software}. Elsevier (May 2010), iSBN 978-0080-490-496 Google ScholarDigital Library
- Uckelmann, D., Harrison, M., Michahelles, F.: An architectural approach towards the future internet of things. In: Architecting the Internet of Things. Springer (2011)Google ScholarDigital Library
- Yarow, J.: Tablet computing: A history of failure. Business Insider Online, 1--3 (January 2010)Google Scholar
Index Terms
- Empirical measurement of instruction level parallelism for four generations of ARM CPUs
Recommendations
Converting thread-level parallelism to instruction-level parallelism via simultaneous multithreading
To achieve high performance, contemporary computer systems rely on two forms of parallelism: instruction-level parallelism (ILP) and thread-level parallelism (TLP). Wide-issue super-scalar processors exploit ILP by executing multiple instructions from a ...
Dual-IS: Instruction Set Modality for Efficient Instruction Level Parallelism
Architecture of Computing SystemsAbstractExploiting instruction level parallelism (ILP) is a widely used method for increasing performance of processors. While traditional very long instruction word (VLIW) processors can exploit ILP energy-efficiently thanks to static instruction ...
Natural instruction level parallelism-aware compiler for high-performance QueueCore processor architecture
This work presents a static method implemented in a compiler for extracting high instruction level parallelism for the 32-bit QueueCore, a queue computation-based processor. The instructions of a queue processor implicitly read and write their operands, ...
Comments