Abstract
This paper considers code optimization using the novel TS1xx processor from Analog Devices. Very large instruction word architectures (VLIW), such as the TS1xx represent the state of the art in high-performance signal processing. The theoretically achievable peak performance of VLIW processors increases steadily with the use of on-chip parallelism. It is demonstrated that C compiler technology cannot achieve peak computing rates on a statically scheduled processor and the applications programmer must rely on hand optimized Assembler Libraries. This necessitates intimate knowledge of the specific compiler optimization techniques, as well as the underlying hardware. Compiler friendly code optimized by the VisualC2.0 compiler, is compared against hand optimized Assembler code for a common operation involving a loop with multiple memory accesses, floating point arithmetic and pointer operations. It is found that mature C code for matrix vector multiplication executes in roughly 1.18 * n * m cycles, whereas the same operation optimized in assembler has a cycle complexity of 0.5 * n(m + 16) − a measurable performance improvement.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Analog Devices: TigerSharc DSP Hardware Specification. Manual (2001)
P. Faraboschi, G. Desoli, J.F.: The latest word in digital and media processing. IEEE Signal Processing Magazine (1998) 59–85
Bogong Su, J.Wang, A.E.: Source-level loop optimization for dsp code generation. IEEE International Conference on Acoustics, Speech and Signal Processing 4 (1999) 2155–2158
Analog Devices: TigerSharc DSP Instruction Set Specification. Manual (2001)
Dezso Sima, Terence Fountain, Peter Kacsuk: Advanced Computer Architectures. First edn. Addison-Wesley (1997)
Analog Devices: Engineer to Engineer Note EE-147. Manual (2001)
Analog Devices: C/C++2.0 Compiler and Library Manual for TigerSharc. Manual (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pilz, N.A., Adamson, K. (2002). Code Optimization Techniques of Data-Intensive Tasks onto Statically Scheduled Architectures: Optimal Performance on the TigerSharc. In: Fagerholm, J., Haataja, J., Järvinen, J., Lyly, M., Råback, P., Savolainen, V. (eds) Applied Parallel Computing. PARA 2002. Lecture Notes in Computer Science, vol 2367. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48051-X_18
Download citation
DOI: https://doi.org/10.1007/3-540-48051-X_18
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43786-4
Online ISBN: 978-3-540-48051-8
eBook Packages: Springer Book Archive