skip to main content
10.1145/1278480.1278682acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
Article

Design methodology for pipelined heterogeneous multiprocessor system

Published:04 June 2007Publication History

ABSTRACT

Multiprocessor SoC systems have led to the increasing use of parallel hardware along with the associated software. These approaches have included coprocessor, homogeneous processor (e.g. SMP) and application specific architectures (i.e. DSP, ASIC). ASIPs have emerged as a viable alternative to conventional processing entities (PEs) due to its configurability and programmability. In this work, we introduce a heterogeneous multi-processor system using ASIPs as processing entities in a pipeline configuration. A streaming application is taken and manually broken into a series of algorithmic stages (each of which make up a stage in a pipeline). We formulate the problem of mapping each algorithmic stage in the system to an ASIP configuration, and propose a heuristic to efficiently search the design space for a pipeline-based multi ASIP system.

We have implemented the proposed heterogeneous multiprocessor methodology using a commercial extensible processor (Xtensa LX from Tensilica Inc.). We have evaluated our system by creating two benchmarks (MP3 and JPEG encoders) which are mapped to our proposed design platform. Our multiprocessor design provided a performance improvement of at least 4.11X (JPEG) and 3.36X (MP3) compared to the single processor design. The minimum cost obtained through our heuristic was within 5.47% and 5.74% of the best possible values for JPEG and MP3 benchmarks respectively.

References

  1. Altera Nios Processor. Altera Corp. (http://www.altera.com).Google ScholarGoogle Scholar
  2. ARCtangent. ARC International (http://www.arc.com).Google ScholarGoogle Scholar
  3. SP-5flex. 3DSP Corp. (http://www.3dsp.com).Google ScholarGoogle Scholar
  4. SystemC Initiative. (http://www.systemc.org).Google ScholarGoogle Scholar
  5. Xtensa Processor. Tensilica Inc. (http://www.tensilica.com).Google ScholarGoogle Scholar
  6. Flix: Fast relief for performance-hungry embedded applications. Tensilica Inc. (http://www.tensilica.com/pdf/FLIX_White_Paper_v2.pdf), 2005.Google ScholarGoogle Scholar
  7. J. Axelsson. A Case Study in Heterogeneous Implementation of Automotive Real-Time Systems. In CODES'98, Seattle, 1998.Google ScholarGoogle Scholar
  8. S. Banerjee, T. Hamada, P. M. Chau, and R. D. Fellman. Macro Pipelining Based Scheduling on High Performance Heterogeneous Multiprocessor Systems. Signal Processing, IEEE Transactions on, 43(6):1468--1484, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. S. Baruah. Task partitioning upon heterogeneous multiprocessor platforms. In RTAS'04, pages 536--543, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Berić, R. Sethuraman, C. A. Pinto, H. Peters, G. Veldman, P. van de Haar, and M. Duranton. Heterogeneous Multiprocessor for High Definition Video. In ICCE'06, pages 401--402, 2006.Google ScholarGoogle Scholar
  11. T. D. Braun, H. J. Siegel, and A. A. Maciejewski. Heterogeneous computing: Goals, methods, and open problems. In HiPC 2001, volume 2228, pages 302--320, Hyderabad, India, 2001. Springer. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K. S. Chatha and R. Vemuri. A Tool for Partitioning and Pipelined Scheduling of Hardware-Software Systems. In ISSS'98, pages 145--151, Hsinchu, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. CriticalBlue. Coprocessor synthesis - increassing system on chip platform ROL Technical report, CriticalBlue, June 2004.Google ScholarGoogle Scholar
  14. T. Givargis, F. Vahid, and J. Henkel. System-Level Exploration for Pareto-Optimal COnfigurations in Parameterized System-on-a-Chip. Very Large Scale Integration (VLSI) Systems, IEEE Transactions on, 10(4):416--422, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. Gopalakrishnan and M. Caccamo. Task Partitioning with Replication upon Heterogeneous Multiprocessor Systems. In RTAS'06, pages 199--207, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. J. L. Hennessy and D. A. Patterson. Computer Architecture: A Quantitative Approach. Morgan Kaufmann Publishers, 3rd edition, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. Jeon and K. Choi. Loop Pipelining in Hardware-Software Partitioning. In ASP-DAC'98, pages 361--366, Yokohama, Japan, 1998.Google ScholarGoogle Scholar
  18. G. Kahn. The semantics of a simple language for parallel programming. In IFIP'74, pages 471--475, Stockolm, Sweden, 1974.Google ScholarGoogle Scholar
  19. M. Kim, D. Kim, and G. E. Sobelman. MPEG-4 performance analysis for a CDMA network-on-chip. In ICCCAS'05, pages 493--496, 2005.Google ScholarGoogle Scholar
  20. T. Kodaka, K. Kimura, and H. Kasahara. Multigrain Parallel Processing for JPEG Encoding on a Single Chip Multiprocessor. In IWIA'02, pages 57--63, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. Kumar, D. Tullsen, N. Jouppi, and P. Ranganathan. Heterogeneous Chip Multiprocessors. Computer, 38(11):32--38, November 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. D. Pham. The design and implementation of a first-generation cell processor. In ISSCC 2005, pages 184--186. IEEE CS Press, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  23. F. Salice, L. Del Vecchio, L. Pomante, and W. Fornaciari. Partitioning of Embedded Applications onto Heterogeneous Multiprocessor Architectures. In ACM symposium on Applied computing, pages 661--665, Melbourne, Florida, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. S. L. Shee, A. Erdos, and S. Parameswaran. Heterogeneous Multiprocessor Implementations for JPEG: A Case Study. In CODES+ISSS'06, Seoul, Korea, 2006.Google ScholarGoogle Scholar
  25. G. C. Sih and E. A. Lee. Declustering: A New Multiprocessor Scheduling Technique. IEEE Transactions of Parallel and Distributed Systems, 4(6):625--637, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. J. E. Smith and G. S. Sohi. The Microarchitecture of Superscalar Processors. Proceedings of the IEEE, 83(12):1609--1624, 1995.Google ScholarGoogle ScholarCross RefCross Ref
  27. M. T. J. Strik, A. H. Timmer, J. L. van Meerbergen, and G.-J. van Rootselaar. Heterogeneous multiprocessor for the management of real-time video and graphics streams. Solid-State Circuits, IEEE Journal of, 35(11):1722--1731, 2000.Google ScholarGoogle Scholar
  28. F. Sun, S. Ravi, A. Raghunathan, and N. K. Jha. Custom-instruction synthesis for extensible-processor platforms. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 23(2):216--228, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. F. Sun, S. Ravi, A. Raghunathan, and N. K. Jha. Synthesis of Application-specific Heterogeneous Miltiprocessor Architectures using Extensible Processors. In VL-SID'05, pages 551--556, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. V. Živojnović, S. Pees, and H. Myer. LISA-machine description language and generic machine model for HW/SW co-design. In Workshop on VLSI Signal Processing, pages 127--136, 1996.Google ScholarGoogle ScholarCross RefCross Ref
  31. A. Wieferink, M. Doerper, R. Leupers, G. Ascheid, H. Meyr, T. Kogel, G. Braun, and A. Nohl. System Level Processor/Communication Co-exploration Methodology for Multiprocessor System-on-Chip Platforms. Computers and Digital Techniques, IEE Proceedings, 152(1):3--11, 2005.Google ScholarGoogle Scholar
  32. N. Zhang and C.-H. Wu. Study on Adaptive Job Assignment for Multiprocessor Implementation of MPEG2 Video Encoding. Industrial Electronics, IEEE Transactions on, 44(5):726--734, 1997.Google ScholarGoogle Scholar

Index Terms

  1. Design methodology for pipelined heterogeneous multiprocessor system

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            DAC '07: Proceedings of the 44th annual Design Automation Conference
            June 2007
            1016 pages
            ISBN:9781595936271
            DOI:10.1145/1278480

            Copyright © 2007 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 4 June 2007

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

            Acceptance Rates

            DAC '07 Paper Acceptance Rate152of659submissions,23%Overall Acceptance Rate1,770of5,499submissions,32%

            Upcoming Conference

            DAC '24
            61st ACM/IEEE Design Automation Conference
            June 23 - 27, 2024
            San Francisco , CA , USA

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader