# Lecture Notes in Computer Science

3133

Commenced Publication in 1973
Founding and Former Series Editors:
Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen

#### **Editorial Board**

Takeo Kanade

Carnegie Mellon University, Pittsburgh, PA, USA

Josef Kittler

University of Surrey, Guildford, UK

Jon M. Kleinberg

Cornell University, Ithaca, NY, USA

Friedemann Mattern

ETH Zurich, Switzerland

John C. Mitchell

Stanford University, CA, USA

Moni Naor

Weizmann Institute of Science, Rehovot, Israel

Oscar Nierstrasz

University of Bern, Switzerland

C. Pandu Rangan

Indian Institute of Technology, Madras, India

Bernhard Steffen

University of Dortmund, Germany

Madhu Sudan

Massachusetts Institute of Technology, MA, USA

Demetri Terzopoulos

New York University, NY, USA

Doug Tygar

University of California, Berkeley, CA, USA

Moshe Y. Vardi

Rice University, Houston, TX, USA

Gerhard Weikum

Max-Planck Institute of Computer Science, Saarbruecken, Germany

# Computer Systems: Architectures, Modeling, and Simulation

Third and Fourth International Workshops, SAMOS 2003 and SAMOS 2004

Samos, Greece, July 21-23, 2003 and July 19-21, 2004

Proceedings



#### Volume Editors

Andy D. Pimentel

University of Amsterdam, Department of Computer Science, Kruislaan 403, 1098 SJ Amsterdam, The Netherlands

E-mail: andy@science.uva.nl

Stamatis Vassiliadis

TU Delft, Department of Computer Engineering, Mathematics and Computer Science Faculty, Mekelweg 4, 2628 CD Delft, The Netherlands

E-mail: stamatis@dutepp0.et.tudelft.nl

Library of Congress Control Number: 2004109047

CR Subject Classification (1998): C, B

ISSN 0302-9743

ISBN 3-540-22377-0 Springer-Verlag Berlin Heidelberg New York

This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer-Verlag. Violations are liable to prosecution under the German Copyright Law.

Springer-Verlag is a part of Springer Science+Business Media

springeronline.com

© Springer-Verlag Berlin Heidelberg 2004

Printed in Germany

Typesetting: Camera-ready by author, data conversion by Olgun Computergrafik

Printed on acid-free paper

SPIN: 11018575 06/3142 5 4 3 2 1 0

#### **Preface**

The SAMOS workshop is an international gathering of highly qualified researchers from academia and industry, sharing in a 3-day lively discussion on the quiet and inspiring northern mountainside of the Mediterranean island of Samos. As a tradition, the workshop features plenary presentations in the morning, while after lunch all kinds of informal discussions and nut cracking gatherings take place. The workshop is unique in the sense that not only solved research problems are presented and discussed, but also (partly) unsolved problems and in-depth topical reviews can be unleashed in the scientific arena. Consequently, the workshop provides the participants with an environment where collaboration rather than competition is fostered.

This fourth edition of the SAMOS workshop developed into a highly interesting event with a program of 36 paper presentations and a keynote speech by Kees Vissers from Xilinx Research. The paper authors represented 12 different countries: the USA, the UK, Canada, Brazil, Germany, France, Spain, Finland, Belgium, Portugal, Greece, and The Netherlands. The presentations were divided into three tracks: reconfigurable computing, architecture and implementation, and system modeling and simulation. We believe that all three tracks showed high-quality, state-of-the-art research in their respective fields.

Besides the papers from SAMOS IV, these proceedings also feature numerous papers from the SAMOS III workshop. As the SAMOS III edition did not have a well-established proceedings mechanism, we decided to give the authors from last year's edition the opportunity to formally publish their papers in these proceedings.

A workshop like this cannot be organized without the help of many other people. We therefore want to thank the members of the Steering and Program Committees as well as the General Chair for their assistance in the organization and the review process for both the SAMOS III and IV papers that are included in these proceedings. Furthermore, we would like to express our sincere gratitude to Iosif Antochi, who prepared the proceedings, and to Lidwina Tromp for her support in organizing both the SAMOS III and IV workshops.

We hope that the attendees enjoyed the SAMOS IV workshop in all its aspects, including its many informal discussions and gatherings.

July 2004

Andy Pimentel

Stamatis Vassiliadis

## Organization

The SAMOS IV workshop took place during July 19–21, 2004 at the Research and Training Institute of East Aegean (INEAG) in Agios Konstantinos on the island of Samos, Greece.

#### General Chair

Shuvra Bhattacharyya University of Maryland, USA

#### **Program Chair**

Andy Pimentel University of Amsterdam, The Netherlands

#### Steering Committee

Shuvra Bhattacharyya University of Maryland, USA

Ed Deprettere Leiden University, The Netherlands

Patrice Quinton Irisa, France

Stamatis Vassiliadis Delft University of Technology, The Netherlands

Jürgen Teich University of Erlangen-Nuremberg, Germany

#### **Program Committee**

Nikitas Dimopoulos University of Victoria, Canada

Gerhard Fettweis TU Dresden, Germany

Georgi Gaydadijev Delft University of Technology, The Netherlands

John Glossner Sandbridge Technologies, USA

Wayne Luk Imperial College London, UK

Andy Pimentel University of Amsterdam, The Netherlands

Bernard Pottier Université de Bretagne Occidentale, France

Jarmo Takala Tampere University of Technology, Finland

Serge Vernalde IMEC, Belgium

Jens Peter Wittenburg Thomson Corporate Research, Germany

#### Local Organizers

Lidwina Tromp Delft University of Technology, The Netherlands

Yiasmin Kioulafa Research and Training Institute of East Aegean, Greece

## **Table of Contents**

| Title | Authors |
|-------|---------|
| **SAMOS III – Reconfigurable Computing** | |
| The Molen Programming Paradigm | |
| Loading $\rho\mu$-Code: Design Considerations | |
| RAMPASS: Reconfigurable and Advanced Multi-processing Architecture for Future Silicon Systems | |
| Basic OS Support for Distributed Reconfigurable Hardware | |
| A Cost-Efficient RISC Processor Platform for Real Time Audio Applications | |
| Customising Processors: Design-Time and Run-Time Opportunities | Wayne Luk |
| Intermediate Level Components for Reconfigurable Platforms | |
| Performance Estimation of Streaming Media Applications for Reconfigurable Platforms | |
| **SAMOS III – Architectures and Implementation** | |
| CoDeL: Automatically Synthesizing Network Interface Controllers | |
| Performance and Power Evaluation of Clustered VLIW Processors with Wide Functional Units | |
| An Optimized Flow for Designing High-Speed, Large-Scale CMOS ASIC SoCs | |
| Register-Based Permutation Networks for Stride Permutations | |
| A Family of Accelerators for Matrix-Vector Arithmetics Based on High-Radix Multiplier Structures | |
| Metrics for Digital Signal Processing Architectures Characterization: Remanence and Scalability | |
| Virtual Architecture Mapping: A SystemC Based Methodology for Architectural Exploration of System-on-Chip Designs | |
| **SAMOS III – Compilers, System Modeling, and Simulation** | |
| Comparison of Data Dependence Analysis Tests | |
| MOUSE: A Shortcut from Matlab Source to SIMD DSP Assembly Code | Gordon Cichon and Gerhard Fettweis |
| High-Level Energy Estimation for ARM-Based SOCs | |
| IDF Models for Trace Transformations: A Case Study in Computational Refinement | |
| **Systems, Architectures, Modeling, and Simulation 2004 (SAMOS IV)** | |
| Programming Extremely Flexible Platforms | |
| **SAMOS IV – Reconfigurable Computing** | |
| The Virtex II Pro™ MOLEN Processor | |
| Reconfigurable Hardware for a Scalable Wavelet Video Decoder and Its Performance Requirements | |
| Design Space Exploration for Configurable Architectures and the Role of Modeling, High-Level Program Analysis and Learning Techniques | Pedro C. Diniz |
| Modeling Loop Unrolling: Approaches and Open Issues | |
| Self-loop Pipelining and Reconfigurable Dataflow Arrays | |
| Architecture Exploration for 3G Telephony Applications Using a Hardware–Software Prototyping Platform | |
| Embedded Context Aware Hardware Component Generation for Dataflow System Exploration | |
| On the (Re-)Use of IP-Components in Re-configurable Platforms | |
| Customising Hardware Designs for Elliptic Curve Cryptography | |
| Dynamic Hardware Reconfigurations: Performance Impact for MPEG2 | Elena Moscu Panainte, Koen Bertels, and Stamatis Vassiliadis |
| Compiler and System Techniques for SoC Distributed Reconfigurable Accelerators | |
| **SAMOS IV – Architectures and Implementation** | |
| Design Space Exploration with Automatic Selection of SW and HW for Embedded Applications | |
| On Enhancing SIMD-Controlled DSPs for Performing Recursive Filtering | Michael Hosemann and Gerhard Fettweis |
| Memory Bandwidth Requirements of Tile-Based Rendering | |
| Using CoDeL to Rapidly Prototype Network Processor Extensions | |
| Synchronous Transfer Architecture (STA) | |
| Generated DSP Cores for Implementation of an OFDM Communication System | |
| A Novel Data-Path for Accelerating DSP Kernels | |
| Scalable FFT Processors and Pipelined Butterfly Units | |
| Scalable Instruction-Level Parallelism | |
| A Low-Power Multithreaded Processor for Baseband Communication Systems | |
| Initial Evaluation of Multimedia Extensions on VLIW Architectures | Esther Salamí and Mateo Valero |
| HIBI v.2 Communication Network for System-on-Chip | |
| **SAMOS IV – System Modeling and Simulation** | |
| DIF: An Interchange Format for Dataflow-Based Design Tools | |
| Scalable and Modular Scheduling | |
| Early ISS Integration into Network-on-Chip Designs | |
| Cycle Accurate Simulation Model Generation for SoC Prototyping | Antoine Fraboulet, Tanguy Risset, and Antoine Scherrer |
| Modeling Instruction Semantics in ADL Processor Descriptions for C Compiler Retargeting | Jianjiang Ceng, Weihua Sheng, Manuel Hohenauer, Rainer Leupers, Gerd Ascheid, Heinrich Meyr, and Gunnar Braun |
| A Communication-Centric Design Flow for HIBI-Based SoCs | |
| Performance Analysis of SoC Communication by Application of Deterministic and Stochastic Petri Nets | |
| Communication Optimization in Compaan Process Networks | |
| Analysis of Dataflow Programs with Interval-Limited Data-Rates | Jürgen Teich and Shuvra S. Bhattacharyya |
| High-Speed Event-Driven RTL Compiled Simulation | |
| A High-Level Programming Paradigm for SystemC | |
| Power, Performance and Area Exploration for Data Memory Assignment of Multimedia Applications | |
| Constraints Derivation and Propagation for Large-Scale Embedded Systems Exploration | |
| **Author Index** | |