# Lecture Notes in Computer Science

7199

Commenced Publication in 1973
Founding and Former Series Editors:
Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen

#### **Editorial Board**

David Hutchison

Lancaster University, UK

Takeo Kanade

Carnegie Mellon University, Pittsburgh, PA, USA

Josef Kittler

University of Surrey, Guildford, UK

Jon M. Kleinberg

Cornell University, Ithaca, NY, USA

Alfred Kobsa

University of California, Irvine, CA, USA

Friedemann Mattern

ETH Zurich, Switzerland

John C. Mitchell

Stanford University, CA, USA

Moni Naor

Weizmann Institute of Science, Rehovot, Israel

Oscar Nierstrasz

University of Bern, Switzerland

C. Pandu Rangan

Indian Institute of Technology, Madras, India

Bernhard Steffen

TU Dortmund University, Germany

Madhu Sudan

Microsoft Research, Cambridge, MA, USA

Demetri Terzopoulos

University of California, Los Angeles, CA, USA

Doug Tygar

University of California, Berkeley, CA, USA

Gerhard Weikum

Max Planck Institute for Informatics, Saarbruecken, Germany

Oliver C.S. Choy, Ray C. C. Cheung, Peter Athanas, Kentaro Sano (Eds.)

# Reconfigurable Computing: Architectures, Tools and Applications

8th International Symposium, ARC 2012, Hong Kong, China, March 19–23, 2012, Proceedings



#### Volume Editors

Oliver C.S. Choy, The Chinese University of Hong Kong, Department of Electronic Engineering, Shatin, N.T., Hong Kong, China. E-mail: cschoy@ee.cuhk.edu.hk

Ray C. C. Cheung, City University of Hong Kong, Department of Electronic Engineering, Kowloon Tong, Hong Kong, China. E-mail: r.cheung@cityu.edu.hk

Peter Athanas, Virginia Tech, Department of ECE, 302 Whittemore Hall, Blacksburg, VA 24061, USA. E-mail: athanas@vt.edu

Kentaro Sano, Tohoku University, 6-6-01 Aramaki Aza Aoba, Aobaku, Sendai, Miyagi, 981-8579, Japan. E-mail: kentah@caero.mech.tohoku.ac.jp

ISSN 0302-9743, e-ISSN 1611-3349

ISBN 978-3-642-28364-2, e-ISBN 978-3-642-28365-9

DOI 10.1007/978-3-642-28365-9

Springer Heidelberg Dordrecht London New York

Library of Congress Control Number: 2012931224

CR Subject Classification (1998): C.2, D.2, I.4, H.3, F.1, I.6, D.3

LNCS Sublibrary: SL 1 – Theoretical Computer Science and General Issues

© Springer-Verlag Berlin Heidelberg 2012

This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer. Violations are liable to prosecution under the German Copyright Law.

The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.

Typesetting: Camera-ready by author, data conversion by Scientific Publishing Services, Chennai, India

Printed on acid-free paper

Springer is part of Springer Science+Business Media (www.springer.com)

### Preface

The 8th International Symposium on Applied Reconfigurable Computing (ARC 2012) was held at the Chinese University of Hong Kong (CUHK), Shatin, Hong Kong, during March 21–23, 2012. The symposium also included industrial workshops provided by Xilinx and Altera at CUHK and the City University of Hong Kong during March 19–20, 2012. The symposium and workshops were sponsored by Xilinx Inc., Altera Corp., and CPO Technologies Corporation.

ARC 2012 received 44 submissions from 17 countries all over the world. Each paper was reviewed by at least four committee members, involving nearly 200 reviews. More than 80 committee members were invited to form the Program Committee according to their particular areas of expertise. The Program Committee selected 25 papers for oral presentation, and 10 poster papers for the proceedings. The main theme this year was "ARC for better living". The program was completed with five excellent invited talks given by Michael J. Flynn (Stanford University), Sorin A. Huss (Technische Universitaet Darmstadt), Cetin Kaya Koc (UCSB), Wayne Luk (Imperial College London), and Grant Martin (Tensilica Inc.).

Many people contributed to the success of ARC 2012. First, we would like to thank all the authors who submitted their excellent research results to ARC. Second, we would like to thank all of the 80 Program Committee members, as well as the external reviewers, who volunteered to read the papers. We are greatly indebted to the Proceedings Chair, Yuet Ming Lam, for his relentless efforts in compiling the proceedings. We would like to thank the local Organizing Committee Chairs, Evan Young and Hayden So, and their team for all the hotel and local arrangements. We also thank the ARC 2012 Organizing Committee team and the Conference Secretariat, Momentous Asia, Hong Kong, for their great efforts. Finally, we want to express our gratitude to our generous sponsors, Xilinx Inc., Altera Corp., and CPO Technologies Corporation, and to the local Hong Kong universities, the IEEE Hong Kong Section Computer Society Chapter, and the CAS/COM Chapter for their support.

December 2011

Oliver Choy, Ray Cheung, Peter Athanas, Kentaro Sano

# Organization

ARC 2012 was organized by the Department of Electronic Engineering, the Chinese University of Hong Kong and the Department of Electronic Engineering, City University of Hong Kong.

## Organising Committee

**General Chair**

Oliver Choy, The Chinese University of Hong Kong, Hong Kong

Ray Cheung, City University of Hong Kong, Hong Kong

**Program Chair**

Peter Athanas, Virginia Tech, USA

Kentaro Sano, Tohoku University, Japan

**Local Arrangements Chair**

Hayden So, University of Hong Kong, Hong Kong

Evan Young, The Chinese University of Hong Kong, Hong Kong

**Publicity Chair**

Suhaib Fahmy, Nanyang Technological University, Singapore

Bryan Hu, University of Alberta, Canada

**Proceedings Chair**

Yuet Ming Lam, Macau University of Science and Technology, Macau

**Finance Chair**

Chiwai Yu, City University of Hong Kong, Hong Kong

**Registration Chair**

Bruce Sham, The Hong Kong Polytechnic University, Hong Kong

**Banquet Chair**

Patrick Hung, CPO Technologies Corporation, USA

**Sponsorship Chair**

Wei Zhang, Nanyang Technological University, Singapore

**Webmaster**

Cedric Yiu, The Hong Kong Polytechnic University, Hong Kong

**Secretariat**

Laurie Lau, Momentous Asia, Hong Kong

## **Program Committee**

Jeff Arnold, Stretch Inc., USA

Peter Athanas, Virginia Tech, USA

Michael Attig, Xilinx Research Labs, San Jose, USA

Jürgen Becker, Universität Karlsruhe (TH), Germany

Khaled Benkrid, University of Edinburgh, UK

Mladen Berekovic, Braunschweig University of Technology, Germany

Neil Bergmann, University of Queensland, Australia

Koen Bertels, Delft University of Technology, The Netherlands

Christos-Savvas Bouganis, Imperial College London, UK

Stephen Brown, Altera Corp., University of Toronto, Canada

João M.P. Cardoso, University of Porto/INESC-ID, Portugal

Ray Cheung, City University of Hong Kong, Hong Kong

Oliver Choy, The Chinese University of Hong Kong, Hong Kong

Albert Chung, The Hong Kong University of Science and Technology, Hong Kong

Katherine Compton, University of Wisconsin-Madison, USA

George Constantinides, Imperial College London, UK

Florent de Dinechin, Ecole Normale Superieure de Lyon, France

Pedro C. Diniz, Technical University of Lisbon (IST) / INESC-ID, Portugal

Tarek El-Ghazawi, George Washington University, USA

Robert Esser, Apple Inc., USA

Suhaib Fahmy, Nanyang Technological University, Singapore

António Ferrari, University of Aveiro, Portugal

Kris Gaj, George Mason University, USA

Guy Gogniat, Université de Bretagne Sud, France

Yajun Ha, National University of Singapore

Jim Harkin, University of Ulster, Magee, UK

Reiner Hartenstein, University of Kaiserslautern, Germany

Roman Hermida, Universidad Complutense, Madrid, Spain

Christian Hochberger, TU Dresden, Germany

Michael Hübner, Karlsruhe Institute of Technology (KIT), Germany

Bryan Hu, University of Alberta, Canada

Patrick Hung, CPO Technologies Corporation

Masahiro Iida, Kumamoto University, Japan

Yasushi Inoguchi, Japan Advanced Institute of Science and Technology, Japan

Tomonori Izumi, Ritsumeikan University, Japan

Ryan Kastner, University of California, San Diego, USA

Andreas Koch, TU Darmstadt, Germany

Ram Krishnamurthy, Intel Corp., USA

Yuet-Ming Lam, Macau University of Science and Technology, P.R. China

Philip Leong, University of Sydney, Australia

Wayne Luk, Imperial College London, UK

Terrence Mak, Newcastle University, UK

Eduardo Marques, University of São Paulo, Brazil

Kostas Masselos, University of the Peloponnese, Greece

Sanu Mathew, Intel Corp., USA

John McAllister, Queen's University of Belfast, UK

Seda Memik, Northwestern University, USA

Takefumi Miyoshi, The University of Electro-Communications, Japan

Fearghal Morgan, National University of Ireland, Galway, Ireland

Vikram Narayana, The George Washington University, USA

Brent Nelson, Brigham Young University, USA

Horacio Neto, INESC-ID/IST, Portugal

Elaine Ou, University of Sydney, Australia

Joon-seok Park, Inha University, Seoul, South Korea

Thilo Pionteck, University of Luebeck, Germany

Joachim Pistorius, Altera Corp., USA

Marco Platzner, University of Paderborn, Germany

Bernard Pottier, University of Bretagne, France

Francisco Rodriguez-Henriquez, Cinvestav, Mexico

Kyle Rupnow, Advanced Digital Sciences Center

Kentaro Sano, Tohoku University, Japan

Tsutomu Sasao, Kyushu Institute of Technology

Yukinori Sato, Japan Advanced Institute of Science and Technology, Japan

Erkay Savas, Sabanci University, Turkey

Patrick Schaumont, Virginia Tech, USA

Bruce Sham, The Hong Kong Polytechnic University, Hong Kong

Farhana Sheikh, Intel Corp., USA

Pete Sedcole, CSR, France

Lesley Shannon, Simon Fraser University, Canada

Yuichiro Shibata, Nagasaki University, Japan

Hayden So, University of Hong Kong, Hong Kong

Pedro Trancoso, University of Cyprus, Cyprus

Markus Weinhardt, Osnabrück University of Applied Sciences, Germany

Stephan Wong, Delft University of Technology, The Netherlands

Roger Woods, The Queen's University of Belfast, UK

Yoshiki Yamaguchi, Tsukuba University, Japan

Cedric Yiu, The Hong Kong Polytechnic University, Hong Kong

Evan Young, The Chinese University of Hong Kong, Hong Kong

Peter Zipf, University of Kassel, Germany

## Sponsoring Organizations

Gold sponsorship: Xilinx Inc.

Silver sponsorship: Altera Corporation

Bronze sponsorship: CPO Technologies Corporation

# **Table of Contents**

**Applied RC Design Methods and Tools**

| Title | Authors |
|-------|---------|
| Automating Reconfiguration Chain Generation for SRL-Based Run-Time Reconfiguration | Karel Heyse, Brahim Al Farisi, Karel Bruneel, and Dirk Stroobandt |
| Architecture-Aware Reconfiguration-Centric Floorplanning for Partial Reconfiguration | |
| Domain-Specific Language and Compiler for Stencil Computation on FPGA-Based Systolic Computational-Memory Array | |
| Exploiting Both Pipelining and Data Parallelism with SIMD Reconfigurable Architecture | |
| Table-Based Division by Small Integer Constants | |
| Heterogeneous Systems for Energy Efficient Scientific Computing | Qiang Liu and Wayne Luk |
| The Q<sup>2</sup> Profiling Framework: Driving Application Mapping for Heterogeneous Reconfigurable Platforms | |

**Applied RC Architectures**

| Title | Authors |
|-------|---------|
| PPMC: A Programmable Pattern Based Memory Controller | |
| A Run-Time Task Migration Scheme for an Adjustable Issue-Slots Multi-core Processor | |
| Boosting Single Thread Performance in Mobile Processors via Reconfigurable Acceleration | |
| Finite Field Digit Serial Multipliers on FPGAs | Gang Zhou, Li Li, and Harald Michalik |
| ScalableCore System: A Scalable Many-Core Simulator by Employing over 100 FPGAs | Shinya Takamaeda-Yamazaki, Shintaro Sano, Yoshito Sakaguchi, Naoki Fujieda, and Kenji Kise |
| Scalable Memory Hierarchies for Embedded Manycore Systems | Sen Ma, Miaoqing Huang, Eugene Cartwright, and David Andrews |
| Triple Module Redundancy of a Laser Array Driver Circuit for Optically Reconfigurable Gate Arrays | |
| A Routing Architecture for FPGAs with Dual-VT Switch Box and Logic Clusters | |

**Applied RC Applications**

| Title | Authors |
|-------|---------|
| Multi-level Customisation Framework for Curve Based Monte Carlo Financial Simulations | |
| A Low-Cost and High-Performance Virus Scanning Engine Using a Binary CAM Emulator and an MPU | |
| Cost Effective Implementation of Flux Limiter Functions Using Partial Reconfiguration | |
| Parallel Tempering MCMC Acceleration Using Reconfigurable Hardware | |
| A High Throughput FPGA-Based Implementation of the Lanczos Method for the Symmetric Extremal Eigenvalue Problem | |
| Optimising Performance of Quadrature Methods with Reduced Precision | Anson H.T. Tse, Gary C.T. Chow, Qiwei Jin, David B. Thomas, and Wayne Luk |

**Critical Issues in Applied RC**

| Title | Authors |
|-------|---------|
| Teaching Hardware/Software Codesign on a Reconfigurable Computing Platform | |
| Securely Sealing Multi-FPGA Systems | |
| FPGA Paranoia: Testing Numerical Properties of FPGA Floating Point IP-Cores | |
| High Performance Reconfigurable Architecture for Double Precision Floating Point Division | |

**Posters**

| Title | Authors |
|-------|---------|
| A Modular-Based Assembly Framework for Autonomous Reconfigurable Systems | |
| Constructing Cluster of Simple FPGA Boards for Cryptologic Computations | |
| Reconfigurable Multicore Architecture for Dynamic Processor Reallocation | |
| Efficient Communication for FPGA Clusters | |
| Performance Analysis of Reconfigurable Processors Using MVA Analysis | |
| PDPR: Fine-Grained Placement for Dynamic Partially Reconfigurable FPGAs | |
| A Connection Router for the Dynamic Reconfiguration of FPGAs | Elias Vansteenkiste, Karel Bruneel, and Dirk Stroobandt |
| R-NoC: An Efficient Packet-Switched Reconfigurable Networks-on-Chip | |
| Novel Arithmetic Architecture for High Performance Implementation of SHA-3 Finalist Keccak on FPGA Platforms | Kashif Latif, M. Muzaffar Rao, Athar Mahboob, and Arshad Aziz |
| CRAIS: A Crossbar Based Adaptive Interconnection Scheme | |

Author Index