research-article

Swap Based Merge Network for High Performance Sorting Accelerators

Author:
Kenji Kise

Tokyo Institute of Technology

Tokyo Institute of Technology
View Profile

HEART '18: Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable TechnologiesJune 2018Article No.: 8Pages 1–7https://doi.org/10.1145/3241793.3241801

Published:20 June 2018Publication History

HEART '18: Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies

Pages 1–7

ABSTRACT

A hardware module called merge network is the key module for constructing FPGA-based sorting accelerators. Therefore, we propose a novel merge network based on compare and swap operations for high performance sorting accelerators. Our proposal is based on the state-of-the-art merge network and it tries to mitigate its drawback that the maximum wiring delay and the maximum fanout increase when the number of records output per cycle is increased.

We implement some merge networks adopting the proposal on a Virtex-7 FPGA. The evaluation results show that the maximum fanout of the proposal is constant, and the maximum wiring delay of the proposal is almost constant. Because of these desirable properties, the proposal of the largest configuration achieves 1.43x higher throughput than the state-of-the-art merge network.

References

Jared Casper and Kunle Olukotun. 2014. Hardware Acceleration of Database Operations. In Proceedings of the 2014 ACM/SIGDA International Symposium on Field-programmable Gate Arrays (FPGA '14). ACM, New York, NY, USA, 151--160. Google ScholarDigital Library
Minsik Cho, Daniel Brand, Rajesh Bordawekar, Ulrich Finkler, Vincent Kulandaisamy, and Ruchir Puri. 2015. PARADIS: An Efficient Parallel Algorithm for In place Radix Sort. Proc. VLDB Endow. 8, 12 (Aug. 2015), 1518--1529. Google ScholarDigital Library
Andrew Davidson, David Tarjan, Michael Garland, and John D. Owens. 2012. Efficient parallel merge sort for fixed and variable length keys. In 2012 Innovative Parallel Computing (InPar). IEEE, 1--9.Google Scholar
Hiroshi Inoue and Kenjiro Taura. 2015. SIMD- and Cache-friendly Algorithm for Sorting an Array of Structures. Proc. VLDB Endow. 8, 11 (July 2015), 1274--1285. Google ScholarDigital Library
Dirk Koch and Jim Torresen. 2011. FPGASort: A High Performance Sorting Architecture Exploiting Run-time Reconfiguration on Fpgas for Large Problem Sorting. In Proceedings of the 19th ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA '11). ACM, New York, NY, USA, 45--54. Google ScholarDigital Library
Susumu Mashimo, Thiem Van Chu, and Kenji Kise. 2017. High-Performance Hardware Merge Sorter. In 2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM). 1--8.Google Scholar
Duane Merrill and Andrew Grimshaw. 2011. High Performance and Scalable Radix Sorting: A case study of implementing dynamic parallelism for GPU computing. Parallel Processing Letters (PPL) 21, 02 (2011), 245--272.Google ScholarCross Ref
Makoto Saitoh, Elsayed A. Elsayed, Thiem Van Chu, Susumu Mashimo, and Kenji Kise. 2018. High-Performance and Cost-Effective Hardware Merge Sorter without Feedback Datapath. In 2018 IEEE 26th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM). 197--204.Google ScholarCross Ref
Wei Song, Dirk Koch, Mikel Luján, and Jim Garside. 2016. Parallel Hardware Merge Sorter. In 2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM). 95--102.Google Scholar

Recommendations

Cost-Effective and High-Throughput Merge Network: Architecture for the Fastest FPGA Sorting Accelerator
HEART '16

High-performance sorting is used in various areas such as database transactions and genomic feature operations. To improve sorting performance, in addition to the conventional approach of using general purpose processors or GPUs, the approach of using ...
Read More
From software to accelerators with LegUp high-level synthesis
CASES '13: Proceedings of the 2013 International Conference on Compilers, Architectures and Synthesis for Embedded Systems

Embedded system designers can achieve energy and performance benefits by using dedicated hardware accelerators. However, implementing custom hardware accelerators for an application can be difficult and time intensive. LegUp is an open-source high-level ...
Read More
Implementing high-performance, low-power FPGA-based optical flow accelerators in C
ASAP '13: Proceedings of the 2013 IEEE 24th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

Recent developments in High-Level Synthesis (HLS) for FPGAs are making it possible to “run” C code on FPGAs thereby making modern programming environments available to FPGA developers. In this paper, C code for a complex optical-flow algorithm is ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

HEART '18: Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies
June 2018
125 pages
ISBN:9781450365420
DOI:10.1145/3241793

Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 20 June 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
FPGA
merge network
sorting accelerator
swap operation
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate22of50submissions,44%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 61
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Swap Based Merge Network for High Performance Sorting Accelerators

HEART '18: Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies

ABSTRACT

References

Cited By

Recommendations

Cost-Effective and High-Throughput Merge Network: Architecture for the Fastest FPGA Sorting Accelerator

From software to accelerators with LegUp high-level synthesis

Implementing high-performance, low-power FPGA-based optical flow accelerators in C

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Swap Based Merge Network for High Performance Sorting Accelerators

HEART '18: Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies

ABSTRACT

References

Cited By

Recommendations

Cost-Effective and High-Throughput Merge Network: Architecture for the Fastest FPGA Sorting Accelerator

From software to accelerators with LegUp high-level synthesis

Implementing high-performance, low-power FPGA-based optical flow accelerators in C

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media