The acceleration of pipeline workloads under the FPGA area and bandwidth constraints

Wei Ning Huang*, Sheng Wei Cheng, Che Wei Chang, Yu Chen Wu, Tei Wei Kuo, Yung Chin Hsu, Wen Yih Isaac Tseng, Shih Hao Hung

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

This work is motivated by the advance of heterogeneous computing and the strong demands of workload acceleration in practice. By considering pipeline workloads over FPGA, this paper explores a systematic methodology to configure the hardware instances of each pipeline stage such that the maximum of the execution time of each stage is minimized, where the FPGA allocation with the memory bandwidth constraint is considered. For the target problem, an algorithm is proposed and proved being optimal, and a real implementation study is conducted. In the experimental results, an image filter FPGA implementation can outperform the CPU, GPU, and baseline FPGA solutions by 460%, 73%, and 1030%, respectively. Extensive simulations were also conducted with a large FPGA size to show the scalability of this work.

Original languageEnglish
Title of host publicationRTCSA 2014 - 20th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781479939534
DOIs
StatePublished - 25 09 2014
Event20th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, RTCSA 2014 - Chongqing, China
Duration: 20 08 201422 08 2014

Publication series

NameRTCSA 2014 - 20th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications

Conference

Conference20th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, RTCSA 2014
Country/TerritoryChina
CityChongqing
Period20/08/1422/08/14

Bibliographical note

Publisher Copyright:
© 2014 IEEE.

Fingerprint

Dive into the research topics of 'The acceleration of pipeline workloads under the FPGA area and bandwidth constraints'. Together they form a unique fingerprint.

Cite this