High-Performance Asynchronous CNN Accelerator with Early Termination

Tan Rong Loo, T. Hui Teo, Mulat Ayinet Tiruye, I. Chyn Wey

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Convolutional Neural Network (CNN), especially very deep networks, are highly computation intensive, resulting in long delays and high-power consumption. The dynamically varying environmental conditions in real world inference can result in highly complex problems, and hence the need for these inefficient deep networks to guarantee satisfactory accuracy all the time. Several studies employ approximation techniques to execute partial computation of the network, in attempt to reduce the amount of computation were unnecessary. However, such approaches are still highly sequential in nature, since they still need to run the whole network. This paper proposes an early termination architecture on an already-trained CNN to allow for testing the partial results midway through the network, reducing computations by terminating the main network when it is sufficient as the inference results. The first proposal is implemented in synchronous circuit, however, due to its nature all memory elements are required to capture even when no new data is generated. The second proposal employs the use of asynchronous circuit to significantly reducing power consumption and further sped up the architecture since an operation need not wait the slowest critical path in the circuit. The proposed circuits were designed on FPGA platform. The results of the asynchronous circuit show a nearly 20% increment in speed with about 12% reduction in power consumption in comparison with a synchronous circuit.

Original languageEnglish
Title of host publicationProceedings - 2022 IEEE 15th International Symposium on Embedded Multicore/Many-Core Systems-on-Chip, MCSoC 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages140-144
Number of pages5
ISBN (Electronic)9781665464994
DOIs
StatePublished - 2022
Event15th IEEE International Symposium on Embedded Multicore/Many-Core Systems-on-Chip, MCSoC 2022 - Penang, Malaysia
Duration: 19 12 202222 12 2022

Publication series

NameProceedings - 2022 IEEE 15th International Symposium on Embedded Multicore/Many-Core Systems-on-Chip, MCSoC 2022

Conference

Conference15th IEEE International Symposium on Embedded Multicore/Many-Core Systems-on-Chip, MCSoC 2022
Country/TerritoryMalaysia
CityPenang
Period19/12/2222/12/22

Bibliographical note

Publisher Copyright:
© 2022 IEEE.

Keywords

  • asynchronous circuit
  • convolutional neural network
  • early termination
  • field-programmable gate array

Fingerprint

Dive into the research topics of 'High-Performance Asynchronous CNN Accelerator with Early Termination'. Together they form a unique fingerprint.

Cite this