Abstract
Convolutional neural networks (CNN) in deep learning have become popular in many of the latest applications from speech recognition to image classification and object detection. Among them, YOLO (You only look once) is a well-known algorithm in object detection. YOLO convolutional neural networks require a lot of multiplication and accumulation calculations. On the edge, special hardware needs to be designed to speed up the calculation. In order to reduce hardware costs, a new distributed arithmetic (DA) architecture similar to NEDA is proposed. The multipliers is replaced by adders. The purpose is to reduce the cost of power consumption and area while maintaining high speed and high precision. Mathematical analysis proves that DA can only use addition to achieve multiplication in the form of two's complement, and then perform data shift at the end to implement the operation of the adder, not the multiplier. In addition, in this paper, after convolution, maximum pooling is performed to reduce the bandwidth. Finally, the biggest feature of this article is that PE can perform 1.78 MAC operations in one clock cycle.
Original language | English |
---|---|
Title of host publication | 2020 IEEE International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2020 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781728173993 |
DOIs | |
State | Published - 28 09 2020 |
Externally published | Yes |
Event | 7th IEEE International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2020 - Taoyuan, Taiwan Duration: 28 09 2020 → 30 09 2020 |
Publication series
Name | 2020 IEEE International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2020 |
---|
Conference
Conference | 7th IEEE International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2020 |
---|---|
Country/Territory | Taiwan |
City | Taoyuan |
Period | 28/09/20 → 30/09/20 |
Bibliographical note
Publisher Copyright:© 2020 IEEE.