Abstract
Eliminating redundant computations is a common approach to improve the performance of ReRAM-based DNN accelerators. While existing practical ReRAM-based accelerators eliminate part of the redundant computations by exploiting sparsity in inputs and weights or utilizing weight patterns of DNN models, they fail to identify all the redundancy, resulting in many unnecessary computations. Thus, we propose a practical design, RePIM, that is the first to jointly exploit the repetition of both inputs and weights. Our evaluation shows that RePIM is effective in eliminating unnecessary computations, achieving an average of 15.24× speedup and 96.07% energy savings over the state-of-the-art practical ReRAM-based accelerator.
Original language | English |
---|---|
Title of host publication | 2021 58th ACM/IEEE Design Automation Conference, DAC 2021 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 589-594 |
Number of pages | 6 |
ISBN (Electronic) | 9781665432740 |
DOIs | |
State | Published - 05 12 2021 |
Externally published | Yes |
Event | 58th ACM/IEEE Design Automation Conference, DAC 2021 - San Francisco, United States Duration: 05 12 2021 → 09 12 2021 |
Publication series
Name | Proceedings - Design Automation Conference |
---|---|
Volume | 2021-December |
ISSN (Print) | 0738-100X |
Conference
Conference | 58th ACM/IEEE Design Automation Conference, DAC 2021 |
---|---|
Country/Territory | United States |
City | San Francisco |
Period | 05/12/21 → 09/12/21 |
Bibliographical note
Publisher Copyright:© 2021 IEEE.