I. Introduction
The binary neural network (BNN) [1], [2] has become a compelling choice for achieving the energy efficiency required by artificial intelligence of things (AIoT) [3], [4], [5], [6] applications. Both its pre-trained weights and its input activations are aggressively quantized to ±1. The computations of its binary convolution layers thus reduce to XNOR operations between the weights and activations, followed by popcount accumulation, and its activation function reduces to a simple binary (sign) function. As a result, substantial savings in hardware resources and energy are achieved while acceptable accuracy is maintained for AIoT inference tasks. To further improve the energy efficiency of BNNs, promising computing architectures such as in-memory computing (IMC) [7], [8], [9], [10], [11], [12], [13], [14], [15], [16] are being actively explored to minimize the excessive data movement between arithmetic/logic units and on-chip memories.
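As a minimal sketch of the computation described above (not taken from any cited design; the function names `to_bits`, `binary_dot`, and `sign_activation` are illustrative), the following Python snippet shows how a dot product between ±1-valued weights and activations can be evaluated with XNOR and popcount instead of multiply-accumulate operations:

```python
import numpy as np

def to_bits(x):
    """Encode +1 as bit 1 and -1 as bit 0."""
    return (x > 0).astype(np.uint8)

def binary_dot(w, a):
    """Dot product of ±1 vectors w and a via XNOR + popcount.

    For ±1 values, w[i]*a[i] == +1 exactly when the bits agree, so the
    dot product equals (#matches) - (#mismatches)
    = 2 * popcount(XNOR(w_bits, a_bits)) - len(w).
    """
    xnor = np.logical_not(np.bitwise_xor(to_bits(w), to_bits(a)))
    popcount = int(np.count_nonzero(xnor))
    return 2 * popcount - len(w)

def sign_activation(x):
    """Binary activation: map the accumulated sum back to ±1."""
    return 1 if x >= 0 else -1

# Usage: the XNOR/popcount result matches the ordinary dot product.
rng = np.random.default_rng(0)
w = rng.choice([-1, 1], size=16)
a = rng.choice([-1, 1], size=16)
assert binary_dot(w, a) == int(np.dot(w, a))
print(binary_dot(w, a), sign_activation(binary_dot(w, a)))
```

In hardware, each lane of this computation is a single XNOR gate feeding a popcount tree, which is what makes BNN convolution layers so much cheaper than their full-precision counterparts.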