I. Introduction
Deep neural networks (DNNs) have shown unprecedented performance on various intelligent tasks such as image recognition, speech recognition, and natural language processing. Despite substantial accuracy improvements, their high demands on memory storage and computational resources constrain the on-chip implementation of DNNs. For example, AlexNet [1] has 61M parameters and requires 1.5B high-precision operations to classify one image, making it prohibitive to implement the entire architecture directly on chip. When off-chip memory such as DRAM is involved, the intensive data movement between the on-chip processor and off-chip DRAM leads to high energy consumption.

Recently, Binary Neural Networks (BNNs) [2], [3] have been proposed that provide classification accuracy comparable to conventional high-precision neural networks on various datasets (e.g., MNIST, CIFAR-10, and ImageNet). In these BNNs, the weights and neuron values are constrained to binary values (i.e., +1 and −1), so the memory storage size is drastically reduced. Moreover, high-precision multiply operations can be replaced by bit-wise operations, reducing the computational workload as well. Thus, BNNs provide a promising solution for the on-chip implementation of DNNs.
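To make the bit-wise replacement concrete, consider the standard XNOR-popcount formulation commonly used for BNN inference: encoding +1 as bit 1 and −1 as bit 0, the dot product of two n-element {+1, −1} vectors equals 2·popcount(XNOR(a, w)) − n. The following minimal Python sketch illustrates this identity; the function name binary_dot and the bit-packing helper are our own illustrative choices, not the implementation of [2] or [3].

```python
import numpy as np

def binary_dot(a_bits: int, w_bits: int, n: int) -> int:
    """Dot product of two {+1, -1} vectors packed as bits (1 -> +1, 0 -> -1).

    XNOR marks positions where the two signs agree; each agreement
    contributes +1 and each disagreement -1, so the dot product is
    2 * popcount(XNOR) - n.
    """
    mask = (1 << n) - 1                  # keep only the n valid bit positions
    agree = ~(a_bits ^ w_bits) & mask    # XNOR: 1 where the signs match
    return 2 * bin(agree).count("1") - n

def pack(v):
    """Encode a {+1, -1} vector as an integer bitmask (+1 -> 1, -1 -> 0)."""
    return sum(1 << i for i, x in enumerate(v) if x == 1)

# Check against a conventional +/-1 dot product on random vectors.
rng = np.random.default_rng(0)
n = 16
a = rng.choice([-1, 1], size=n)
w = rng.choice([-1, 1], size=n)
assert binary_dot(pack(a), pack(w), n) == int(a @ w)
```

In hardware, the XNOR and popcount map to simple logic gates and an adder tree, which is why this substitution removes the need for high-precision multipliers.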