Introduction
Deep learning has solved numerous problems in computer vision [1], [2], [3], [4], [5], [6], speech recognition [7], [8], [9], and natural language processing [10], [11], [12], [13], [14]. Neural networks have been instrumental in outperforming world champions in a diverse range of games from Go to StarCraft [15], [16], and they are now surpassing the diagnostic capability of clinical specialists in numerous medical tasks [17], [18], [19], [20], [21]. However, for all the state-of-the-art models designed every day, a Kaggle [22] contest for state-of-the-art energy efficiency would go to the brain, every time. A new generation of brain-inspired spiking neural networks (SNNs) is poised to bridge this efficiency gap.
The amount of computational power required to run top-performing deep learning models has increased at a rate that far outpaces improvements in hardware efficiency, widening the very energy gap that neuromorphic approaches aim to close.
A. Neuromorphic Computing: A Quick Snapshot
Neuromorphic (“brain-like”) engineering strives to imitate the computational principles of the brain to drive down the energy cost of artificial intelligence systems. To replicate a biological system, we build on three parts.
Neuromorphic sensors that take inspiration from biological sensors, such as the retina or cochlea, and typically record changes in a signal instead of sampling it at regular intervals. Signals are only generated when a change occurs, and the signal is referred to as a “spike.”
Neuromorphic algorithms that learn to make sense of spikes are known as SNNs. Instead of floating point values, SNNs work with single-bit, binary activations (spikes) that encode information over time, rather than in magnitude. As such, SNNs take advantage of low-precision parameters and high spatial and temporal sparsity.
Neuromorphic hardware is specialized for the power-efficient execution of these models. Sparse activations reduce data movement both on and off a chip to accelerate neuromorphic workloads, which can lead to large power and latency gains compared to the same task on conventional hardware.
Armed with these three components, neuromorphic systems are equipped to bridge the efficiency gap between today’s and future intelligent systems.
What lessons can be learned from the brain to build more efficient neural networks? Should we replicate the genetic makeup of a neuron right down to the molecular level [29], [30]? Do we look at the way memory and processing coalesce within neurons and synapses [31], [32]? Or should we aim to extract the learning algorithms that underpin the brain [33]? This article homes in on the intricacies of training brain-inspired neuromorphic algorithms, ultimately moving toward the goal of harnessing natural intelligence to further improve our use of artificial intelligence. SNNs can already be optimized using the tools available to the deep learning community. However, the brain-inspired nature of these emerging sensors, neuron models, and training methods is different enough to warrant a deep dive into biologically inspired neural networks.
B. Neuromorphic Systems in the Wild
The overarching aim is to combine artificial neural networks (ANNs), which have already proven their worth in a broad range of domains, with the potential efficiency of SNNs [34]. So far, SNNs have staked their claim to a range of applications where power efficiency is of utmost importance.
Fig. 1 offers a small window into the uses of SNNs, and their domain only continues to expand. Spiking algorithms have been used to implement low-power artificial intelligence algorithms across the medical, robotics, and mixed-reality domains, among many other fields. Given their power efficiency, initial commercial products often target edge computing applications, close to where the data are recorded.
SNNs have pervaded many streams of deep learning that call for low-power, resource-constrained, and often portable operation. The utility of SNNs even extends to modeling neural dynamics, from individual neurons up to higher level neural systems.
In biosignal monitoring, nerve implants for brain–machine or biosignal interfaces have to preprocess information locally at minimum power and lack the bandwidth to transmit data for cloud computation. Work in this direction using SNNs includes on-chip spike sorting [35], [36], biosignal anomaly detection [37], [38], [39], [40], and brain–machine interfaces [41], [42], [43]. Beyond biomedical intervention, SNN models are also used in robotics, both to make robots more human-like and to drive down their cost of operation [44], [45], [46]. Unmanned aerial vehicles must also operate on tight power budgets to extract as much value as possible from lightweight batteries and have benefited from using neuromorphic processors [47]. Audio signals can be processed with submilliwatt power consumption and low latency on neuromorphic hardware, as SNNs provide an efficient computational mechanism for temporal signal processing [48].
A plethora of efficient computer vision applications using SNNs are reviewed in [49]. SNNs are equally suited to tracking objects such as satellites in the sky for space situational awareness [50], [51] and have been researched to promote sustainable uses of artificial intelligence, such as monitoring material strain in smart buildings [52] and forecasting wind power in remote areas that face power delivery challenges [53]. At the 2018–19 Telluride Neuromorphic and Cognition Workshops, a neuromorphic robot was even built to play foosball [54]!
Beyond neuromorphic applications, SNNs are also used to test theories about how natural intelligence may arise, from the higher level learning rules of the brain [55] and how memories are formed [56] down to the lower level dynamics at the neuronal and synaptic layers [57].
C. Overview of This Article
The brain’s neural circuitry is a physical manifestation of its neural algorithm; understanding one will likely lead to an understanding of the other. This article will home in on one particular aspect of neural models: those that are compatible with modern deep learning. Fig. 2 provides an illustrated overview of the structure of this article, and we will start from the ground up.
In Section II, we will rationalize the commonly accepted advantages of using spikes and derive a spiking neuron model from basic principles.
These spikes are assigned meaning in Section III by exploring various spike encoding strategies, how they impact the learning process, and how objective and regularization functions are used to sway the spiking patterns of an SNN.
In Section IV, the challenges of training SNNs using gradient-based optimization are explored, and several solutions are derived. These include defining derivatives at spike times and using approximations of the gradient.
In doing so, a subtle link between the backpropagation algorithm and the spike timing-dependent plasticity (STDP) learning rule emerges and is used in the subsequent section to derive online variants of backprop, which move toward biologically plausible learning mechanisms.
With that being said, it is time to dive into how we might combine the potential efficiency of SNNs with the high performance of ANNs.
From Artificial to Spiking Neural Networks
The neural code refers to how the brain represents information, and while many theories exist, the code is yet to be cracked. There are several persistent themes across these theories, which can be distilled down to “the three S’s”: spikes, sparsity, and static suppression. These traits are a good starting point to show why the neural code might improve the efficiency of ANNs. Our first observation is given as follows.
Spikes (biological neurons interact via spikes): Neurons primarily process and communicate with action potentials or “spikes,” which are electrical impulses of approximately 100 mV in amplitude. In most neurons, the occurrence of an action potential is far more important than the subtle variations of the action potential [58]. Many computational models of neurons simplify the representation of a spike to a discrete, single-bit, all-or-nothing event [see Fig. 3(a)–(c)]. Communicating high-precision activations between layers and routing them around and between chips are expensive undertakings. Multiplying a high-precision activation with a high-precision weight requires converting the values into integer representations and decomposing the multiplication into multiple additions, which introduces a carry propagation delay. On the other hand, a spike-based approach only requires a weight to be multiplied by a spike (“1”). This trades the cumbersome multiplication process for a simple memory readout of the weight value.
Despite the activation being constrained to a single bit, spiking networks are vastly different from binarized neural networks. What actually matters is the timing of the spike. Time is not a binarized quantity and can be implemented using clock signals that are already distributed across a digital circuit. After all, why not use what is already available?
Sparsity: Biological neurons spend most of their time at rest, silencing a majority of activations to zero at any given time.
Sparse tensors are cheap to store. The space that a simple data structure requires to store a matrix grows with the number of entries to store. In contrast, a data structure to store a sparse matrix only consumes memory in proportion to the number of nonzero elements. Take the following list as an example:
\begin{equation*} [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1].\end{equation*}
Since most of the entries are zero, writing out only the nonzero elements is a far more efficient representation, as would occur in run-length encoding (indexing from zero):
\begin{equation*} ``\textit {1 at position 10; 1 at position 20.}''\end{equation*}
The sparser the list, the more space can be saved. For example, Fig. 3(c) shows how a single action potential can be represented by a sparsely populated vector.
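To make the storage saving concrete, here is a minimal sketch in Python of reducing the dense spike vector above to its nonzero indices; the variable names are purely illustrative:

```python
import numpy as np

# Dense binary spike train from the example above: mostly zeros.
dense = np.zeros(21, dtype=np.uint8)
dense[[10, 20]] = 1

# Sparse representation: store only the positions of the nonzero entries,
# i.e., "1 at position 10; 1 at position 20."
sparse = np.flatnonzero(dense)  # -> array([10, 20])

# 21 stored entries collapse to 2 indices; sparser vectors save even more.
print(dense.size, "entries ->", sparse.size, "indices")
```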
Static suppression (a.k.a., event-driven processing): The sensory system is more responsive to changes than to static input.
The sensory periphery features several mechanisms that promote neuron excitability when subject to dynamic, changing stimuli, while suppressing its response to static, unchanging information. In retinal ganglion cells and the primary visual cortex, the spatiotemporal receptive fields of neurons promote excitable responses to regions of spatial contrast (or edges) over regions of spatial invariance [59]. Analogous mechanisms in early auditory processing include spectrotemporal receptive fields that cause neurons to respond more favorably to changing frequencies in sound over static frequencies [60]. These processes occur on short timescales (milliseconds), while perceptual adaptation has also been observed on longer timescales (seconds) [61], [62], [63], causing neurons to become less responsive to prolonged exposure to fixed stimuli.
A real-world engineering example of event-driven processing is the dynamic vision sensor (DVS), or the “silicon retina,” which is a camera that reports changes in brightness and stays silent otherwise [see Fig. 3(d) and (e)] [64], [65], [66], [67], [68]. This also means that each pixel activates independently of all other pixels, as opposed to waiting for a global shutter to produce a still frame. The reduction of active pixels leads to huge energy savings compared to conventional CMOS image sensors. This mix of low-power and asynchronous pixels allows for fast clock speeds, giving commercially available DVS cameras a microsecond temporal resolution without breaking a sweat [69]. The difference between a conventional frame-based camera and an event-based camera is illustrated in Fig. 4.
Neurons communicate via spikes. (a) Diagram of a neuron. (b) Measuring an action potential propagated along the axon of a neuron. Fluctuating subthreshold voltages are present in the soma but become severely attenuated over distances beyond 1 mm [58]. Only the action potential is detectable along the axon. (c) Neuron’s spike is approximated with a binary representation. (d) Event-driven processing. Only dynamic segments of a scene are passed to the output (“1”), while static regions are suppressed (“0”). (e) Active pixel sensor and DVS.
Functional difference between a conventional frame-based camera (top) and an event-based camera/silicon retina (bottom). The former records the scene as a sequence of images at a fixed frame rate. It operates independently of activity in the scene and can result in motion blur due to the global shutter. The silicon retina’s output is directly driven by visual activity in the scene, as every pixel reacts to a change in illuminance.
A. Spiking Neurons
ANNs and SNNs can model the same types of network topologies, but SNNs swap the artificial neuron model for a spiking neuron model instead (see Fig. 5). Much like the artificial neuron model [70], spiking neurons operate on a weighted sum of inputs. Rather than passing the result through a sigmoid or rectified linear unit (ReLU) nonlinearity, the weighted sum contributes to the membrane potential $U(t)$ of the neuron.
Leaky IF neuron model. (a) Insulating bilipid membrane separates the intracellular and extracellular medium. Gated ion channels allow charge carriers, such as Na+, to diffuse through the membrane. (b) Capacitive membrane and resistive ion channels form an RC circuit. When the membrane potential exceeds a threshold $\theta$, a spike is generated.
These dynamics were quantified back in 1907 [75]. Lapicque stimulated the nerve fiber of a frog leg using a hacked-together current source and observed how long it took the frog leg to twitch based on the amplitude and duration of the driving current $I_{\textrm{in}}(t)$. Treating the membrane as the RC circuit described above yields the passive membrane dynamics
\begin{equation*} \tau \frac {dU(t)}{dt} = -U(t) + I_{\textrm {in}}(t)R \tag{1}\end{equation*}
where $\tau = RC$ is the time constant of the membrane.
For a constant input current, the general solution of (1) is
\begin{equation*} U(t) = I_{\textrm {in}}R + [U_{0} - I_{\textrm {in}}R]e^{-\frac {t}{\tau }} \tag{2}\end{equation*}
where $U_{0}$ is the initial membrane potential at $t = 0$.
To make this compatible with a discrete-time simulation, the forward Euler method can be used to approximate the solution
\begin{equation*} U[t]=\beta U[t-1] + (1-\beta)I_{\textrm {in}}[t] \tag{3}\end{equation*}
where $\beta$ is the decay rate of the membrane potential.
In deep learning, the weighting factor of an input is typically a learnable parameter. Relaxing the physically viable assumptions made thus far, the coefficient of the input current in (3), $(1-\beta)$, is subsumed into a learnable weight $W$, and the input current is replaced by the weighted input $WX[t]$. Adding a reset term, which activates in the step after an output spike is triggered, yields
\begin{equation*} U[t] = \underbrace {\beta U[t-1]}_{\textrm {decay}} + \underbrace {WX[t]}_{\textrm {input}} - \underbrace {S_{\textrm {out}}[t-1]\theta }_{\textrm {reset}}. \tag{4}\end{equation*}
An output spike $S_{\textrm{out}}[t]$ is generated whenever the membrane potential exceeds the threshold $\theta$
\begin{align*} S_{\textrm {out}}[t] = \begin{cases} \displaystyle 1, & \textrm {if $U[t] > \theta $} \\ \displaystyle 0, & \textrm {otherwise.} \end{cases} \tag{5}\end{align*}
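The discrete-time dynamics of (4) and (5) reduce to a few lines of code. A minimal sketch in Python, with $\beta$, $\theta$, the weight, and the input all chosen arbitrarily for illustration:

```python
import torch

beta, theta = 0.9, 1.0      # decay rate and firing threshold (illustrative values)
W = torch.tensor(0.3)       # learnable input weight
X = torch.ones(100)         # input spike train: constant input for illustration

U = torch.tensor(0.0)       # membrane potential
S_out = torch.tensor(0.0)   # output spike from the previous time step
spikes = []
for t in range(100):
    U = beta * U + W * X[t] - S_out * theta  # (4): decay + weighted input - soft reset
    S_out = (U > theta).float()              # (5): fire when U exceeds the threshold
    spikes.append(S_out.item())
```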
In your exploration of leaky IF neurons, you may come across slight variants.
The spike threshold may be applied before updating the membrane potential. This induces a one-step delay between the input signal $X$ and when it can trigger a spike.
The above derivations use a “reset-by-subtraction” (or soft reset) mechanism. An alternative, shown in Appendix A1, is a “reset-to-zero” (or hard reset) mechanism.
The factor $(1-\beta)$ from (3) may be included as a coefficient to the input term, $WX$. This will allow you to simulate a neuron model with realistic time constants but does not offer any advantages when ultimately applied to deep learning.
An extensive list of alternative neuron types is detailed in Section II-B, along with a brief overview of their use cases.
A graphical depiction of the LIF neuron is provided in Fig. 6. The recurrent neuron in (a) is “unrolled” across time steps in (b), where the reset mechanism is included via the $-S_{\textrm{out}}\theta$ term from (4).
Computational steps in solving the leaky IF neuron model. (a) Recurrent representation of a spiking neuron. Hidden state decay is referred to as “implicit recurrence,” and external feedback from the spike is “explicit recurrence.” (b) The same neuron unrolled across time steps.
B. Alternative Spiking Neuron Models
The leaky IF neuron is one of the many spiking neuron models. Some other models that you might encounter are listed as follows.
IF: The leakage mechanism is removed; $\beta = 1$ in (4).
Current-based: Often referred to as CuBa neuron models, these incorporate synaptic conductance variation into leaky IF neurons. If the default LIF neuron is a first-order low-pass filter, then CuBa neurons are a second-order low-pass filter. The input spike train undergoes two rounds of “smoothing,” which means that the membrane potential has a finite rise time rather than experiencing discontinuous jumps in response to incoming spikes [78], [79], [80]. A depiction of such a neuron with a finite rise time of membrane potential is shown in Fig. 5(d).
Recurrent neurons: The output spikes of a neuron are routed back to the input, labeled in Fig. 6(a) with explicit recurrence. Rather than an alternative model, recurrence is a topology that can be applied to any other neuron and can be implemented in different ways: one-to-one recurrence, where each neuron routes its own spike to itself, or all-to-all recurrence, where the output spikes of a full layer are weighted and summed (e.g., via a dense or convolutional layer), before being fed back to the full layer [81].
Kernel-based models: Also known as the spike-response model, where a predefined kernel (such as the “alpha function”: see Appendix C1) is convolved with input spikes [72], [73], [74]. Having the option to define the kernel to be any shape offers significant flexibility.
Deep learning inspired spiking neurons: Rather than drawing upon neuroscience, it is just as possible to start with primitives from deep learning and apply spiking thresholds. This helps with extending the short-term capacity of basic recurrent neurons. A couple of examples include spiking LSTMs [39] and Legendre memory units [82]. More recently, transformers have been used to further improve long-range memory dependencies in data. In a similar manner, SpikeGPT approximated self-attention into a recurrent model, providing the first demonstration of natural language generation with SNNs [83].
Higher complexity neuroscience-inspired models: A large variety of more detailed neuron models are out there. These account for biophysical realism and/or morphological details that are not represented in simple leaky integrators. The most renowned models include the Hodgkin–Huxley model [77] and the Izhikevich (or resonator) model [84], which can reproduce electrophysiological results with better accuracy.
The main takeaway is given as follows: use the neuron model that suits your task. Power-efficient deep learning will call for LIF models. Improving performance may call for using recurrent SNNs. Driving performance even further (often at the expense of efficiency) may demand methods derived from deep learning, such as spiking LSTMs and recurrent spiking transformers [83]. Or perhaps, deep learning is not your goal. If you are aiming to construct a brain model or are tasked with an exploration of linking low-level dynamics (ionic, conductance-driven, or otherwise) with higher order brain function, then perhaps, more detailed, biophysically accurate models will be your friend.
Having formulated a spiking neuron in a discrete-time, recursive form, we can now “borrow” the developments in training recurrent neural networks (RNNs) and sequence-based models. This recursion is illustrated using an “implicit” recurrent connection for the decay of the membrane potential and is distinguished from “explicit” recurrence, where the output spikes $S_{\textrm{out}}$ are fed back to the input.
While there are plenty more physiologically accurate neuron models [77], the leaky IF model is the most prevalent in gradient-based learning due to its computational efficiency and ease of training. Before moving onto training SNNs in Section IV, let us gain some insight into what spikes actually mean and how they might represent information in Section III.
Neural Code
Light is what we see when the retina converts photons into spikes. Odors are what we smell when the nose processes volatilized molecules into spikes. Tactile perceptions are what we feel when our nerve endings turn pressure into spikes. The brain trades in the global currency of the spike. If all spikes are treated identically, then how do they carry meaning? With respect to spike encoding, there are two parts of a neural network that must be treated separately (see Fig. 7).
Input encoding: Conversion of input data into spikes, which are then passed to a neural network.
Output decoding: Training the output of a network to spike in a way that is meaningful and informative.
Input data to an SNN may be converted into a firing rate, a firing time, or the data can be delta modulated. Alternatively, the input to the network can also be passed in without conversion, which experimentally represents a direct or variable current source applied to the input layer of neurons. The network itself may be trained to enable the correct class to have the highest firing rate or to fire first, among many other encoding strategies.
A. Input Encoding
Input data to an SNN do not necessarily have to be encoded into spikes. It is acceptable to pass continuous values as input, much like how the perception of light starts with a continuous stream of photons impinging upon our photoreceptor cells.
Static data, such as an image, can be treated as a direct current (dc) input with the same features passed to the input layer of the SNN at every time step. However, this does not exploit the way SNNs extract meaning from temporal data. In general, three encoding mechanisms have been popularized with respect to input data.
Rate coding converts input intensity into a firing rate or spike count.
Latency (or temporal) coding converts input intensity to a spike time.
Delta modulation converts a temporal change of input intensity into spikes and otherwise remains silent.
1) Rate-Coded Inputs:
How does the sensory periphery encode information about the world into spikes? When bright light is incident upon our photoreceptor cells, the retina triggers a spike train to the visual cortex. Hubel and Wiesel’s Nobel prize-winning research on visual processing indicates that a brighter input or a favorable orientation of light corresponds to a higher firing rate [59]. As a rudimentary example, a bright pixel is encoded into a high-frequency firing rate, whereas a dark pixel would result in low-frequency firing. Measuring the firing rate of a neuron can become quite nuanced. The simplest approach is to apply an input stimulus to a neuron, count up the total number of action potentials it generates, and divide that by the duration of the trial. Although straightforward, the problem here is that the neuronal dynamics vary across time. There is no guarantee that the firing rate at the start of the trial is anything near the rate at the end.
An alternative method counts the spikes over a very short time interval $\Delta t$ and averages this count over multiple trials or over a population of neurons, smoothing out the variability of any single neuron’s response.
This representation is quite convenient for sequential neural networks. Each discrete-time step in an RNN can be thought of as lasting for a brief duration $\Delta t$, during which a spike either occurs or does not.
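A common way to realize rate coding in practice is to treat each time step as a Bernoulli trial whose success probability is the normalized input intensity. A minimal sketch (the image size and step count are placeholder assumptions):

```python
import torch

num_steps = 100
img = torch.rand(28, 28)  # normalized pixel intensities in [0, 1]

# One independent Bernoulli trial per time step: P(spike) = pixel intensity.
spike_train = torch.bernoulli(img.expand(num_steps, 28, 28))

# The empirical firing rate converges to the intensity as num_steps grows.
print((spike_train.mean(dim=0) - img).abs().max())
```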
2) Latency-Coded Inputs:
A latency, or temporal, code is concerned with the timing of a spike. The total number of spikes is no longer consequential. Rather, when the spike occurs is what matters. For example, a time-to-first-spike mechanism encodes a bright pixel as an early spike, whereas a dark input will spike last or simply never spike at all. Compared to the rate code, latency-encoding mechanisms assign much more meaning to each individual spike.
Neurons can respond to sensory stimuli over an enormous dynamic range. In the retina, neurons can detect anything from individual photons to an influx of millions of photons [96], [97], [98], [99], [100]. To handle such widely varying stimuli, sensory transduction systems likely compress stimulus intensity with a logarithmic dependency. For this reason, a logarithmic relation between spike times and input feature intensity is ubiquitous in the literature (see Appendix B2) [101], [102].
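A sketch of such a logarithmic time-to-first-spike encoding, where brighter (larger) inputs fire earlier; the clamping and normalization choices here are assumptions for illustration, not a fixed convention:

```python
import torch

def latency_encode(x, num_steps=100, eps=1e-7):
    """Map intensities in (0, 1] to spike times: bright -> early, dark -> late."""
    t = -torch.log(x.clamp(min=eps))             # logarithmic intensity-to-time map
    t = (t / t.max() * (num_steps - 1)).long()   # normalize into the simulation window
    spikes = torch.zeros(num_steps, *x.shape)
    spikes.scatter_(0, t.unsqueeze(0), 1.0)      # one spike per input feature
    return spikes

spike_train = latency_encode(torch.rand(28, 28))  # brightest pixels spike first
```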
Although sensory pathways appear to transmit rate-coded spike trains to our brains, it is likely that temporal codes dominate the actual processing that goes on within the brain. More on this in Section III-B4.
3) Delta Modulated Inputs:
Delta modulation is based on the notion that neurons thrive on change, which underpins the operation of the silicon retina camera that only generates an input when there has been a sufficient change of input intensity over time. If there is no change in your field of view, then your photoreceptor cells are much less prone to firing. Computationally, this would take a time-series input and feed a thresholded matrix difference to the network. While the precise implementation may vary, a common approach requires the difference to be both positive and greater than some predefined threshold for a spike to be generated. This encoding technique is also referred to as “threshold crossing.” Alternatively, changes in intensity can be tracked over multiple time steps, and other approaches account for negative changes. For an illustration, see Fig. 4, where the “background” is not captured over time. Only the moving blocks are recorded, as it is those pixels that are changing.
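A minimal sketch of this thresholded-difference scheme on a one-dimensional time series, taking only positive changes as described above; the threshold value is an arbitrary assumption:

```python
import torch

def delta_modulate(x, threshold=0.05):
    """Spike wherever the signal rises by more than `threshold` between steps."""
    diff = x[1:] - x[:-1]               # temporal difference between adjacent steps
    return (diff > threshold).float()   # spike only on sufficient positive change

signal = torch.sin(torch.linspace(0, 6.28, 100))
spikes = delta_modulate(signal)         # fires only on sufficiently rising segments
```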
The previous techniques tend to “convert” data into spikes. However, it is more efficient to natively capture data in “preencoded,” spiking form. Each pixel in a DVS camera and channel in a silicon cochlear uses delta modulation to record changes in the visual or audio scene. Some examples of neuromorphic benchmark datasets are described in Table 1. A comprehensive series of neuromorphic-relevant datasets are accounted for in NeuroBench [103]. Many of these are readily available for use with open-source libraries, such as Tonic [104].
B. Output Decoding
Encoding input data into spikes can be thought of as how the sensory periphery transmits signals to the brain. On the other side of the same coin, decoding these spikes provides insight into how the brain handles these encoded signals. In the context of training an SNN, the encoding mechanism does not constrain the decoding mechanism. Shifting our attention from the input of an SNN, how might we interpret the firing behavior of output neurons?
Rate coding chooses the output neuron with the highest firing rate, or spike count, as the predicted class.
Latency (or temporal) coding chooses the output neuron that fires first as the predicted class.
Population coding relies on multiple neurons per class. This is typically used in conjunction with rate coding, rank order coding, or N-of-M coding [105], [106].
1) Rate-Coded Outputs:
Consider a multiclass classification problem with one output neuron per class. Under a rate code, the network is trained such that the neuron assigned to the correct class fires with the highest spike count (or frequency) across the simulation runtime, and the prediction is read out as the class of the most active neuron.
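In code, rate-coded decoding amounts to a spike count followed by an argmax. A sketch, assuming the SNN output is a binary tensor of shape (num_steps, batch, num_classes); the random tensor is a stand-in for real network output:

```python
import torch

spk_out = torch.randint(0, 2, (100, 32, 10)).float()  # stand-in for SNN output spikes

spike_count = spk_out.sum(dim=0)       # total spikes per output neuron
predicted = spike_count.argmax(dim=1)  # predicted class = most active neuron
```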
2) Latency-Coded Outputs:
There are numerous ways a neuron might encode data in the timing of a spike. As in the case of latency-coded inputs, it could be that a neuron representing the correct class fires first. This addresses the energy burden that arises from the multiple spikes needed in rate codes. In hardware, the need for fewer spikes reduces the frequency of memory accesses, which is another computational burden in deep learning accelerators.
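Decoding by first spike is similarly compact. A sketch under the same output-shape assumption as before, where neurons that never fire are assigned the final time step so that they are never selected:

```python
import torch

spk_out = torch.randint(0, 2, (100, 32, 10)).float()  # stand-in for SNN output spikes

num_steps = spk_out.shape[0]
t = torch.arange(num_steps).view(-1, 1, 1).float()
# Silent neurons receive time = num_steps, so they can never fire "first."
times = torch.where(spk_out > 0, t, torch.full_like(spk_out, num_steps))
first_spike = times.min(dim=0).values   # earliest spike time per output neuron
predicted = first_spike.argmin(dim=1)   # predicted class = neuron that fired first
```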
Biologically, does it make sense for neurons to operate on a time to first spike principle? How might we define “first” if our brains are not constantly resetting to some initial, default state? This is quite easy to conceptually address. The idea of a latency or temporal code is motivated by our response to a sudden input stimulus. For example, when viewing a static, unchanging visual scene, the retina undergoes rapid, yet subtle, saccadic motion. The scene projected onto the retina changes every few hundreds of milliseconds. It could very well be the case that the first spike must occur with respect to the reference signal generated by this saccade.
3) Population-Coded Outputs:
We know that the reaction time of a human is roughly in the ballpark of 250 ms. If the average firing rate of a neuron in the human brain is on the order of 10 Hz, then we can only process about two to three spikes within our reaction time. However, this often cited 10-Hz assumption should be treated as an upper limit. When experimentalists are hunting for neurons using a single microelectrode, low-rate neurons might be bypassed as they do not generate enough spikes for data analysis or are simply missed altogether. As a result, high-rate neurons may have become significantly overrepresented in the literature. This is supported by data collected from chronic implants in the macaque hippocampus, which routinely yields neurons with background firing rates below 0.1 Hz [114]. One would have to wait at least 10 s before observing a single spike!
This can be addressed by using a distributed representation of information across a population of neurons: if a single neuron is limited in its spike count within a brief time window, then just use more neurons [85]. The spikes from a subgroup of neurons can be pooled together to make more rapid decisions. Interestingly, population codes trade sequential processing for parallelism, which maps well onto GPUs when training SNNs [81].
4) Rate Versus Latency Code:
Whether neurons encode information as a rate, as a latency, or as something wholly different is a topic of much controversy. We do not seek to crack the neural code here but instead aim to provide intuition on when SNNs might benefit from one code over the other.
Advantages of Rate Codes:
Error tolerance: If a neuron fails to fire, there are ideally many more spikes to reduce the burden of this error.
More spiking promotes more learning: Additional spikes provide a stronger gradient signal for learning via error backpropagation. As will be described in Section IV, the absence of spiking can impede learning convergence (more commonly referred to as the “dead neuron problem”).
Advantages of Latency Codes:
Power consumption: Generating and communicating fewer spikes means less dynamic power dissipation in tailored hardware. It also reduces memory access frequency due to sparsity, as a vector–matrix product for an all-zero input vector returns a zero output.
Speed: The reaction time of a human is roughly in the ballpark of 250 ms. If the average firing rate of a neuron in the human brain is on the order of 10 Hz (which is likely an overestimation [114]), then one can only process about two to three spikes in this reaction time window. In contrast, latency codes rely on a single spike to represent information. This issue with rate codes may be addressed by coupling it with a population code: if a single neuron is limited in its spike count within a brief time window, then just use more neurons [85]. This comes at the expense of further exacerbating the power consumption problem of rate codes.
The power consumption benefit of latency codes is also supported by observations in biology, where nature optimizes for efficiency. Olshausen and Field’s [114] work in “What is the other 85% of V1 doing?” methodically demonstrates that rate-coding can only explain, at most, the activity of 15% of neurons in the primary visual cortex (V1). If our neurons indiscriminately defaulted to a rate code, this would consume an order of magnitude more energy than a temporal code. The mean firing rate of our cortical neurons must necessarily be rather low, which is supported by temporal codes.
Lesser explored encoding mechanisms in gradient-based SNNs include using spikes to represent a prediction or reconstruction error [115]. The brain may be perceived as an anticipatory machine that takes action based on its predictions. When these predictions do not match reality, spikes are triggered to update the system.
Some assert that the true code must lie between rate and temporal codes [116], while others argue that the two may coexist and only differ based on the timescale of observation: rates are observed for long timescales and latency for short timescales [117]. Some reject rate codes entirely [118]. This is one of those instances where a deep learning practitioner might be less concerned with what the brain does and prefers to focus on what is most useful.
C. Objective Functions
While it is unlikely that our brains use something as explicit as a cross-entropy loss function, it is fair to say that humans and animals have baseline objectives [121]. Biological variables, such as dopamine release, have been meaningfully related to objective functions from reinforcement learning [122]. Predictive coding models often aim to minimize the information entropy of sensory encodings such that the brain can actively predict incoming signals and inhibit what it already expects [123]. The multifaceted nature of the brain’s function likely calls for the existence of multiple objectives [124]. How the brain can be optimized using these objectives remains a mystery, though we might gain insight from multiobjective optimization [125].
A variety of loss functions can be used to encourage the output layer of a network to fire as a rate or temporal code. The optimal choice is largely unsettled and tends to be a function of the network hyperparameters and the complexity of the task at hand. All objective functions described in the following have successfully trained networks to competitive results on a variety of datasets though they come with their own tradeoffs.
1) Spike Rate Objective Functions:
A summary of approaches commonly adopted in supervised learning classification tasks with SNNs to promote the correct neuron class to fire with the highest frequency is provided in Table 2. In general, either the cross-entropy loss or the mean square error (MSE) is applied to the spike count or the membrane potential of the output layer of neurons.
With a sufficient number of time steps, passing the spike count to the objective function is more widely adopted, as it operates directly on spikes. The membrane potential acts only as a proxy for increasing the spike count, and it is not considered an observable variable, so operating on it may partially offset the computational benefits of using spikes.
Cross-entropy approaches aim to suppress the spikes from incorrect classes, which may drive weights in a network to zero. This could cause neurons to go quiet in the absence of additional regularization. By using the mean square spike rate, which specifies a target number of spikes for each class, output neurons can be placed on the cusp of firing. Therefore, the network may adapt to changing inputs with a faster response time than neurons that have their firing completely suppressed.
In networks that simulate a constrained number of time steps, a small change in weights is unlikely to cause a change in the spike count of the output. It might be preferable to apply the loss function directly to a more “continuous” signal, such as the membrane potential instead. This comes at the expense of operating on a full precision hidden state, rather than on spikes. Alternatively, using population coding can distribute the cost burden over multiple neurons to increase the probability that a weight update will alter the spiking behavior of the output layer. It also increases the number of pathways through which error backpropagation may take place and improve the chance that a weight update will generate a change in the global loss.
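As an illustration of the spike count approach, a cross-entropy loss can treat the summed spikes per class as logits. A minimal sketch with placeholder tensors (a random stand-in replaces recorded output spikes so the example is self-contained):

```python
import torch
import torch.nn.functional as F

spk_out = torch.rand(100, 32, 10, requires_grad=True)  # stand-in for output spikes
targets = torch.randint(0, 10, (32,))

logits = spk_out.sum(dim=0)              # spike count per output neuron
loss = F.cross_entropy(logits, targets)  # push the correct class to fire most
loss.backward()
```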
2) Spike Time Objectives:
Loss functions that implement spike timing objectives are less commonly used than rate-coded objectives. Two possible reasons may explain why: 1) error rates are typically perceived to be the most important metric in deep learning literature, and rate codes are more tolerant to noise and 2) temporal codes are marginally more difficult to implement. A summary of approaches is provided in Table 3.
The use cases of these objectives are analogous to the spike rate objectives. A subtle challenge with using spike times is that the default implementation assumes each neuron spikes at least once, which is not necessarily the case. This can be handled by forcing a spike at the final time step in the event when a neuron does not fire [120].
Several state-of-the-art models wholly abandon spiking neurons at the output and train their models using a “read-out layer.” This often consists of an IF layer with infinitely high thresholds (i.e., they will never fire) or with typical artificial neurons that use standard activation functions (ReLU, sigmoid, softmax, and so on). While this often improves accuracy, this may not qualify as a fully spiking network. Does this actually matter? If one can still achieve power efficiency, then engineers will be happy and that is often all that matters.
D. Learning Rules
1) Spatial and Temporal Credit Assignment:
Once a loss has been determined, it must somehow be used to update the network parameters with the hope that the network will iteratively improve at the trained task. Each weight takes some blame for its contribution to the total loss, and this is known as “credit assignment.” This can be split into the spatial and temporal credit assignment problems. Spatial credit assignment aims to find the spatial location of the weight contributing to the error, while the temporal credit assignment problem aims to find the time at which the weight contributes to the error. Backpropagation has proven to be an extremely robust way to address credit assignment, but the brain is far more constrained in developing solutions to these challenges.
Backpropagation solves spatial credit assignment by applying a distinct backward pass after a forward pass during the learning process [126]. The backward pass mirrors the forward pass, such that the computational pathway of the forward pass must be recalled. In contrast, action potential propagation along an axon is considered to be unidirectional, which may reject the plausibility of backprop taking place in the brain. Spatial credit assignment is not only concerned with calculating the weight’s contribution to an error but also assigning the error back to the weight. Even if the brain could somehow calculate the gradient (or an approximation), a major challenge would be projecting that gradient back to the synapse and knowing which gradient belongs to which synapse.
This constraint of neurons acting as directed edges is increasingly being relaxed, which could be a mechanism by which errors are assigned to synapses [127]. Numerous bidirectional, nonlinear phenomena occur within individual neurons, which may contribute toward helping errors find their way to the right synapse. For example, feedback connections are observed in most places where there are feedforward connections [128].
2) Biologically Motivated Learning Rules:
With a plethora of neuronal dynamics that might embed variants of backpropagation, what options are there for modifying backprop to relax some of the challenges associated with biologically plausible spatial credit assignment? In general, the more broadly adopted approaches rely on either trading parts of the gradient calculation for stochasticity or otherwise swapping a global error signal for localized errors (see Fig. 8). Conjuring alternative methods to credit assignment that a real-time machine such as the brain can implement is not only useful for developing insight into biological learning [129] but also reduces the cost of data communication in hardware [130]. For example, using local errors can reduce the length a signal must travel across a chip. Stochastic approaches can trade computation with naturally arising circuit noise [131], [132], [133]. A brief summary of several common approaches to mitigating the spatial credit assignment problem is provided in the following [134].
Perturbation learning: A random perturbation of network weights is used to measure the change in error. If the error is reduced, the change is accepted. Otherwise, it is rejected [135], [136], [137]. The difficulty of learning scales with the number of weights, where the effect of a single weight change is dominated by the noise from all other weight changes. In practice, it may take a huge number of trials to average this noise away [55].
Random feedback: Backpropagation requires sequentially transporting the error signal through multiple layers, scaled by the forward weights of each layer. Random feedback replaces the forward weight matrices with random matrices, reducing the dependence of each weight update on distributed components of the network. While this does not fully solve the spatial credit assignment problem, it quells the weight transport problem [138], which is specifically concerned with a weight update in one layer depending upon the weights of far-away layers. Forward- and backward-propagating data are scaled by symmetric weight matrices, a mechanism that is absent in the brain. Random feedback has shown similar performance to backpropagation on simple networks and tasks, which gives hope that a precise gradient may not be necessary for good performance [138]. Random feedback has struggled with more complex tasks though variants have been proposed that reduce the gap [139], [140], [141], [142]. Nonetheless, the mere fact that such a core piece of the backpropagation algorithm can be replaced with random noise and yet somehow still work is a marvel. It is indicative that we still have much left to understand about gradient backpropagation.
Local losses: It could be that the six layers of the cortex are each supplied with their own cost function, rather than a global signal that governs a unified goal for the brain [124]. Early visual regions may try to minimize the prediction error in constituent visual features, such as orientations, while higher areas use cost functions that target abstractions and concepts. For example, a baby learns how to interpret receptive fields before consolidating them into facial recognition. In deep learning, greedy layerwise training assigns a cost function to each layer independently [143], so that only a shallow network is ever trained at any one time. Target propagation is similarly motivated by assigning a reconstruction criterion to each layer [115]. Such approaches exploit the fact that training a shallow network is easier than training a deep one and aim to address spatial credit assignment by ensuring that the error signal does not need to propagate too far [127], [144].
Forward–forward error propagation: The backward pass of a model is replaced with a second forward pass where the input signal is altered based on error or some related metric. Initially proposed by Dellaferrera and Kreiman [145], Hinton’s [146] forward–forward learning algorithm generated more traction soon after. These have not been ported to SNNs at the time of writing, though someone is bound to take up the mantle soon.
A variety of learning rules can be used to train a network. (a) Objective functions. Gradient backpropagation: an unbiased gradient estimator of the loss is derived with respect to each weight. Perturbation learning: weights are randomly perturbed, and changes that reduce the loss are retained. (b) Activity regularization, applied at either the neuron or population level.
These approaches to learning are illustrated in Fig. 8(a). While they are described in the context of supervised learning, many theories of learning place emphasis on self-organization and unsupervised approaches. Hebbian plasticity is a prominent example [147]. However, an intersection may exist in self-supervised learning, where the target of the network is a direct function of the data itself. Some types of neurons may be representative of facts, features, or concepts, only firing when exposed to the right type of stimuli. Other neurons may fire with the purpose of reducing a reconstruction error [148], [149]. By accounting for spatial and temporal correlations that naturally exist around us, such neurons may fire with the intent to predict what happens next. A more rigorous treatment of biological plausibility in objective functions can be found in [124].
E. Activity Regularization
A huge motivator behind using SNNs comes from the power efficiency when processed on appropriately tailored hardware. This benefit is not only from single-bit interlayer communication via spikes but also the sparse occurrence of spikes. Some of the loss functions above, in particular those that promote rate codes, will indiscriminately increase the membrane potential and/or firing frequency without an upper bound, if left unchecked. Regularization of the loss can be used to penalize excessive spiking (or alternatively, penalize insufficient spiking, which is great for discouraging dead neurons). Conventionally, regularization is used to constrain the solution space of loss minimization, thus leading to a reduction in variance at the cost of increasing bias. Care must be taken, as too much activity regularization can lead to excessively high bias. Activity regularization can be applied to alter the behavior of individual neurons or populations of neurons, as depicted in Fig. 8(b).
Population level regularization: This is useful when the metric to optimize is a function of aggregate behavior. For example, the metric may be power efficiency, which is strongly linked to the total number of spikes from an entire network. L1-regularization can be applied to the total number of spikes emitted at the output layer to penalize excessive firing, which encourages sparse activity at the output [150]. Alternatively, for more fine-grain control over the network, an upper activity threshold can be applied. If the total number of spikes for all neurons in a layer exceeds the threshold, only then does the regularization penalty kick in [110], [113] (see Appendix B11).
Neuron level regularization: If neurons completely cease to fire, then learning may become significantly more difficult. Regularization may also be applied at the individual neuron level by adding a penalty for each neuron. A lower activity threshold specifies the lower permissible limit of firing for each neuron before the regularization penalty is applied (see Appendix B12).
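Both levels can be expressed as simple penalty terms added to the task loss. A sketch with assumed thresholds and weightings (the exact forms are given in Appendixes B11 and B12; the values here are illustrative):

```python
import torch

spk_rec = torch.rand(100, 32, 256).round()  # (num_steps, batch, hidden) spike record

# Population level: penalize layerwise spike counts above an upper threshold.
upper_thresh = 500.0                                     # illustrative value
spikes_per_sample = spk_rec.sum(dim=(0, 2))              # total spikes per sample
pop_penalty = torch.relu(spikes_per_sample - upper_thresh).mean()

# Neuron level: penalize neurons whose firing falls below a lower threshold.
lower_thresh = 1.0                                       # illustrative value
count_per_neuron = spk_rec.sum(dim=0).mean(dim=0)        # average count per neuron
neuron_penalty = torch.relu(lower_thresh - count_per_neuron).sum()

loss_reg = 1e-3 * pop_penalty + 1e-3 * neuron_penalty    # added to the task loss
```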
Recent experiments have shown that rate-coded networks (at the output) are robust to sparsity-promoting regularization terms [110], [111], [113]. However, networks that rely on time-to-first-spike schemes have had less success, which is unsurprising given that temporal outputs are already sparse.
Encouraging each neuron to have a baseline spike count helps with the backpropagation of errors through pathways that would otherwise be inactive. Together, the upper and lower limit regularization terms can be used to find the sweet spot of firing activity at each layer. As explained in detail in [151], the variance of activations should be as close as possible to “1” to avoid vanishing and exploding gradients. While modern deep learning practices rely on appropriate parameter initialization to achieve this, these approaches were not designed for nondifferentiable activation functions, such as spikes. By monitoring and appropriately compensating for neuron activity, this may turn out to be a key ingredient to successfully training deep SNNs.
Training Spiking Neural Networks
The rich temporal dynamics of SNNs give rise to a variety of ways in which a neuron’s firing pattern can be interpreted. Naturally, this means that there are several methods for training SNNs. They can generally be classified into the following methods.
Shadow training: A nonspiking ANN is trained and converted into an SNN by interpreting the activations as a firing rate or spike time.
Backpropagation using spikes: The SNN is natively trained using error backpropagation, typically through time as is done with sequential models.
Local learning rules: Weight updates are a function of signals that are spatially and temporally local to the weight, rather than from a global signal as in error backpropagation.
Each approach has a time and place where it outshines the others. We will focus on approaches that apply backprop directly to an SNN, but useful insights can be attained by exploring shadow training and various local learning rules.
The goal of the backpropagation algorithm is loss minimization. To achieve this, the gradient of the loss is computed with respect to each learnable parameter by applying the chain rule from the final layer back to each weight [152], [153], [154]. The gradient is then used to update the weights such that the error is ideally always decreased. If this gradient is “0,” there is no weight update. This has been one of the main roadblocks to training SNNs using error backpropagation due to the nondifferentiability of spikes. This is also known as the dreaded “dead neuron” problem. There is a subtle, but important, difference between “vanishing gradients” and “dead neurons,” which will be explained in Section IV-C.
To gain deeper insight into the nondifferentiability of spikes, recall the discretized solution of the membrane potential of the leaky IF neuron in (4) and the threshold condition in (5).
Addressing the dead neuron problem. Only one time step is shown, where temporal connections and subscripts from Fig. 6 have been omitted for simplicity. (a) Dead neuron problem: the analytical derivative of the spike with respect to the membrane potential is zero almost everywhere and undefined at the threshold, prohibiting learning. (b) Shadow training. (c) Derivative taken at spike times. (d) Surrogate gradients.
A. Shadow Training
The dead neuron problem can be completely circumvented by instead training on a shadow ANN and converting it into an SNN [see Fig. 9(b)]. The high-precision activation function of each neuron is converted into either a spike rate [155], [156], [157], [158], [159] or a latency code [160]. One of the most compelling reasons to use shadow training is that advances in conventional deep learning can be directly applied to SNNs. For this reason, ANN-to-SNN conversion currently takes the crown for static image classification tasks on complex datasets, such as CIFAR-10 and ImageNet. Where inference efficiency is more important than training efficiency, and if input data are not time-varying, then shadow training could be the optimal way to go.
In addition to the inefficient training process, there are several drawbacks. First, the types of tasks that are most commonly benchmarked do not make use of the temporal dynamics of SNNs, and the conversion of sequential neural networks to SNNs is an underexplored area [157]. Second, converting high-precision activations into spikes typically requires a large number of simulation time steps, which may offset the power/latency benefits initially sought from SNNs. However, what really motivates doing away with ANNs is that the conversion process is necessarily an approximation. Therefore, a shadow-trained SNN is very unlikely to reach the performance of the original network.
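One widely used ingredient of rate-based conversion pipelines is data-based weight normalization: each layer's weights are rescaled by the maximum activation observed on calibration data so that ReLU outputs map onto feasible firing rates. A simplified sketch for a stack of fully connected layers; the function and its details are illustrative, not a complete conversion pipeline:

```python
import torch
import torch.nn as nn

@torch.no_grad()
def normalize_weights(model, calib_data):
    """Rescale Linear layers so the max ReLU activation on calib_data becomes 1."""
    x = calib_data
    prev_scale = 1.0
    for layer in model:                       # assumes nn.Sequential(Linear, ReLU, ...)
        if isinstance(layer, nn.Linear):
            a = layer(x * prev_scale)         # activation at the original scale
            scale = a.relu().max().clamp(min=1e-8).item()
            layer.weight.mul_(prev_scale / scale)
            layer.bias.mul_(1.0 / scale)
            prev_scale = scale
        x = layer(x)                          # propagate under the normalized weights

# Usage sketch:
# normalize_weights(nn.Sequential(nn.Linear(784, 128), nn.ReLU(),
#                                 nn.Linear(128, 10)), x_calib)
```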
The issue of long time sequences can be partially addressed by using a hybrid approach: start with a shadow-trained SNN and then perform backpropagation on the converted SNN [161]. Although this degrades accuracy (as reported on CIFAR-10 and ImageNet), it is possible to reduce the required number of time steps by an order of magnitude. A more rigorous treatment of shadow training techniques and challenges can be found in [162].
B. Backpropagation Using Spike Times
An alternative method to sidestep the dead neuron problem is to instead take the derivative at spike times. In fact, this was the first proposed method for training multilayer SNNs using backpropagation [119]. The original approach in SpikeProp observes that, while spikes may be discontinuous, time is continuous. Therefore, taking the derivative of spike timing with respect to the weights achieves functional results. A thorough description is provided in Appendix C1.
Intuitively, SpikeProp calculates the gradient of the error with respect to the spike time. A small change to the weight causes a small change in the membrane potential, which in turn shifts the time at which the membrane crosses the threshold. Because the spike time varies smoothly with the weights, this derivative is well defined.
Several drawbacks arise. Once neurons become inactive, their weights become frozen. In most instances, no closed-form solutions exist to solve for the gradient if there is no spiking [167]. SpikeProp tackles this by modifying parameter initialization (i.e., increasing weights until a spike is triggered). However, since the inception of SpikeProp in 2002, the deep learning community’s understanding of weight initialization has gradually matured. We now know initialization aims to set a constant activation variance between layers, the absence of which can lead to vanishing and exploding gradients through space and time [151], [168]. Modifying weights to promote spiking may detract from this. Instead, a more effective way to overcome the lack of firing is to lower the firing thresholds of the neurons. One may consider applying activity regularization to encourage firing in hidden layers though this has degraded classification accuracy when taking the derivative at spike times. This result is unsurprising, as regularization can only be applied at the spike time rather than when the neuron is quiet.
Another challenge is that it enforces stringent priors upon the network (e.g., each neuron must fire only once), which are incompatible with dynamically changing input data. This may be addressed by using periodic temporal codes that refresh at given intervals, in a similar manner to how visual saccades may set a reference time. However, it is the only approach that enables the calculation of an unbiased gradient without any approximations in multilayer SNNs. Whether this precision is necessary is a matter of further exploration on a broader range of tasks [165].
C. Backpropagation Using Spikes
Instead of computing the gradient with respect to spike times, the most commonly adopted approach over the past several years is to apply the generalized backpropagation algorithm to the unrolled computational graph [see Fig. 6(b)] [73], [107], [156], [169], [170], i.e., backpropagation through time (BPTT). Working backward from the final output of the network, the gradient flows from the loss to all descendants. In this way, computing the gradient through an SNN is mostly the same as that of an RNN by iterative application of the chain rule. Fig. 10(a) depicts the various pathways of the gradient from the loss back to the weight, across both immediate and prior time steps.
BPTT. (a) Present time application of $W$, which exerts an immediate influence on the loss at that step. (b) Prior applications of $W$ at earlier time steps, whose influence on the loss is carried forward through the membrane potential.
Finding the derivative of the total loss with respect to the parameters allows the use of gradient descent to train the network, so the goal is to find $\partial\mathcal{L}/\partial W$. With $W[s]$ denoting the application of the weight at step $s$
\begin{equation*} \frac {\partial \mathcal {L}}{\partial W} = \sum _{t} \frac {\partial \mathcal {L}[t]}{\partial W} = \sum _{t} \sum _{s\leq t}\frac {\partial \mathcal {L}[t]}{\partial W[s]} \frac {\partial W[s]}{\partial W}. \tag{6}\end{equation*}
Since a recurrent system constrains the weight to be shared across all time steps, $\partial W[s]/\partial W = 1$, and (6) simplifies to
\begin{equation*} \frac {\partial \mathcal {L}}{\partial W} = \sum _{t} \sum _{s\leq t}\frac {\partial \mathcal {L}[t]}{\partial W[s]}. \tag{7}\end{equation*}
Thankfully, gradients rarely need to be calculated by hand as most deep learning packages come with an automatic differentiation engine. Isolating the immediate influence at a single time step as in Fig. 9(c) makes it clear that we run into the spike nondifferentiability problem in the term $\partial S/\partial U$, the derivative of the spike with respect to the membrane potential.
The solution is actually quite simple. During the forward pass, as per usual, apply the Heaviside operator to $U[t]$ to determine whether the neuron spikes. During the backward pass, substitute the derivative of the Heaviside operator with that of a continuous, smooth function. This is the surrogate gradient approach, treated in detail in Section IV-D.
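In PyTorch, this forward/backward substitution is naturally expressed as a custom autograd function. A minimal sketch using the threshold-shifted sigmoid derivative that Section IV-D formalizes in (9), with the threshold fixed at $\theta = 1$ for illustration:

```python
import torch

class SurrogateSpike(torch.autograd.Function):
    threshold = 1.0  # theta, fixed for illustration

    @staticmethod
    def forward(ctx, U):
        ctx.save_for_backward(U)
        return (U > SurrogateSpike.threshold).float()  # Heaviside forward pass

    @staticmethod
    def backward(ctx, grad_output):
        (U,) = ctx.saved_tensors
        # Backward pass: threshold-shifted sigmoid derivative, as in (9).
        shifted = torch.exp(SurrogateSpike.threshold - U)
        return grad_output * shifted / (shifted + 1.0) ** 2

spike_fn = SurrogateSpike.apply  # drop-in replacement for the Heaviside step
```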
D. Surrogate Gradients
A major advantage of surrogate gradients is that they help with overcoming the dead neuron problem. To make the dead neuron problem more concrete, consider a neuron with a threshold of $\theta$. Its membrane potential can fall into one of three cases.
Case 1: The membrane potential is below the threshold, $U < \theta$.
Case 2: The membrane potential is above the threshold, $U > \theta$.
Case 3: The membrane potential is exactly at the threshold, $U = \theta$.
In Case 1, no spike is elicited, and the derivative is $\partial S/\partial U = 0$. In Case 2, a spike fires, but the derivative is still $0$. In Case 3, the derivative is undefined, tending to infinity at the discontinuity. The gradient of the spike with respect to the membrane potential is, therefore, zero almost everywhere, and no learning signal can flow: this is the dead neuron problem. Surrogate gradients address it by substituting this term with a smooth function during the backward pass.
One example is to replace the nondifferentiable term with the threshold-shifted sigmoid function but only during the backward pass. This is illustrated in Fig. 9(d). More formally \begin{equation*} \sigma (\cdot) = \frac {1}{1+e^{\theta -U}} \tag{8}\end{equation*}
\begin{equation*} \frac {\partial S}{\partial U} \leftarrow \frac {\partial \tilde {S}}{\partial U} = \sigma '(\cdot) = \frac {e^{\theta - U}}{(e^{\theta -U}+1)^{2}}. \tag{9}\end{equation*}
This means that learning only takes place if there is spiking activity. Consider a synaptic weight $W_{\textrm{in}}$ attached to the input of a spiking neuron and a weight $W_{\textrm{out}}$ attached to its output. The signal chain is as follows.
An input spike, $S_{\textrm{in}}$, is scaled by $W_{\textrm{in}}$.
The weighted spike is added as an input current injection to the spiking neuron [see (4)].
This may cause the neuron to trigger a spike, $S_{\textrm{out}}$.
The output spike is weighted by the output weight, $W_{\textrm{out}}$.
This weighted output spike varies some arbitrary loss function, $\mathcal{L}$.
Let the loss function be the Manhattan distance between a target value $y$ and the weighted output spike \begin{equation*} \mathcal {L} = | W_{\textrm {out}} S_{\textrm {out}} - y|\end{equation*} such that the derivative with respect to the output weight is (up to its sign) \begin{equation*} \frac {\partial \mathcal {L}}{\partial W_{\textrm {out}}} = S_{\textrm {out}}.\end{equation*}
More generally, a spike must be triggered for a weight to be updated. The surrogate gradient does not change this.
Now, consider the case for updating the input weight $W_{\textrm{in}}$ \begin{equation*} \frac {\partial \mathcal {L}}{\partial W_{\textrm {in}}} = \underbrace {\frac {\partial \mathcal {L}}{\partial S_{\textrm {out}}}}_{A} \underbrace {\frac {\partial S_{\textrm {out}}}{\partial U}}_{B} \underbrace {\frac {\partial U}{\partial W_{\textrm {in}}}}_{C}.\end{equation*}

1) Term A is simply $W_{\textrm{out}}$, based on the above equation for $\mathcal{L}$.
2) Term B would almost always be 0, unless substituted for a surrogate gradient.
3) Term C is $S_{\textrm{in}}$ [see (4), where $X = S_{\textrm{in}}$].
To summarize, the surrogate gradient enables errors to propagate to earlier layers, regardless of spiking. However, spiking is still needed to trigger a weight update.
As a practical note, various works empirically explore different surrogate gradients. These include triangular functions, fast sigmoid and sigmoid functions, straight-through estimators, and various other weird shapes. Is there a best surrogate gradient? In our experience, we have found the following function to be the best starting point:\begin{equation*} \frac {\partial \tilde {S}}{\partial U} = \frac {1}{\pi } \frac {1}{1+(\pi U)^{2}}.\end{equation*}
You might see this referred to as the “arctan” surrogate gradient, first proposed in [171]. This is because the integral of this function is \begin{equation*} \tilde {S} = \frac {1}{\pi }\textrm {arctan}(\pi U).\end{equation*}
As of 2023, this is the default surrogate gradient in snnTorch, and it is not wholly clear why it works so well.
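As a rough sketch of how this can be implemented, consider the following custom autograd function, which applies the Heaviside operator on the forward pass and the arctan surrogate on the backward pass (the class name and threshold handling here are illustrative, not snnTorch's own implementation):

import math
import torch

class ArcTanSpike(torch.autograd.Function):
    """Heaviside on the forward pass; arctan surrogate on the backward pass."""

    @staticmethod
    def forward(ctx, mem):
        # mem is assumed to be the threshold-shifted membrane potential, U - theta
        ctx.save_for_backward(mem)
        return (mem > 0).float()

    @staticmethod
    def backward(ctx, grad_output):
        (mem,) = ctx.saved_tensors
        # d(S~)/dU = (1/pi) * 1 / (1 + (pi*U)^2)
        surrogate = 1.0 / (math.pi * (1.0 + (math.pi * mem) ** 2))
        return grad_output * surrogate

mem = torch.tensor([0.8, 1.3], requires_grad=True)
spk = ArcTanSpike.apply(mem - 1.0)  # spike wherever the membrane exceeds a threshold of 1.0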
To reiterate, surrogate gradients will not enable learning in the absence of spiking. This provokes an important distinction between the dead neuron problem and the vanishing gradient problem. A dead neuron is one that does not fire and, therefore, does not contribute to the loss. This means that the weights attached to that neuron have no “credit” in the credit assignment problem. The relevant gradient terms during the training process will remain at zero. Therefore, the neuron cannot learn to fire later on and so is stuck forever, not contributing to learning.
On the other hand, vanishing gradients can arise in ANNs and SNNs. For deep networks, the gradients of the loss function can become vanishingly small as they are successively scaled by values less than “1” when using several common activation functions (e.g., a sigmoid unit). In much the same way, RNNs are highly susceptible to vanishing gradients because they introduce an additional layer to the unrolled computational graph at each time step. Each layer adds another multiplicative factor in calculating the gradient, which makes it susceptible to vanishing if the factor is less than “1” or exploding if greater than “1.” The ReLU activation became broadly adopted to reduce the impact of vanishing gradients but remains underutilized in surrogate gradient implementations [151].
Surrogate gradients have been used in most state-of-the-art experiments that natively train an SNN [73], [107], [156], [169], [170]. A variety of surrogate gradient functions have been used to varying degrees of success, and the choice of function can be treated as a hyperparameter. While several studies have explored the impact of various surrogates on the learning process [113], [172], our understanding tends to be limited to what is known about biased gradient estimators. There is a lot left unanswered here. For example, if we can get away with approximating gradients, then, perhaps, surrogate gradients can be used in tandem with random feedback alignment. This involves replacing weights with random matrices during the backward pass. Rather than pure randomness, perhaps, local approximations can be made that follow the same spirit of a surrogate gradient.
In summary, taking the gradient only at spike times provides an unbiased estimator of the gradient at the expense of losing the ability to train dead neurons. Surrogate gradient descent flips this around, enabling dead neurons to backpropagate error signals by introducing a biased estimator of the gradient. There is a tug-of-war between bringing dead neurons back to life and introducing bias. Given how prevalent surrogate gradients have become, we will linger a little longer on the topic in describing their relation to model quantization. Understanding how approximations in gradient descent impact learning will very likely lead to a deeper understanding of why surrogate gradients are so effective, how they might be improved, and how backpropagation can be simplified by making approximations that reduce the cost of training without harming an objective.
E. Bag of Tricks in BPTT With SNNs
Many advances in deep learning stem from a series of incremental techniques that bolster the learning capacity of models. These techniques are applied in conjunction to boost model performance. For example, He et al.’s [173] work in “Bag of tricks for image classification with convolutional neural networks” not only captures the honest state of deep learning in the title alone but also performs an ablation study of “hacks” that can be combined to improve optimization during training. Some of these techniques can be ported straight from deep learning to SNNs, while others are SNN-specific. A nonexhaustive list of these techniques is provided in this section. These techniques are quite empirical, and each bullet would have its own “Practical Note” text box, but then this article would just turn into a bunch of boxes.
The reset mechanism in (4) is a function of the spike and is also nondifferentiable. It is important to ensure that the surrogate gradient is not cloned into the reset function as it has been empirically shown to degrade network performance [113]. Quite simply, we ignore it during the backward pass. snnTorch does this automatically by detaching the reset term in (4) from the computational graph by calling the “.detach()” function.
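A sketch of what the detached reset looks like when written out by hand (reusing the illustrative ArcTanSpike function from above; the update ordering is one of several reasonable choices):

def lif_step(x, mem, beta=0.9, threshold=1.0):
    """One step of a leaky integrate-and-fire neuron with a detached reset."""
    mem = beta * mem + x                       # decay and integrate the input, as in (4)
    spk = ArcTanSpike.apply(mem - threshold)   # surrogate-gradient spike
    mem = mem - (spk * threshold).detach()     # reset by subtraction, excluded from backprop
    return spk, mem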
Residual connections work remarkably well for nonspiking nets and spiking models alike. Direct paths between layers are created by allowing the output of an earlier layer to be added to the output of a later layer, effectively skipping one or more layers in between. They are used to address the vanishing gradient problem and improve the flow of information during both forward propagation and backward propagation, which enabled the neural network community to construct far deeper architectures, starting with the ResNet family of models and now commonly used in transformers [174]. Unsurprisingly, they work extremely well for SNNs too [171].
Learnable decay: Rather than treating the decay rates of neurons as hyperparameters, it is also common practice to make them learnable parameters. This makes SNNs resemble conventional RNNs much more closely. Doing so has been shown to improve testing performance on datasets with time-varying features [57].
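One way to make the decay learnable while keeping it in a stable range is to pass a raw parameter through a sigmoid; a minimal sketch (all names are illustrative):

import torch
import torch.nn as nn

class LeakyLearnableDecay(nn.Module):
    """Leaky integration step with a learnable decay rate."""

    def __init__(self):
        super().__init__()
        self.beta_raw = nn.Parameter(torch.tensor(2.0))  # sigmoid(2.0) ~ 0.88

    def forward(self, x, mem):
        beta = torch.sigmoid(self.beta_raw)  # constrain the decay rate to (0, 1)
        return beta * mem + x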
Graded spikes: Passive dendritic properties can attenuate action potentials, as can the cable-like properties of the axon. This feature can be coarsely accounted for as graded spikes. Each neuron has an additional learnable parameter that determines how to scale an output spike, so neuronal activations are no longer constrained to $\{0, 1\}$. Can this still be thought of as an SNN? From an engineering standpoint, if a spike must be broadcast to a variety of downstream neurons with an 8- or 16-bit destination address, then adding another several bits to the payload can be worth it. The second-generation Loihi chip from Intel Labs incorporates graded spikes in such a way that sparsity is preserved. Furthermore, the vector of learned values scales linearly with the number of neurons in a network, rather than quadratically as the weights do. It, therefore, contributes a minor cost in comparison to other components of an SNN [175].

Learnable thresholds: These have not been shown to help the training process, likely due to the discrete nature of thresholds, which gives rise to nondifferentiable operators in a computational graph. On the other hand, normalizing the values that are passed into a threshold significantly helps. Adopting batch normalization in convolutional networks helps boost performance, and learnable normalization approaches may act as an effective surrogate for learnable thresholds [176], [177], [178].
Pooling is effective for downsampling large spatial dimensions in convolutional networks and achieving translational invariance. If max pooling is applied to a sparse, spiking tensor, then tie-breaking between 1’s and 0’s does not make much sense. One might expect that we can borrow ideas from training binarized neural networks, where pooling is applied to the activations before they are thresholded to binarized quantities. This corresponds to applying pooling to the membrane potential in a manner that resembles a form of “local lateral inhibition.” However, this does not necessarily lead to optimal performance in SNNs. Interestingly, Fang et al. applied pooling to the spikes instead. Where multiple spikes occurred in a pooling window, a tie-break would occur randomly among them [171]. While no reason was given for doing this, it, nonetheless, achieved state-of-the-art (at the time) performance on a series of computer vision problems. Our best guess is that this randomness acted as a type of regularization. Whether max pooling or average pooling is used can be treated as a hyperparameter. As an alternative, SynSense’s neuromorphic hardware adopts sum pooling, where spatial dimensions are reduced by rerouting the spikes in a receptive field to a common postsynaptic neuron.
Optimizer: Most SNNs default to the Adam optimizer, as it has classically been shown to be robust when used with sequential models [179]. As SNNs become deeper, stochastic gradient descent with momentum seems to increase in prevalence over Adam. The reader is referred to Godbole et al.'s [180] deep learning tuning playbook for a systematic approach to hyperparameter optimization that applies generally.
F. Intersection Between Backprop and Local Learning
An interesting result arises when comparing backpropagation pathways that traverse varying durations of time. The derivative of the hidden state over time is $\partial U[t]/\partial U[t-1] = \beta$ [see (4)], so a gradient that traverses $n$ time steps is scaled by $\beta^{n}$. Weight updates associated with a presynaptic spike that precedes a postsynaptic spike by $n$ steps, therefore, decay exponentially with the spike time difference, mirroring one-half of the exponential learning window of STDP [see Fig. 10(b)].
Is this link just a coincidence? BPTT was derived from function optimization. STDP is a model of biological observation. Despite being developed via completely independent means, they converge upon an identical result. This could have immediate practical implications, where hardware accelerators that train models can excise a chunk of BPTT and replace it with the significantly cheaper and local STDP rule. Adopting such an approach might be thought of as an online variant of BPTT or as a gradient-modulated form of STDP.
G. Long-Term Temporal Dependencies
Neural and synaptic time constants typically span timescales on the order of one to hundreds of milliseconds. With such timescales, it is difficult to solve problems that require long-range associations extending beyond the slowest neuronal or synaptic time constant. Such problems are common in natural language processing and reinforcement learning and are key to understanding behavior and decision-making in humans. This challenge is a huge burden on the learning process, where vanishing gradients drastically slow the convergence of the neural network. LSTMs [181] and, later, GRUs [182] introduced slow dynamics designed to overcome memory and vanishing gradient problems in RNNs. Thus, a natural solution for networks of spiking neurons is to complement the fast timescales of neural dynamics with a variety of slower dynamics. Mixing discrete and continuous dynamics may enable SNNs to learn features that occur across a vast range of timescales. Examples of slower dynamics include the following.
Adaptive thresholds: After a neuron fires, it enters a refractory period during which it is more difficult to elicit further spikes from the neuron. This can be modeled by increasing the firing threshold $\theta$ of the neuron every time it emits a spike. After a sufficient time in which the neuron has not spiked, the threshold relaxes back to a steady-state value. Homeostatic thresholds are known to promote neuronal stability in correlated learning rules, such as STDP, which favors long-term potentiation at high frequencies regardless of spike timing [183], [184]. More recently, adaptive thresholds have been found to benefit gradient-based learning in SNNs as well [169] (see Appendix C3).

Recurrent attention: Hugely popularized in natural language generation, self-attention finds correlations between tokens of vast sequence lengths by feeding a model all sequential inputs at once. This representation of data is not quite how the brain processes data. Several approaches have approximated self-attention as a sequence of recurrent operations; SpikeGPT is the first application in the spiking domain and successfully achieved language generation [83]. In addition to more complex state-based computation, SpikeGPT additionally employs dynamical weights that vary over time.
Axonal delays: The wide variety of axon lengths means that there is a wide range of spike propagation delays. Some neurons have axons as short as 1 mm, whereas those in the sciatic nerve can extend up to a meter in length. The axonal delay can be a learned parameter spanning multiple time steps [73], [185], [186]. A lesser explored approach accounts for the varying delays in not only axons but also across the dendritic tree of a neuron. Coupling axonal and dendritic delays together allows for a fixed delay per synapse.
Membrane dynamics: We already know how the membrane potential can trigger spiking, but how does spiking impact the membrane? Rapid changes in voltage cause an electric field build-up that leads to temperature changes in cells. Joule heating scales quadratically with voltage changes, which affects the geometric structure of neurons and cascades into a change in membrane capacitance (and, thus, time constants). Decay rate modulation as a function of spike emission can act as a second-order mechanism to generate neuron-specific refractory dynamics [187].
Multistable neural activity: Strong recurrent connections in biological neural networks can support multistable dynamics [188], which facilitates stable information storage over time. Such dynamics, often called attractor neural networks [189], are believed to underpin working memory in the brain [190], [191] and are often attributed to the prefrontal cortex. The training of such networks using gradient descent is challenging and has not been attempted using SNNs as of yet [192].
Several rudimentary slow timescale dynamics have been tested in gradient-based approaches to training SNNs with a good deal of success [73], [169], but there are several neuronal dynamics that are yet to be explored. LSTMs showed us the importance of temporal regulation of information and effectively cured the short-term memory problem that plagued RNNs. Translating more nuanced neuronal features into gradient-based learning frameworks can undoubtedly strengthen the ability of SNNs to represent dynamical data in an efficient manner.
Online Learning
A. Temporal Locality
As incredible as our brains are, sadly, they are not time machines. It is highly unlikely that our neurons breach the space-time continuum to explicitly reference historical states in order to run the BPTT algorithm. As with all computers, brains operate on a physical substrate that dictates the operations they can handle and where memory is located. Conventional computers operate on an abstraction layer: memory is delocalized and communicated on demand, paying a considerable price in latency and energy. Brains, in contrast, are believed to operate on local information, which means that the best-performing approaches in temporal deep learning, namely, BPTT, are biologically implausible. This is because BPTT requires the storage of past inputs and states in memory, so its memory requirement scales with time, a property that limits BPTT to small temporal dependencies. To work around this, BPTT assumes a finite sequence length before making an update while truncating the gradients in time, which severely restricts the temporal dependencies that can be learned.
The constraint imposed on brain-inspired learning algorithms is that the calculation of a gradient should, much like the forward pass, be temporally local, i.e., it should depend only on values available at either the present time step $t$ or the one immediately prior, $t-1$.
B. Real-Time Recurrent Learning
RTRL estimates the same gradients as BPTT but relies on a set of different computations that make it temporally, but not spatially, local [193]. Since RTRL's memory requirement does not grow with time, why is it not used in favor of BPTT? BPTT's memory usage scales with the product of time and the number of neurons: it is $\mathcal{O}(nT)$ for $n$ neurons and $T$ time steps. RTRL trades away the time dependence for a memory cost of $\mathcal{O}(n^{3})$ and a computational cost of $\mathcal{O}(n^{4})$ per step, which is far worse for large networks even though it does not grow with time.
Let us derive what new information needs to be propagated forward to enable real-time gradient calculation for an SNN. As in (7), the gradient of the total loss is the sum of instantaneous gradients $\partial\mathcal{L}[t]/\partial W$, each of which we now aim to compute online at time $t$.

First, we define the influence of parameter $W$ on the membrane potential $U[t]$ as $m[t]$ and unpack it into prior and immediate components \begin{equation*} m[t] = \frac {\partial U[t]}{\partial W} = \sum _{s\leq t}\frac {\partial U[t]}{\partial W[s]} = \underbrace {\sum _{s\leq t-1}\frac {\partial U[t]}{\partial W[s]}}_{\textrm {prior}} + \underbrace {\frac {\partial U[t]}{\partial W[t]}}_{\textrm {immediate}}. \tag{10}\end{equation*}

The immediate and prior influence components are graphically illustrated in Fig. 10(a). The immediate influence is natural to calculate online and evaluates to the unweighted input to the neuron, $x[t]$. The prior influence relies on historical components of the network and can be unpacked recursively \begin{equation*} \sum _{s\leq t-1}\frac {\partial U[t]}{\partial W[s]} = \sum _{s\leq t-1}\underbrace {\frac {\partial U[t]}{\partial U[t-1]}}_{\textrm {temporal}}\frac {\partial U[t-1]}{\partial W[s]}. \tag{11}\end{equation*}

Based on (4), in the absence of explicitly recurrent connections, the temporal term evaluates to $\beta$. Substituting the immediate and prior influences back into (10) yields a recursive update for the influence \begin{equation*} m[t] = \beta m[t-1] + x[t]. \tag{12}\end{equation*}

This recursive formula is updated online by passing the unweighted input directly to the influence value $m[t]$ at each step. The gradient of the instantaneous loss is then obtained by combining the influence with the credit assigned to the neuron, $\bar {c}[t] \equiv \partial \mathcal {L}[t]/\partial U[t]$, which depends only on present-time values \begin{equation*} \frac {\partial \mathcal {L}[t]}{\partial W} = \frac {\partial \mathcal {L}[t]}{\partial U[t]}\frac {\partial U[t]}{\partial W} \equiv \bar {c}[t] m[t]. \tag{13}\end{equation*}
RTRL gradient pathways.
An intuitive, though incomplete, way to think about RTRL is given as follows. By reference to Fig. 12, at each time step, a backward pass that does not account for the history of weight updates is applied; the recursively updated influence value carries that history forward instead, as sketched in the example below.
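A minimal sketch of this procedure for a single leaky neuron and a single weight, following (10)-(13); spiking and reset are omitted for clarity, and a per-step mse loss is assumed:

def rtrl_step(x, y, mem, influence, weight, beta=0.9, lr=1e-2):
    """One online RTRL-style update; no history needs to be stored."""
    mem = beta * mem + weight * x              # U[t] = beta*U[t-1] + W*x[t]
    influence = beta * influence + x           # m[t] = beta*m[t-1] + x[t], from (12)
    credit = 2.0 * (mem - y)                   # c[t] = dL[t]/dU[t] for L[t] = (U[t] - y[t])^2
    weight = weight - lr * credit * influence  # dL[t]/dW = c[t]*m[t], from (13)
    return mem, influence, weight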
In the example above, the RTRL approach to training SNNs was only derived for a single neuron and a single parameter. A full-scale neural network replaces the influence value with an influence matrix that tracks how every parameter affects every hidden state; storing this matrix is what gives rise to RTRL's heavy memory cost.
Recent focus on online learning aims to reduce the memory and computational demands of RTRL. This is generally achieved by decomposing the influence matrix into simpler parts or by approximating the calculation of the influence matrix itself.
C. RTRL Variants in SNNs
Since 2020, a flurry of forward-mode learning algorithms has been tailored to SNNs [200]. All such works either modify, rederive, or approximate RTRL.
e-Prop [109]: RTRL is combined with surrogate gradient descent. Recurrent spiking neurons are used, where output spikes are linearly transformed and then fed back to the input of the same neurons. The computational graph is detached at the explicit recurrent operation but retained for implicit recurrence (i.e., where the membrane potential evolves over time). Projecting output spikes into a higher dimensional recurrent space acts like a reservoir, though it leads to biased gradient estimators that underperform compared to BPTT.
decolle [110]: “Deep continuous online learning” also combines RTRL with surrogate gradient descent. This time, greedy local losses are applied at every layer [143]. As such, errors only need to be propagated back to a single layer at a time. This means that errors do not need to traverse through a huge network, which reduces the burden of the spatial credit assignment problem. This brings about two challenges: not many problems can be cast into a form with definable local losses and greedy local learning prioritizes immediate gains without considering an overall objective.
OSTL [201]: “Online spatiotemporal learning” rederives RTRL. The spatial components of backpropagation and temporal components are factored into two separate terms, e.g., one that tracks the “immediate” influence and the other one that tracks the “prior influence” from (10).
ETLP [202]: “Event-based three-factor local plasticity” combines e-prop with direct random target projection (DRTP) [142]. In other words, the weights in the final layer are updated based on an approximation of RTRL, while earlier layers are updated based on partial derivatives that do not rely on a global loss and are spatially “local” to the layer. Instead, the target output is used to modulate these gradients. This addresses spatial credit assignment by using signals from a target, rather than backpropagating gradients in the immediate influence term of (10). The cost is that ETLP inherits drawbacks from both e-prop and DRTP: like greedy local learning, DRTP prioritizes immediate gains without considering an overall objective.
OSTTP [203]: “Online spatiotemporal learning with target projection” combines OSTL (functionally equivalent to RTRL) with DRTP. It inherits the drawbacks of DRTP while addressing the spatial credit assignment problem.
FPTT [204]: “Forward propagation through time” considers RTRL for sequence-to-sequence models with time-varying losses. A regularization term is applied to the loss at each step to ensure stability during the training process. Yin et al. [205] subsequently applied FPTT to SNNs using more complex neuron models with richer dynamics.
This is a nonexhaustive list of RTRL alternatives and can appear quite daunting at first. However, all approaches effectively stem from RTRL. The dominant trends include the following:
approximating RTRL to test how much of an approximation the training procedure can tolerate without completely failing [109];
replacing the immediate influence with global modulation of a loss or target to address spatial credit assignment [110], [202], [203];
modifying the objective to promote stable training dynamics [204];
identifying similarities to biology by factorizing RTRL into eligibility traces and/or three-factor learning rules [109], [202], [205].
Several RTRL variants claim to outperform BPTT in terms of loss minimization, though we take caution with such claims, as the two approaches become effectively identical when weight updates are deferred to the end of a sequence. We also caution against claims of improvements over RTRL, as RTRL can be thought of as the most general case of forward-mode learning applied to any generic architecture. Most reductions in computational complexity arise because they are narrowly considered for specific architectures or otherwise introduce approximations into their models. In contrast, Tallec and Ollivier [194] developed an “unbiased online recurrent optimization” scheme where stochastic noise is used and ultimately canceled out, leading to quadratic (rather than cubic) computational complexity with network size.
D. Spatial Locality
While temporal locality relies on a learning rule that depends only on the present state of the network, spatial locality requires each update to be derived from a node immediately adjacent to the parameter. The biologically motivated learning rules described in Section III-D address the spatial credit assignment problem by either replacing the global error signal with local errors or replacing analytical/numerical derivatives with random noise [138].
The more “natural” approach to online learning is perceived to be via unsupervised learning with synaptic plasticity rules, such as STDP [33], [206] and variants of STDP (see Appendix C2) [207], [208], [209], [210]. These approaches are directly inspired by experimental relationships between spike times and changes to synaptic conductance. Input data are fed to a network, and weights are updated based on the order and firing times of each pair of connected neurons [see Fig. 10(b)]. The interpretation is that, if a neuron causes another neuron to fire, then their synaptic strength should be increased. If a pair of neurons appears uncorrelated, their synaptic strength should be decreased. It follows the Hebbian mantra of “neurons that fire together wire together” [147].
There is a common misconception that backprop and STDP-like learning rules are at odds with one another, competing to be the long-term solution for training connectionist networks. On the one hand, it is thought that STDP deserves more attention as it scales with less complexity than backprop. STDP adheres to temporal and spatial locality, as each synaptic update only relies on information from immediately adjacent nodes. However, this relationship necessarily arises as STDP was reported using data from “immediately adjacent” neurons. On the other hand, STDP fails to compete with backprop on remotely challenging datasets. However, backprop was designed with function optimization in mind, while STDP emerged as a physiological observation. The mere fact that STDP is capable at all of obtaining competitive results on tasks originally intended for supervised learning (such as classifying the MNIST dataset), no matter how simple, is quite a wonder. Rather than focusing on what divides backprop and STDP, the pursuit of more effective learning rules will more likely benefit by understanding how the two intersect.
We demonstrated in Section IV-F how surrogate gradient descent via BPTT subsumes the effect of STDP. Spike time differences result in exponentially decaying weight update magnitudes such that half of the learning window of STDP is already accounted for within the BPTT algorithm [see Fig. 10(b)]. Bengio et al. [211] previously made the case that STDP resembles stochastic gradient descent, provided that STDP is supplemented with gradient feedback [212]. This specifically relates to the case where a neuron's firing rate is interpreted as its activation. Here, we have demonstrated that no modification needs to be made to the BPTT algorithm for it to account for STDP-like effects and that it is not limited to any specific neural code, such as the firing rate. The common theme is that STDP may benefit from integrating error-triggered plasticity to provide meaningful feedback to training a network [213].
Outlook
Designing a neural network was once thought to be strictly an engineering problem, whereas mapping the brain was a scientific curiosity [214]. With the intersection between deep learning and neuroscience broadening, and brains being able to solve complex problems much more efficiently, this view is poised to change. From the scientist’s view, deep learning and brain activity have shown many correlates, which leads us to believe that there is much untapped insight that ANNs can offer in the ambitious quest of understanding biological learning. For example, the activity across layers of a neural network has repeatedly shown similarities to experimental activity in the brain. This includes links between convolutional neural networks and measured activity from the visual cortex [215], [216], [217] and auditory processing regions [218]. Activity levels across populations of neurons have been quantified in many studies, but SNNs might inform us of the specific nature of such activity.
From the engineer’s perspective, neuron models derived from experimental results have allowed us to design extremely energy-efficient networks when running on hardware tailored to SNNs [219], [220], [221], [222], [223], [224], [225]. Improvements in energy consumption of up to two to three orders of magnitude have been reported when compared to conventional ANN acceleration on embedded hardware, which provides empirical validation of the benefits available from the three S’s: spikes, sparsity, and static data suppression (or event-driven processing) [20], [226], [227], [228], [229], [230]. These energy and latency benefits are derived from simply applying neuron models to connectionist networks, but there is so much more left to explore.
It is safe to say that the energy benefits afforded by spikes are uncontroversial. However, a more challenging question to address is: are spikes actually good for computation? It could be that years of evolution determined that spikes solve the long-range signal transmission problem in living organisms, and everything else had to adapt to fit this constraint. If this were true, then spike-based computation would be Pareto optimal, with a proclivity toward energy efficiency and latency. However, until we amass more evidence of a spike's purpose, we have some intuition as to where spikes shine in computation.
Hybrid dynamical systems: SNNs can model a broad class of dynamical systems by coupling discrete and continuous time dynamics into one system. Discontinuities are present in many physical systems, and spiking neuron models are a natural fit to model such dynamics.
Discrete function approximators: Neural networks are universal function approximators, where discrete functions are considered to be modeled sufficiently well by continuous approximations. Spikes are capable of precisely defining discrete functions without approximation.
Multiplexing: Spikes can encode different information in spike rate, times, or burst counts. Repurposing the same spikes offers a sensible way to condense the amount of computation required by a system.
Message packets: By compressing the representation of information, spikes can be thought of as packets of messages that are unlikely to collide as they travel across a network. In contrast, a digital system requires a synchronous clock to signal that a communication channel is available for a message to pass through (even when modeling asynchronous systems).
Coincidence detection: Neural information can be encoded based on spatially disparate but temporally proximate input spikes on a target neuron. It may be the case that isolated input spikes are insufficient to elicit a spike from the output neuron. However, if two incident spikes occur on a timescale faster than the target neuron membrane potential decay rate, this could push the potential beyond the threshold and trigger an output spike. In such a case, associative learning is taking place across neurons that are not directly connected. Although coincidence detection can be programmed in a continuous-time system without spikes, a theoretical analysis has shown that the processing rate of a coincidence detector neuron is faster than the rate at which information is passed to a neuron [231], [232].
Noise robustness: While analog signals are highly susceptible to noise, digital signals are far more robust in long-range communication. Neurons seem to have figured this out by performing analog computation via integration at the soma and digital communication along the axon. It is possible that any noise incident during analog computation at the soma is subsumed into the subthreshold dynamics of the neuron and, therefore, eliminated. A similar analogy can be made between spike rates and spike times in neural coding: pathways that are susceptible to adversarial attacks or timing perturbations could learn to be represented as a rate, which mitigates the timing disturbances that would otherwise corrupt temporal codes.
Modality normalization: A unified representation of sensory input (e.g., vision and auditory) as spikes is nature’s way of normalizing data. While this benefit is not exclusive to spikes (i.e., continuous data streams in nonspiking networks may also be normalized), early empirical evidence has shown instances where multimodal SNNs outperform convolutional neural networks on equivalent tasks [20], [228].
Mixed-mode differentiation: While most modern deep learning frameworks rely on reverse-mode autodifferentiation [233], it is in stark contrast to how the spatial credit assignment problem is treated in biological organisms. If we are to draw parallels between backpropagation and the brain, it is far more likely that approximations of forward-mode autodifferentiation are being used instead. Equation (12) describes how to propagate gradient-related terms forward in time to implement online learning, where such terms could be approximated by eligibility traces that keep track of presynaptic neuron activity in the form of calcium ions and fades over time [109], [234]. SNNs offer a natural way to use mixed-mode differentiation by projecting temporal terms in the gradient calculation from (11) into the future via forward-mode differentiation while taking advantage of the computational complexity of reverse-mode autodifferentiation for spatial terms [71], [110].
A better understanding of the problems spikes are best suited for, beyond addressing just energy efficiency, will be important in directing SNNs to meaningful tasks. The above list is a nonexhaustive start to intuit where that might be. Thus far, we have primarily viewed the benefits of SNNs by examining individual spikes. For example, the advantages derived from sparsity and single-bit communication arise at the level of an individual spiking neuron: how a spike promotes sparsity, how it contributes to a neural encoding strategy, and how it can be used in conjunction with modern deep learning, backprop, and gradient descent. Despite the advances yielded by this spike-centric view, it is important not to develop tunnel vision. New advances are likely to come from a deeper understanding of spikes acting collectively, much like the progression from atoms to waves in physics.
Designing learning rules that operate with brain-like performance is far less trivial than substituting a set of artificial neurons with spiking neurons. It would be incredibly elegant if a unified principle governed how the brain learns. However, the diversity of neurons, functions, and brain regions implies that a heterogeneous system rich in objectives and synaptic update rules is more likely and might require us to use all of the weapons in our arsenal of machine learning tools. It is likely that a better understanding of biological learning will be amassed by observing the behavior of a collection of spikes distributed across brain regions. Ongoing advances in procuring large-scale electrophysiological recordings at the neuron level can give us a window into observing how populations of spikes are orchestrated to handle credit assignment so efficiently and, at the very least, give us a more refined toolkit to developing theories that may advance deep learning [235], [236]. After all, it was not a single atom that led to the silicon revolution but, rather, a mass of particles and their collective fields. A stronger understanding of the computational benefits of spikes may require us to think at a larger scale in terms of the “fields” of spikes.
As the known benefits of SNNs manifest in the physical quantities of energy and latency, it will take more than just a machine learning mind to navigate the tangled highways of 100 trillion synapses. It will take a concerted effort between machine learning engineers, neuroscientists, and circuit designers to put spikes in the front seat of deep learning.
Additional Materials
A series of interactive tutorials complementary to this article are available in the documentation for our Python package designed for gradient-based learning using SNNs, snnTorch [237], at the following link: https://snntorch.readthedocs.io/en/latest/tutorials/index.html.
We invite additional contributions and tutorial content from the neuromorphic community.
ACKNOWLEDGMENT
The authors would like to thank Sumit Bam Shrestha, Garrick Orchard, and Albert Albesa-González for their insightful discussions over the course of putting together this article and iDataMap Corporation for their support.
Appendix
From Artificial to Spiking Neural Networks
1) Forward Euler Method for Solving Spiking Neuron Models:
The time derivative $dU(t)/dt$ of the passive membrane model is replaced with a finite difference approximation, without taking the limit $\Delta t \to 0$ \begin{equation*} \tau \frac {U(t+\Delta t) - U(t)}{\Delta t} = -U(t) + I_{\textrm {in}}(t)R. \tag{14}\end{equation*}

Isolating the membrane potential at the next time step gives \begin{equation*} U(t+\Delta t) = \left({1-\frac {\Delta t}{\tau }}\right)U(t) + \frac {\Delta t}{\tau }I_{\textrm {in}}(t)R. \tag{15}\end{equation*}

In the absence of input current [$I_{\textrm{in}}(t) = 0$] \begin{equation*} U(t+\Delta t) = \left({1-\frac {\Delta t}{\tau }}\right)U(t). \tag{16}\end{equation*}

Assume that $\Delta t = 1$ and $R = 1$, and let the decay rate $\beta$ be the factor scaling the membrane potential between subsequent steps \begin{equation*} \beta = \left({1-\frac {1}{\tau }}\right) \implies U[t+1] = \beta U[t] + (1-\beta)I_{\textrm {in}}[t+1]. \tag{17}\end{equation*}

Alternatively, $\beta$ can be derived from the analytical solution of the membrane potential in the absence of input \begin{equation*} U(t) = U_{0}e^{-t/\tau } \tag{18}\end{equation*} by taking the ratio of the membrane potential across two subsequent time steps \begin{align*} \beta = &\frac {U_{0}e^{-(t+\Delta t)/\tau }}{U_{0}e^{-t/\tau }} = \frac {U_{0}e^{-(t+2\Delta t)/\tau }}{U_{0}e^{-(t+\Delta t)/\tau }} = \ldots \\ \implies & \beta = e^{-\Delta t/\tau }. \tag{19}\end{align*}

A second nonphysiological assumption is made, where the effect of the $(1-\beta)$ input scaling is lumped into a learnable weight $W$ applied to the input $X[t]$ \begin{equation*} WX[t] = I_{\textrm {in}}[t]. \tag{20}\end{equation*}

This results in the simplified update \begin{equation*} U[t+1] = \beta U[t] + WX[t+1] \tag{21}\end{equation*}
To arrive at (4), a reset function is appended, which activates every time an output spike is triggered. The reset mechanism can be implemented by either subtracting the threshold at the onset of a spike as in (4) or by forcing the membrane potential to zero (Fig. 13) \begin{equation*} U[t+1] =\underbrace {\beta U[t]}_{\textrm {decay}} + \underbrace {WX[t]}_{\textrm {input}} - \underbrace {S_{\textrm {out}}(\beta U[t] + WX[t])}_{\textrm {reset-to-zero}}. \tag{22}\end{equation*}
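The discretized dynamics of (22) can be simulated in a handful of lines; a sketch in Python with assumed hyperparameters:

def lif_reset_to_zero(x_seq, W=0.5, beta=0.9, threshold=1.0):
    """Simulate (22): a discretized LIF neuron with a reset-to-zero mechanism."""
    mem, spikes = 0.0, []
    for x in x_seq:
        mem = beta * mem + W * x       # decay + weighted input
        spk = float(mem >= threshold)  # Heaviside threshold
        mem = mem * (1.0 - spk)        # reset to zero whenever a spike fires
        spikes.append(spk)
    return spikes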
Spike Encoding
The following spike encoding mechanisms and loss functions are described with respect to a single sample of data. They can be generalized to multiple samples as is common practice in deep learning to process data in batches.
1) Rate-Coded Input Conversion:
An example of the conversion of an input sample to a rate-coded spike train follows (Fig. 14). Let the normalized input feature $X_{ij} \in [0, 1]$ set the probability that a spike occurs at each time step, such that each element of the spike train $R_{ijk}$ is a Bernoulli trial \begin{equation*} {} {P}(R_{ijk}=1) = X_{ij} = 1 - {P}(R_{ijk}=0). \tag{23}\end{equation*}
Rate-coded input pixel. An input pixel of greater intensity corresponds to a higher firing rate.
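A sketch of (23) in PyTorch, where normalized intensities in [0, 1] act as per-step Bernoulli spike probabilities (num_steps is an assumed argument):

import torch

def rate_encode(image, num_steps=100):
    """Rate-coded spike train: one Bernoulli trial per feature per time step."""
    # spike wherever a uniform draw falls below the feature intensity
    return (torch.rand(num_steps, *image.shape) < image).float()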
2) Latency-Coded Input Conversion:
The logarithmic dependence between input feature intensity and spike timing can be derived using an RC membrane model driven by a constant current injection $I_{\textrm{in}}$ switched on at $t=0$ \begin{equation*} U(t) = I_{\textrm {in}}R(1 - e^{-t/\tau }). \tag{24}\end{equation*}

Setting $U(t)$ to the threshold $\theta$ and solving for $t$ gives the time taken to fire \begin{equation*} t=\tau \left[{\textrm {ln}\left({\frac {I_{\textrm {in}}R}{I_{\textrm {in}}R-\theta }}\right)}\right]. \tag{25}\end{equation*}

Generalizing to an arbitrary input feature $x$ (absorbing $R$ into $x$), a spike can only be emitted if the input is strong enough to push the membrane past the threshold \begin{align*} t(x) = \begin{cases} \displaystyle \tau \left[{\textrm {ln}\left({\frac {x}{x-\theta }}\right)}\right], & x > \theta \\ \displaystyle \infty, & \textrm {otherwise.} \end{cases} \tag{26}\end{align*}
Latency-coded input pixel. An input pixel of greater intensity corresponds to an earlier spike time.
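A sketch of the conversion in (26), with tau and theta as assumed hyperparameters:

import math

def latency_encode(x, tau=5.0, theta=0.01):
    """Latency coding per (26): stronger inputs fire earlier."""
    if x > theta:
        return tau * math.log(x / (x - theta))
    return math.inf  # subthreshold inputs never elicit a spike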
3) Rate-Coded Outputs:
A vectorized implementation of determining the predicted class from rate-coded output spike trains is described (Fig. 16). Let $\vec{S}[t]$ be the vector of output spikes at time step $t$. The spike count of each output neuron is accumulated over all $T$ time steps \begin{equation*} \vec {c} = \sum _{t=0}^{T}\vec {S}[t]. \tag{27}\end{equation*}

The predicted class $\hat{y}$ is the index of the neuron with the largest spike count \begin{equation*} \hat {y} = \mathop {\mathrm {arg\,max}} _{i}c_{i}. \tag{28}\end{equation*}
Rate-coded outputs.
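A vectorized sketch of (27) and (28), where spk_rec is an assumed [T, N_C] record of output spikes:

def predict_class(spk_rec):
    """Sum spikes over time (27) and take the argmax (28)."""
    counts = spk_rec.sum(dim=0)    # spike count per output neuron
    return counts.argmax().item()  # index of the predicted class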
4) Cross-Entropy Spike Rate:
The spike count of the output layer from (27) is treated as a set of logits and passed through the softmax function to obtain the predicted probability of each class \begin{equation*} p_{i}=\frac {e^{c_{i}}}{\sum _{j=1}^{N_{C}}e^{c_{j}}} \tag{29}\end{equation*} where $N_C$ is the number of output classes. The cross entropy between these probabilities and the one-hot target $y_i$ is then minimized \begin{equation*} \mathcal {L}_{CE} = -\sum _{i=0}^{N_{C}}y_{i}\textrm {log}(p_{i}). \tag{30}\end{equation*}
Cross-entropy spike rate. The target vector specifies the correct class; training encourages the correct class to fire more and the incorrect classes to fire less.
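In practice, (29) and (30) amount to passing the spike counts to a standard softmax cross-entropy loss; a sketch with assumed shapes and class index:

import torch
import torch.nn.functional as F

spk_rec = (torch.rand(100, 10) < 0.1).float()        # assumed [T, N_C] output spikes
counts = spk_rec.sum(dim=0)                          # c_i, as in (27)
target = torch.tensor([3])                           # index of the correct class
loss = F.cross_entropy(counts.unsqueeze(0), target)  # softmax (29) + cross entropy (30)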
5) Mean Square Spike Rate:
As in (27), the spike count of the output layer is accumulated over time. The target $y_i$ specifies the total desired number of spikes from each output neuron, and the mean square error between the two is summed across all classes \begin{equation*} \mathcal {L}_{\textrm {mse}} = \sum _{i}^{N_{C}}(y_{i} - c_{i})^{2}. \tag{31}\end{equation*}
Mean square spike rate. The target vector specifies a desired spike count for every output neuron, including those of the incorrect classes.
6) Maximum Membrane:
The logits $\vec{m}$ are taken as the maximum membrane potential of each output neuron over time \begin{equation*} \vec {m} = \textrm {max}_{t}\vec {U}[t] \tag{32}\end{equation*} and are substituted for the spike count in the softmax cross-entropy loss of (29) and (30).
Maximum membrane. The peak membrane potential for each neuron is used in the cross-entropy loss function. This encourages the peak of the correct class to grow, while that of the incorrect class is suppressed. The effect of this is to promote more firing from the correct class and less from the incorrect class.
Alternatively, the membrane potential is summed over time to obtain the logits \begin{equation*} \vec {m} = \sum _{t}^{T}\vec {U}[t]. \tag{33}\end{equation*}
7) Mean Square Membrane:
Let $y_i[t]$ be the target membrane potential of the $i$th output neuron at time $t$. The mean square error is applied across both time steps and output neurons \begin{equation*} \mathcal {L}_{\textrm {mse}} = \sum _{i}^{N_{C}}\sum _{t}^{T}(y_{i}[t]-U_{i}[t])^{2}. \tag{34}\end{equation*}
Mean square membrane. The membrane potential at each time step is applied to the mse loss function. This allows a defined membrane target. The example above sets the target at all time steps at the firing threshold for the correct class and to zero for incorrect classes.
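A sketch of (34) that sets the target of the correct class to the firing threshold at every time step and the incorrect classes to zero (the shapes and class index are assumptions):

import torch

T, num_classes, theta = 100, 10, 1.0
mem_rec = torch.rand(T, num_classes)  # assumed record of output membrane potentials
y = torch.zeros(T, num_classes)       # incorrect classes target zero
y[:, 3] = theta                       # correct class (index 3) targets the threshold
loss = ((y - mem_rec) ** 2).sum()     # mse over time steps and neurons, as in (34)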
8) Cross-Entropy Latency Code:
Let $f_i$ be the first spike time of the $i$th output neuron. For the softmax and cross-entropy loss of (29) and (30) to promote early firing of the correct class, the spike times must first be transformed such that earlier spikes produce larger logits. One option is to negate the spike times \begin{equation*} \vec {f}:=-\vec {f}. \tag{35}\end{equation*}

Another option is to take the inverse of each spike time \begin{equation*} f_{i}:= \frac {1}{f_{i}}. \tag{36}\end{equation*}
Cross-entropy latency code. Applying the inverse (or negated) spike time to the cross-entropy loss pushes the correct class to fire first and the incorrect classes to fire later.
9) Mean Square Spike Time:
The spike time(s) of all neurons are specified as targets (Fig. 22). In the case where only the first spike matters, the mean square error between the target $y_i$ and the first spike time $f_i$ of each output neuron is minimized \begin{equation*} \mathcal {L}_{\textrm {mse}} = \sum _{i}^{N_{C}}(y_{i} - f_{i})^{2}. \tag{37}\end{equation*}

This can be generalized to account for multiple spikes per neuron, where the $k$th spike of the $i$th neuron is assigned its own target \begin{equation*} \mathcal {L}_{\textrm {mse}} = \sum _{k}^{n}\sum _{i}^{N_{C}}(y_{i,k} - f_{i,k})^{2}. \tag{38}\end{equation*}
Mean square spike time. The timing of all spikes is iterated over and sequentially applied to the mse loss function. This enables the timing for multiple spikes to be precisely defined.
10) Mean Square Relative Spike Time:
The difference between the spike times of correct and incorrect neurons is specified as a target (Fig. 23). As in Appendix B9, the mean square error between spike times and their targets is minimized, but the targets are now defined relative to the spike time of the correct class.
Mean square relative spike time. The relative timing between all spikes is applied to the mse loss function, enabling a defined time window $\gamma$ to separate the firing of the correct class from that of the incorrect classes.
Let the minimum possible spike time be $f_0$, which serves as the target for the correct class. Incorrect neurons need only fire at least $\gamma$ steps later: any incorrect neuron that fires within this window has its target pushed back to $f_0 + \gamma$, while neurons that already fire late enough incur no penalty \begin{align*} y_{i} = \begin{cases} \displaystyle f_{0} + \gamma, & \textrm {if $f_{i} < f_{{0}}+\gamma $}\\ \displaystyle f_{i}, & \textrm {if $f_{i} \geq f_{{0}}+\gamma $.} \end{cases} \tag{39}\end{align*}
11) Population Level Regularization:
L1-regularization can be applied to the total number of spikes emitted at the output layer to penalize excessive firing [150], thus encouraging sparse activity at the output \begin{equation*} \mathcal {L}_{L1} = \lambda _{1}\sum _{t}^{T}\sum _{i}^{N_{C}}S_{i}[t] \tag{40}\end{equation*} where $\lambda_1$ is a hyperparameter controlling the strength of the penalty.

Alternatively, an upper activity threshold $\theta_U$ can be applied to the total spike count of layer $l$, such that only the excess above this threshold is penalized \begin{equation*} \mathcal {L}_{U} = \lambda _{U}\left({\left[{\sum _{i}^{N}c_{i}^{(l)} - \theta _{U}}\right]_{+}}\right)^{L} \tag{41}\end{equation*} where $[\cdot]_+$ denotes linear rectification and $\lambda_U$ and the exponent $L$ are hyperparameters.
12) Neuron Level Regularization:
A lower activity threshold $\theta_L$ specifies the minimum number of spikes each neuron in layer $l$ should emit; neurons that fire below this threshold are penalized \begin{equation*} \mathcal {L}_{L} = \frac {\lambda _{L}}{N}\sum _{i}^{N}\Big (\Big [\theta _{L} - c_{i}^{(l)}\Big]_{+}\Big)^{2}. \tag{42}\end{equation*}

The rectification $[\cdot]_+$ ensures that the penalty is applied only to neurons whose spike count falls below $\theta_L$.
Training Spiking Neural Networks
1) Backpropagation Using Spike Times:
A visual depiction of the following derivation is provided in Fig. 24. In the original description of SpikeProp from [119], a spike response model is used \begin{align*} U_{j}(t) &= \sum _{i,k} W_{i,j}I_{i}^{(k)}(t) \\ I_{i}^{(k)}(t) & =\epsilon (t-f_{i}^{(k)})\tag{43}\end{align*} where $f_i^{(k)}$ is the time of the $k$th spike from the $i$th presynaptic neuron and the postsynaptic current kernel is \begin{equation*} \epsilon (t) = \frac {t}{\tau }e^{1-\frac {t}{\tau }}\Theta \left ({t}\right) \tag{44}\end{equation*} in which $\Theta(t)$ is the Heaviside step function.
Calculation of the derivative of the membrane potential with respect to spike time.
Consider an SNN where each target specifies the timing of the output spike emitted from the $j$th output neuron. Applying the chain rule to the loss gives \begin{equation*} \frac {\partial \mathcal {L}}{\partial W_{i,j}} = \frac {\partial \mathcal {L}}{\partial f_{j}}\frac {\partial f_{j}}{\partial U_{j}}\frac {\partial U_{j}}{\partial W_{i,j}}\Bigr |_{t=f_{j}}. \tag{45}\end{equation*}

The derivative of the loss with respect to the output spike time is \begin{equation*} \frac {\partial \mathcal {L}}{\partial f_{j}} = 2(y_{j} - f_{j}). \tag{46}\end{equation*}

The membrane potential's dependence on the weight, evaluated at the spike time, is the summed postsynaptic current \begin{equation*} \frac {\partial U_{j}}{\partial W_{i,j}}\Bigr |_{t=f_{j}} = \sum _{k}I_{i}^{(k)}(f_{j}) = \sum _{k}\epsilon (f_{j}-f_{i}^{(k)}). \tag{47}\end{equation*}

The remaining term is approximated by linearizing the membrane potential about the spike time, such that the spike time's sensitivity to the membrane potential is the inverse of the potential's rate of change \begin{align*} \frac {\partial f_{j}}{\partial U_{j}} & \leftarrow \left ({\frac {\partial U_{j}}{\partial t}\Bigr |_{t=f_{j}}}\right)^{-1} \\ & = \left ({\sum _{i,k} W_{i,j}\frac {\partial I_{i}^{(k)}}{\partial t}\Bigr |_{t=f_{j}}}\right)^{-1} \\ & = \left ({\sum _{i,k}W_{i,j}\frac {f_{j} - f_{i}^{(k)}-\tau }{\tau ^{2}}\left({e^{\frac {f_{j} - f_{i}^{(k)}}{\tau }-1}}\right)}\right)^{-1}. \tag{48}\end{align*}

Combining these terms gives the final SpikeProp gradient \begin{equation*} \frac {\partial \mathcal {L}}{\partial W_{i,j}} = \frac {2(y_{j}-f_{j})\sum _{k} I_{i}^{(k)}(f_{j})}{\sum _{i,k}W_{i,j}(\partial I_{j}^{(k)}/\partial t)\Bigr |_{t=f_{j}}}. \tag{49}\end{equation*}
2) Backpropagation Using Spikes:
STDP: The connection between a pair of neurons can be altered by the spikes emitted by both neurons. Several experiments have shown that the relative timing of spikes between presynaptic and postsynaptic neurons can be used to define a learning rule for updating the synaptic weight [33]. Let $t_{\textrm{pre}}$ and $t_{\textrm{post}}$ be the presynaptic and postsynaptic spike times, respectively, with the time difference between them defined as \begin{equation*} \Delta t = t_{\textrm {pre}} - t_{\textrm {post}}. \tag{50}\end{equation*}

The weight update is an exponential function of this time difference \begin{align*} \Delta W = \begin{cases} \displaystyle A_{+}e^{\Delta t/\tau _{+}}, & \textrm {if $t_{\textrm {post}} > t_{\textrm {pre}}$} \\ \displaystyle A_{-}e^{-\Delta t/\tau _{-}}, & \textrm {if $t_{\textrm {post}} < t_{\textrm {pre}}$} \end{cases} \tag{51}\end{align*} where $A_+$ and $A_-$ scale the strength of potentiation and depression, respectively, and $\tau_+$ and $\tau_-$ set the width of the learning window.
STDP learning window. If the presynaptic neuron spikes before the postsynaptic neuron, the synaptic weight is potentiated; if the order is reversed, the weight is depressed.
For a strong, excitatory synaptic connection, a presynaptic spike will trigger a large postsynaptic potential, making it more likely that the postsynaptic neuron fires shortly after the presynaptic one. By (51), this causal ordering further potentiates the connection.
Input sensory data are typically correlated in both space and time, so a network’s response to a correlated spike train will increase the weights much faster than uncorrelated spike trains. This is a direct result of causal spiking. Intuitively, a group of correlated spikes from multiple presynaptic neurons will arrive at a postsynaptic neuron within a close time interval, causing stronger depolarization of the neuron membrane potential, and a higher probability of a postsynaptic spike being triggered.
However, without an upper bound, this will lead to unstable and indefinitely large growth of the synaptic weight. In practice, an upper limit should be applied to constrain potentiation. Alternatively, homeostatic mechanisms can also be used to offset this unbounded growth, such as an adaptive threshold that increases each time a spike is triggered from the neuron (see Appendix C3).
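A sketch of the pairwise rule in (50) and (51); the amplitudes and time constants are assumed hyperparameters, and the final line clamps the weight to implement the upper bound discussed above:

import math

def stdp_dw(t_pre, t_post, A_plus=0.01, A_minus=-0.012,
            tau_plus=20.0, tau_minus=20.0):
    """Weight change for a single pre/post spike pair, per (50) and (51)."""
    dt = t_pre - t_post                         # (50)
    if t_post > t_pre:                          # causal pair: potentiate
        return A_plus * math.exp(dt / tau_plus)
    return A_minus * math.exp(-dt / tau_minus)  # acausal pair: depress

w = min(0.5 + stdp_dw(t_pre=10.0, t_post=12.0), 1.0)  # clamp to an upper limit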
3) Long-Term Temporal Dependencies:
One of the simplest implementations of an adaptive threshold is to choose a steady-state threshold $\theta_0$ and add to it a variable component $b[t]$ that jumps each time the neuron emits a spike and decays back toward zero otherwise \begin{align*} \theta [t] &= \theta _{0} + b[t] \tag{52}\\ b[t+1] &= \alpha b[t] + (1-\alpha)S_{\textrm {out}}[t] \tag{53}\end{align*} where $\alpha$ is the decay rate of the threshold's adaptive component.
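A sketch of (52) and (53) as a per-step update, with theta_0 and alpha as assumed hyperparameters:

def adaptive_threshold_step(b, spk, theta_0=1.0, alpha=0.95):
    """Update the adaptive threshold after one time step."""
    theta = theta_0 + b                   # (52): effective firing threshold
    b = alpha * b + (1.0 - alpha) * spk   # (53): jump on spike, decay otherwise
    return theta, b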