Sparseness Analysis in the Pretraining of Deep Neural Networks

Abstract:

A major advance in deep multilayer neural networks (DNNs) is the invention of various unsupervised pretraining methods that initialize network parameters and lead to good prediction accuracy. This paper presents a sparseness analysis of the hidden units in the pretraining process. In particular, we use the $L_{1}$-norm to measure sparseness and provide sufficient conditions under which pretraining leads to sparseness for popular pretraining models, such as denoising autoencoders (DAEs) and restricted Boltzmann machines (RBMs). Our experimental results demonstrate that when the sufficient conditions are satisfied, the pretraining models lead to sparseness. Our experiments also reveal that with sigmoid activation functions, pretraining plays an important sparseness-inducing role in DNNs with sigmoid (Dsigm), whereas with rectifier linear unit (ReLU) activation functions, pretraining becomes less effective for DNNs with ReLU (Drelu). Fortunately, Drelu can reach a higher recognition accuracy than DNNs with pretraining (DAEs and RBMs), as it can capture the main benefit (such as sparseness-encouraging) of pretraining in Dsigm. However, ReLU is not adapted to the different firing rates in biological neurons, because the firing rate actually changes along with the varying membrane resistances. To address this problem, we further propose a family of rectifier piecewise linear units (RePLUs) to fit the different firing rates. The experimental results show that RePLU performs better than ReLU and is comparable with some pretraining techniques, such as RBMs and DAEs.
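As a rough illustration of measuring hidden-unit sparseness with the $L_{1}$-norm, the following minimal sketch computes the average $L_{1}$-norm of hidden activations for one randomly initialized layer under sigmoid and ReLU. The layer sizes, the random data, and the helper names `mean_l1_norm` and `zero_fraction` are assumptions made for illustration; the paper's exact normalization and experimental setup are not shown in this excerpt.

```python
import numpy as np


def sigmoid(z):
    """Logistic sigmoid activation."""
    return 1.0 / (1.0 + np.exp(-z))


def relu(z):
    """Rectifier activation: max(z, 0) elementwise."""
    return np.maximum(z, 0.0)


def mean_l1_norm(h):
    """Average L1-norm of hidden activation vectors (one row per input).

    Hypothetical helper: a smaller value means sparser activations
    under an L1-based measure.
    """
    return float(np.mean(np.sum(np.abs(h), axis=1)))


def zero_fraction(h, tol=1e-12):
    """Fraction of hidden activations that are (numerically) zero."""
    return float(np.mean(np.abs(h) <= tol))


# Assumed toy setup: 256 random inputs, one 64 -> 32 hidden layer.
rng = np.random.default_rng(0)
x = rng.standard_normal((256, 64))
W = rng.standard_normal((64, 32)) * 0.1
b = np.zeros(32)

z = x @ W + b
h_sigm = sigmoid(z)
h_relu = relu(z)

print("sigmoid: mean L1 =", mean_l1_norm(h_sigm), "zeros =", zero_fraction(h_sigm))
print("ReLU:    mean L1 =", mean_l1_norm(h_relu), "zeros =", zero_fraction(h_relu))
```

In this toy setting ReLU produces exact zeros while sigmoid does not, which is one informal way to see why a rectifier can yield sparse hidden activations without pretraining. Evaluating the same quantities before and after pretraining a real model would be needed to reproduce the kind of comparison the abstract describes.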

Published in: IEEE Transactions on Neural Networks and Learning Systems ( Volume: 28, Issue: 6, June 2017)

Page(s): 1425 - 1438

Date of Publication: 31 March 2016

PubMed ID: 27046912

DOI: 10.1109/TNNLS.2016.2541681

I. Introduction

In the past several decades, the backpropagation (BP) algorithm has been used to train deep multilayer neural networks (DNNs) inspired by the architectural depth of the brain [1]. However, it is well known that if one trains DNNs with the BP algorithm from randomly initialized parameters, one typically will end up with models that have poor prediction performance [2]–[5].
