Conferences >2016 IEEE Conference on Compu...

Learning Deep Features for Discriminative Localization

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

In this work, we revisit the global average pooling layer proposed in [13], and shed light on how it explicitly enables the convolutional neural network (CNN) to have rem...Show More

Metadata

Abstract:

In this work, we revisit the global average pooling layer proposed in [13], and shed light on how it explicitly enables the convolutional neural network (CNN) to have remarkable localization ability despite being trained on imagelevel labels. While this technique was previously proposed as a means for regularizing training, we find that it actually builds a generic localizable deep representation that exposes the implicit attention of CNNs on an image. Despite the apparent simplicity of global average pooling, we are able to achieve 37.1% top-5 error for object localization on ILSVRC 2014 without training on any bounding box annotation. We demonstrate in a variety of experiments that our network is able to localize the discriminative image regions despite just being trained for solving classification task1.

Published in: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Date of Conference: 27-30 June 2016

Date Added to IEEE Xplore: 12 December 2016

ISBN Information:

Electronic ISSN: 1063-6919

DOI: 10.1109/CVPR.2016.319

Conference Location: Las Vegas, NV, USA

Contents

1. Introduction

Recent work by Zhou et al [34] has shown that the convolutional units of various layers of convolutional neural networks (CNNs) actually behave as object detectors despite no supervision on the location of the object was provided. Despite having this remarkable ability to localize objects in the convolutional layers, this ability is lost when fully-connected layers are used for classification. Recently some popular fully-convolutional neural networks such as the Network in Network (NIN) [13] and GoogLeNet [25] have been proposed to avoid the use of fully-connected layers to minimize the number of parameters while maintaining high performance.

References is not available for this document.

Learning Deep Features for Discriminative Localization

Abstract:

Metadata

Abstract:

1. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Learning Deep Features for Discriminative Localization

Alerts

Abstract:

Metadata

Abstract:

1. Introduction

References