Conferences >2014 IEEE Conference on Compu...

CNN Features Off-the-Shelf: An Astounding Baseline for Recognition

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Recent results indicate that the generic descriptors extracted from the convolutional neural networks are very powerful. This paper adds to the mounting evidence that thi...Show More

Metadata

Abstract:

Recent results indicate that the generic descriptors extracted from the convolutional neural networks are very powerful. This paper adds to the mounting evidence that this is indeed the case. We report on a series of experiments conducted for different recognition tasks using the publicly available code and model of the OverFeat network which was trained to perform object classification on ILSVRC13. We use features extracted from the OverFeat network as a generic image representation to tackle the diverse range of recognition tasks of object image classification, scene recognition, fine grained recognition, attribute detection and image retrieval applied to a diverse set of datasets. We selected these tasks and datasets as they gradually move further away from the original task and data the OverFeat network was trained to solve. Astonishingly, we report consistent superior results compared to the highly tuned state-of-the-art systems in all the visual classification tasks on various datasets. For instance retrieval it consistently outperforms low memory footprint methods except for sculptures dataset. The results are achieved using a linear SVM classifier (or L2 distance in case of retrieval) applied to a feature representation of size 4096 extracted from a layer in the net. The representations are further modified using simple augmentation techniques e.g. jittering. The results strongly suggest that features obtained from deep learning with convolutional nets should be the primary candidate in most visual recognition tasks.

Published in: 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops

Date of Conference: 23-28 June 2014

Date Added to IEEE Xplore: 25 September 2014

Electronic ISBN:978-1-4799-4308-1

ISSN Information:

DOI: 10.1109/CVPRW.2014.131

Conference Location: Columbus, OH, USA

Contents

1. Introduction

“Deep learning. How well do you think it would work for your computer vision problem?” Most likely this question has been posed in your group's coffee room. And in response someone has quoted recent success stories [29], [15], [10]– and someone else professed skepticism. You may have left the coffee room slightly dejected thinking “Pity I have neither the time, GPU programming skills nor large amount of labelled data to train my own network to quickly find out the answer”. But when the convolutional neural network OverFeat [38] was recently made publicly available¹

There are other publicly available deep learning implementations such as Alex Krizhevsky's ConvNet and Berkeley's Caffe. Benchmarking these implementations is beyond the scope of this paper.

it allowed for some experimentation. In particular we wondered now, not whether one could train a deep network specifically for a given task, but if the features extracted by a deep network - one carefully trained on the diverse ImageNet database to perform the specific task of image classification - could be exploited for a wide variety of vision tasks. We now relate our discussions and general findings because as a computer vision researcher you've probably had the same questions: Figure 1:

Top) cnn representation replaces pipelines of s. o. a methods and achieve better results. E. g. Dpd [50]. Bottom) augmented cnn representation with linear svm consistently outperforms s. o. a. On multiple tasks. Specialized cnn refers to other works which specifically designed the cnn for their task

References is not available for this document.

CNN Features Off-the-Shelf: An Astounding Baseline for Recognition

Abstract:

Metadata

Abstract:

ISSN Information:

1. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

CNN Features Off-the-Shelf: An Astounding Baseline for Recognition

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

1. Introduction

Authors

Figures

References

Citations

Keywords

Metrics

Footnotes

References