Mining Discriminative Triplets of Patches for Fine-Grained Classification

Abstract:

Fine-grained classification involves distinguishing between similar sub-categories based on subtle differences in highly localized regions; therefore, accurate localization of discriminative regions remains a major challenge. We describe a patch-based framework to address this problem. We introduce triplets of patches with geometric constraints to improve the accuracy of patch localization, and automatically mine discriminative geometrically-constrained triplets for classification. The resulting approach requires only object bounding boxes. Its effectiveness is demonstrated using four publicly available fine-grained datasets, on which it outperforms or achieves comparable performance to the state-of-the-art in classification.

Published in: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Date of Conference: 27-30 June 2016

Date Added to IEEE Xplore: 12 December 2016

ISBN Information:

Electronic ISSN: 1063-6919

DOI: 10.1109/CVPR.2016.131

Conference Location: Las Vegas, NV, USA

1. Introduction

The task of fine-grained classification is to recognize subordinate categories belonging to the same superordinate category [42], [39], [32], [25]. The major challenge is that fine-grained objects share a similar overall appearance and differ only subtly in highly localized regions. To find these discriminative regions effectively and accurately, some previous approaches utilize humans-in-the-loop [4], [7], [40], or require semantic part annotations [30], [2], [1], [3], [12], [47], [48] or 3D models [29], [25]. These methods are effective, but they require extra keypoint, part, or 3D annotations from humans, which are often expensive to obtain. On the other hand, recent research on mining discriminative mid-level visual elements [9], [36], [8], [21], [27] automatically finds discriminative patches or regions from a huge pool and uses the responses of those discriminative elements as a mid-level representation for classification. However, this approach has mainly been applied to scene classification and not typically to fine-grained classification. This is probably because the discriminative patches for fine-grained categories must be localized more accurately than those for scene classification.
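To make the idea of a mid-level representation concrete, the following is a minimal sketch of one common way such a representation can be formed: each mined element is treated as a linear detector, and its response on an image is the maximum score over all candidate patches. The function name `midlevel_representation`, the linear-detector form, and the max-pooling step are assumptions chosen for illustration; they are not taken from this paper or from the cited works.

```python
import numpy as np

def midlevel_representation(patch_features, detector_weights, detector_biases):
    """Max-pool each detector's response over all patches of one image.

    Hypothetical helper for illustration only.
    patch_features:   (num_patches, dim) descriptors of candidate patches
    detector_weights: (num_detectors, dim) one linear detector per mined element
    detector_biases:  (num_detectors,) one bias per detector
    Returns a (num_detectors,) vector with one score per element.
    """
    # scores[i, k] is the response of detector k on patch i.
    scores = patch_features @ detector_weights.T + detector_biases
    # Keep, for every detector, its strongest response anywhere in the image.
    return scores.max(axis=0)

# Toy example: three 2-D patch descriptors and two detectors.
patches = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
weights = np.array([[1.0, -1.0], [0.5, 0.5]])
biases = np.array([0.0, -0.5])
representation = midlevel_representation(patches, weights, biases)
```

The resulting vector has one entry per detector and could be fed to any standard classifier. Because only the maximum response is kept, a detector that fires on a slightly misplaced patch still contributes a high score, which is tolerable for scenes but illustrates why fine-grained categories call for more accurate localization.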
