Object class recognition by unsupervised scale-invariant learning

Abstract:

We present a method to learn and recognize object class models from unlabeled and unsegmented cluttered scenes in a scale-invariant manner. Objects are modeled as flexible constellations of parts. A probabilistic representation is used for all aspects of the object: shape, appearance, occlusion and relative scale. An entropy-based feature detector is used to select regions and their scale within the image. In learning, the parameters of the scale-invariant object model are estimated using expectation-maximization in a maximum-likelihood setting. In recognition, this model is used in a Bayesian manner to classify images. The flexible nature of the model is demonstrated by excellent results over a range of datasets including geometrically constrained classes (e.g. faces, cars) and flexible objects (such as animals).

Published in: 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings.

Date of Conference: 18-20 June 2003

Date Added to IEEE Xplore: 15 July 2003

Print ISBN: 0-7695-1900-8

Print ISSN: 1063-6919

DOI: 10.1109/CVPR.2003.1211479

Conference Location: Madison, WI, USA

1. Introduction

Representation, detection and learning are the main issues that need to be tackled in designing a visual system for recognizing object categories. The first challenge is coming up with models that can capture the ‘essence’ of a category, i.e. what is common to the objects that belong to it, and yet are flexible enough to accommodate object variability (e.g. presence or absence of distinctive parts such as a mustache or glasses, variability in overall shape, and changing appearance due to lighting conditions, viewpoint, etc.). The challenge of detection is defining metrics and inventing algorithms that are suitable for matching models to images efficiently in the presence of occlusion and clutter. Learning is the ultimate challenge. If we wish to design visual systems that can recognize, say, 10,000 object categories, then effortless learning is a crucial step. This means that the training sets should be small and that the operator-assisted steps that are required (e.g. elimination of clutter in the background of the object, scale normalization of the training sample) should be reduced to a minimum or eliminated.
