Journals & Magazines >IEEE Transactions on Pattern ... >Volume: 33 Issue: 12

Nonparametric Scene Parsing via Label Transfer

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

While there has been a lot of recent work on object recognition and image understanding, the focus has been on carefully establishing mathematical models for images, scen...Show More

Metadata

Abstract:

While there has been a lot of recent work on object recognition and image understanding, the focus has been on carefully establishing mathematical models for images, scenes, and objects. In this paper, we propose a novel, nonparametric approach for object recognition and scene parsing using a new technology we name label transfer. For an input image, our system first retrieves its nearest neighbors from a large database containing fully annotated images. Then, the system establishes dense correspondences between the input image and each of the nearest neighbors using the dense SIFT flow algorithm [28], which aligns two images based on local image structures. Finally, based on the dense scene correspondences obtained from SIFT flow, our system warps the existing annotations and integrates multiple cues in a Markov random field framework to segment and recognize the query image. Promising experimental results have been achieved by our nonparametric scene parsing system on challenging databases. Compared to existing object recognition approaches that require training classifiers or appearance models for each object category, our system is easy to implement, has few parameters, and embeds contextual information naturally in the retrieval/alignment procedure.

Published in: IEEE Transactions on Pattern Analysis and Machine Intelligence ( Volume: 33, Issue: 12, December 2011)

Page(s): 2368 - 2382

Date of Publication: 30 June 2011

ISSN Information:

PubMed ID: 21709305

DOI: 10.1109/TPAMI.2011.131

Contents

1 Introduction

Scene parsing, or recognizing and segmenting objects in an image, is one of the core problems of computer vision. Traditional approaches to object recognition begin by specifying an object model, such as template matching [8], [49], constellations [13], [15], bags of features [19], [24], [44], [45], or shape models [2], [3], [14], etc. These approaches typically work with a fixed number of object categories and require training generative or discriminative models for each category from training data. In the parsing stage, these systems try to align the learned models to the input image and associate object category labels with pixels, windows, edges, or other image representations. Recently, context information has also been carefully modeled to capture the relationship between objects at the semantic level [20], [22]. Encouraging progress has been made by these models on a variety of object recognition and scene parsing tasks.

References is not available for this document.

MIT Libraries

MIT Libraries

Nonparametric Scene Parsing via Label Transfer

Abstract:

Metadata

Abstract:

ISSN Information:

1 Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MIT Libraries

MIT Libraries

Nonparametric Scene Parsing via Label Transfer

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

1 Introduction

References