Decomposing a scene into geometric and semantically consistent regions

Abstract:

High-level, or holistic, scene understanding involves reasoning about objects, regions, and the 3D relationships between them. This requires a representation above the level of pixels that can be endowed with high-level attributes such as class of object/region, its orientation, and (rough 3D) location within the scene. Towards this goal, we propose a region-based model which combines appearance and scene geometry to automatically decompose a scene into semantically meaningful regions. Our model is defined in terms of a unified energy function over scene appearance and structure. We show how this energy function can be learned from data and present an efficient inference technique that makes use of multiple over-segmentations of the image to propose moves in the energy-space. We show, experimentally, that our method achieves state-of-the-art performance on the tasks of both multi-class image segmentation and geometric reasoning. Finally, by understanding region classes and geometry, we show how our model can be used as the basis for 3D reconstruction of the scene.
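As a rough illustration of the inference idea described in the abstract (moves in energy space proposed from multiple over-segmentations), the following Python sketch greedily relabels whole segments whenever doing so lowers a toy energy. The energy used here (per-pixel unary costs plus a Potts smoothness term), the function names `energy` and `segment_move_inference`, and the parameters `smooth_weight` and `max_rounds` are assumptions made for illustration only. They are not the paper's model, which combines appearance and scene geometry over regions.

```python
import numpy as np


def energy(labels, unary, smooth_weight):
    """Toy energy (an assumption, not the paper's): unary costs + Potts penalty."""
    h, w = labels.shape
    rows, cols = np.indices((h, w))
    # Cost of the chosen label at every pixel.
    data_term = unary[rows, cols, labels].sum()
    # Number of 4-connected neighbor pairs whose labels disagree.
    disagreements = (labels[:, 1:] != labels[:, :-1]).sum() \
        + (labels[1:, :] != labels[:-1, :]).sum()
    return float(data_term + smooth_weight * disagreements)


def segment_move_inference(unary, oversegmentations, smooth_weight=1.0, max_rounds=10):
    """Greedy move-making: relabel one segment at a time if the energy drops.

    unary: (H, W, K) array of per-pixel label costs.
    oversegmentations: list of (H, W) integer segment-id maps.
    """
    labels = unary.argmin(axis=2)          # start from the per-pixel best label
    best = energy(labels, unary, smooth_weight)
    num_labels = unary.shape[2]
    for _ in range(max_rounds):
        improved = False
        for seg_map in oversegmentations:
            for seg_id in np.unique(seg_map):
                mask = seg_map == seg_id
                for k in range(num_labels):
                    # Proposed move: assign label k to every pixel in this segment.
                    candidate = labels.copy()
                    candidate[mask] = k
                    e = energy(candidate, unary, smooth_weight)
                    if e < best:           # accept only strictly improving moves
                        labels, best = candidate, e
                        improved = True
        if not improved:
            break                          # no proposal lowered the energy
    return labels, best
```

Calling `segment_move_inference(unary, [seg_a, seg_b])` with two segment-id maps of the same image lets coarse and fine segments both propose moves. Recomputing the full energy for every proposal keeps the sketch short but is wasteful; a practical implementation would evaluate only the terms a move changes.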

Published in: 2009 IEEE 12th International Conference on Computer Vision

Date of Conference: 29 September 2009 - 02 October 2009

Date Added to IEEE Xplore: 06 May 2010

DOI: 10.1109/ICCV.2009.5459211

Conference Location: Kyoto, Japan

Stephen Gould

Department of Electrical Engineering, Stanford University, USA

Richard Fulton

Department of Computer Science, Stanford University, USA

Daphne Koller

Department of Computer Science, Stanford University, USA

1. Introduction

With recent success on many vision subtasks (object detection [21], [18], [3], multi-class image segmentation [17], [7], [13], and 3D reconstruction [10], [16]), holistic scene understanding has emerged as one of the next great challenges for computer vision [11], [9], [19]. Here the aim is to reason jointly about the objects, regions, and geometry of a scene, in the hope of avoiding the many errors induced by modeling these tasks in isolation.
