Loading [MathJax]/extensions/MathMenu.js
Layered object detection for multi-class segmentation | IEEE Conference Publication | IEEE Xplore

Layered object detection for multi-class segmentation


Abstract:

We formulate a layered model for object detection and multi-class segmentation. Our system uses the output of a bank of object detectors in order to define shape priors f...Show More

Abstract:

We formulate a layered model for object detection and multi-class segmentation. Our system uses the output of a bank of object detectors in order to define shape priors for support masks and then estimates appearance, depth ordering and labeling of pixels in the image. We train our system on the PASCAL segmentation challenge dataset and show good test results with state of the art performance in several categories including segmenting humans.
Date of Conference: 13-18 June 2010
Date Added to IEEE Xplore: 05 August 2010
ISBN Information:

ISSN Information:

Conference Location: San Francisco, CA, USA
References is not available for this document.

1. Introduction

Object detection is a fundamental task in computer vision. Most approaches formulate the problem as that of predicting a bounding box enclosing the object of interest – this is, for example, the evaluation criteria in the popular PASCAL Visual Object Recognition Challenge (VOC) [3]. However, a bounding-box output is clearly limited. For many objects, particularly those with complex, articulated shapes, a bounding box provides a poor description of the support of the object in the image. At the other extreme, one can attempt to produce an object class label for every pixel in the image. This is usually termed multi-class segmentation.

Select All
1.
P. Arbelaez, M. Maire, C. Fowlkes and J. Malik, "From contours to regions: An empirical evaluation", CVPR, vol. 4, 2009.
2.
N. Dalal and B. Triggs, "Histograms of oriented gradients for human detection", CVPR, vol. 1, pp. 886-893, 2005.
3.
M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn and A. Zisserman, The PASCAL Visual Object Classes Challenge 2009 (VOC2009) Results, no. 1, 5, 6, 7.
4.
P. F. Felzenszwalb, R. B. Girshick, D. McAllester and D. Ramanan, "Object detection with discriminatively trained part based models", IEEE PAMI, no. 1, 5, 6, 2009.
5.
R. Gao, T. Wu, S. Zhu and N. Sang, "Bayesian inference for layer representation with mixed markov random field", Energy Minimization Methods in CVPR, vol. 2, pp. 213-224.
6.
X. He, R. Zemel and M. Carreira-Perpinan, "Multiscale conditional random fields for image labeling", CVPR, vol. 2, no. 1, 2004.
7.
D. Hoiem, A. Efros and M. Hebert, "Putting objects in perspective", IJCV, vol. 80, no. 1, pp. 3-15, 2008.
8.
D. Hoiem, C. Rother and J. Wipp, "3d layout crf for multiview object class recognition and segmentation", CVPR, vol. 2, 2007.
9.
D. Hoiem, A. Stein, A. Efros and M. Hebert, "Recovering occlusion boundaries from a single image", ICCV, vol. 2, 2007.
10.
N. Jojic and B. Frey, "Learning flexible sprites in video layers", CVPR, vol. 1, no. 2, 2001.
11.
M. Kumar, P. Ton and A. Zisserman, "Obj cut", CVPR, vol. 1, no. 2, 2005.
12.
M. Kumar, P. Torr and A. Zisserman, "Learning layered pictorial structures from video", Indian Conf on Comp Vis Graphics and Image Proc, vol. 2, pp. 158-163, 2004.
13.
S. Kumar and M. Hebert, "A hierarchical field framework for unified context-based classification", ICCV, vol. 2, no. 1, 2005.
14.
B. Leibe, A. Leonardis and B. Schiele, "Combined object categorization and segmentation with an implicit shape model", ECCV 04 workshop on statistical learning in computer vision, vol. 2, pp. 17-32, 2004.
15.
A. Levin and Y. Weiss, "Learning to combine bottom-up and top-down segmentation", International Journal of Computer Vision, vol. 81, no. 1, pp. 105-118, 2009.
16.
M. Nitzberg, D. Mumford and T. Shiota, Filtering Segmentation and Depth, Springer-Verlag, vol. 2, 1993.
17.
D. Ramanan, "Using segmentation to verify object hypotheses", CVPR, vol. 2, 2006.
18.
X. Ren, C. Fowlkes and J. Malik, "Figure/ground assignment in natural images", ECCV, vol. 6, 2006.
19.
A. Saxena, M. Sun and A. Ng, "Make3D: Learning 3D Scene Structure from a Single Still Image", IEEE TPAMI, vol. 2, pp. 824-840, 2009.
20.
J. Shotton, J. Winn, C. Rother and A. Criminisi, "Texton-boost: Joint appearance shape and context modeling for multi-class object recognition and segmentation", ECCV, vol. 1, pp. 3951:1, 2006.
21.
A. Torralba, K. Murphy and W. Freeman, "Contextual models for object detection using boosted random fields", NIPS, vol. 1, 2004.
22.
Z. Tu, "Auto-context and its application to high-level vision tasks", IEEE CVPR, vol. 1, 2008.
23.
Z. Tu, X. Chen, A. Yuille and S. Zhu, "Image parsing: Unifying segmentation detection and recognition", IJCV, vol. 63, no. 2, pp. 113-140, 2005.
24.
P. A. Viola and M. J. Jones, "Robust real-time face detection", IJCV, vol. 57, no. 2, pp. 137-154, 2004.
25.
J. Wang and E. Adelson, "Representing moving images with layers", IEEE Trans on Image Processing, vol. 3, no. 5, pp. 625-638, 1994.
26.
J. Winn and J. Shotton, "The layout consistent random field for recognizing and segmenting partially occluded objects", CVPR, vol. 2, 2006.
27.
S. Yu, R. Gross and J. Shi, "Concurrent object recognition and segmentation by graph partitioning", NIPS, vol. 2, pp. 1407-1414, 2003.
Contact IEEE to Subscribe

References

References is not available for this document.