Loading [MathJax]/extensions/MathMenu.js
Scene Parsing with Object Instances and Occlusion Ordering | IEEE Conference Publication | IEEE Xplore

Scene Parsing with Object Instances and Occlusion Ordering


Abstract:

This work proposes a method to interpret a scene by assigning a semantic label at every pixel and inferring the spatial extent of individual object instances together wit...Show More

Abstract:

This work proposes a method to interpret a scene by assigning a semantic label at every pixel and inferring the spatial extent of individual object instances together with their occlusion relationships. Starting with an initial pixel labeling and a set of candidate object masks for a given test image, we select a subset of objects that explain the image well and have valid overlap relationships and occlusion ordering. This is done by minimizing an integer quadratic program either using a greedy method or a standard solver. Then we alternate between using the object predictions to refine the pixel labels and vice versa. The proposed system obtains promising results on two challenging subsets of the LabelMe and SUN datasets, the largest of which contains 45, 676 images and 232 classes.
Date of Conference: 23-28 June 2014
Date Added to IEEE Xplore: 25 September 2014
Electronic ISBN:978-1-4799-5118-5

ISSN Information:

Conference Location: Columbus, OH, USA
Citations are not available for this document.

1 Introduction

Many state-of-the-art image parsing or semantic segmentation methods attempt to compute a labeling of every pixel or segmentation region in an image [2], [4], [7], [14], [15], [19], [20]. Despite their rapidly increasing accuracy, these methods have several limitations. First, they have no notion of object instances - given an image with multiple nearby or overlapping cars, these methods are likely to produce a blob of “car” labels instead of separately delineated instances (Figure 1(a)). In addition, pixel labeling methods tend to be more accurate for “stuff” classes that are characterized by local appearance rather than overall shape - classes such as road, sky, tree, and building. To do better on “thing” classes such as car, cat, person, and vase - as well as to gain the ability to represent object instances - it becomes necessary to incorporate detectors that model the overall object shape.

Cites in Papers - |

Cites in Papers - IEEE (60)

Select All
1.
Jonathan Tay Yu Liang, Tanaka Kanji, "HO3-SLAM: Human-Object Occlusion Ordering as Add-on for Enhancing Traversability Prediction in Dynamic SLAM", 2023 IEEE 13th International Conference on Pattern Recognition Systems (ICPRS), pp.1-7, 2023.
2.
Jitesh Jain, Jiachen Li, MangTik Chiu, Ali Hassani, Nikita Orlov, Humphrey Shi, "OneFormer: One Transformer to Rule Universal Image Segmentation", 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.2989-2998, 2023.
3.
Yuxin Fang, Xinggang Wang, Rui Wu, Wenyu Liu, "What Makes for Hierarchical Vision Transformer?", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.45, no.10, pp.12714-12720, 2023.
4.
Ana-Maria Simion, Șerban Radu, "Experiments with Semi-Supervised Learning: from Cityscapes to Medical Images", 2023 24th International Conference on Control Systems and Computer Science (CSCS), pp.477-483, 2023.
5.
Lei Ke, Yu-Wing Tai, Chi-Keung Tang, "Occlusion-Aware Instance Segmentation Via BiLayer Network Architectures", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.45, no.8, pp.10197-10211, 2023.
6.
Bo Sun, Jason Kuen, Zhe Lin, Philippos Mordohai, Simon Chen, "PRN: Panoptic Refinement Network", 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp.3952-3962, 2023.
7.
Mert Kalfa, Sadık Yağız Yetim, Arda Atalik, Mehmetcan Gök, Yiqun Ge, Rong Li, Wen Tong, Tolga M. Duman, Orhan Arıkan, "Reliable Extraction of Semantic Information and Rate of Innovation Estimation for Graph Signals", IEEE Journal on Selected Areas in Communications, vol.41, no.1, pp.119-140, 2023.
8.
John Lambert, Zhuang Liu, Ozan Sener, James Hays, Vladlen Koltun, "MSeg: A Composite Dataset for Multi-Domain Semantic Segmentation", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.45, no.1, pp.796-810, 2023.
9.
James Ross, Oscar Mendez, Avishkar Saha, Mark Johnson, Richard Bowden, "BEV-SLAM: Building a Globally-Consistent World Map Using Monocular Vision", 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp.3830-3836, 2022.
10.
N Dinesh Reddy, Robert Tamburo, Srinivasa G. Narasimhan, "WALT: Watch And Learn 2D amodal representation from Time-lapse imagery", 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.9346-9356, 2022.
11.
Hyunmin Lee, Jaesik Park, "Instance-wise Occlusion and Depth Orders in Natural Scenes", 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.21178-21189, 2022.
12.
Buyu Liu, Bingbing Zhuang, Manmohan Chandraker, "Weakly But Deeply Supervised Occlusion-Reasoned Parametric Road Layouts", 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.16979-16988, 2022.
13.
Lei Ke, Yu-Wing Tai, Chi-Keung Tang, "Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers", 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.4018-4027, 2021.
14.
Tommi Kerola, Jie Li, Atsushi Kanehira, Yasunori Kudo, Alexis Vallet, Adrien Gaidon, "Hierarchical Lovász Embeddings for Proposal-free Panoptic Segmentation", 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.14408-14418, 2021.
15.
Xiaoding Yuan, Adam Kortylewski, Yihong Sun, Alan Yuille, "Robust Instance Segmentation through Reasoning about Multi-Object Occlusion", 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.11136-11145, 2021.
16.
Avishkar Saha, Oscar Mendez, Chris Russell, Richard Bowden, "Enabling spatio-temporal aggregation in Birds-Eye-View Vehicle Estimation", 2021 IEEE International Conference on Robotics and Automation (ICRA), pp.5133-5139, 2021.
17.
Yuanyuan Liu, Dingyuan Chen, Ailong Ma, Yanfei Zhong, Fang Fang, Kai Xu, "Multiscale U-Shaped CNN Building Instance Extraction Framework With Edge Constraint for High-Spatial-Resolution Remote Sensing Imagery", IEEE Transactions on Geoscience and Remote Sensing, vol.59, no.7, pp.6106-6120, 2021.
18.
Justin Lazarow, Kwonjoon Lee, Kunyu Shi, Zhuowen Tu, "Learning Instance Occlusion for Panoptic Segmentation", 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.10717-10726, 2020.
19.
Buyu Liu, Bingbing Zhuang, Samuel Schulter, Pan Ji, Manmohan Chandraker, "Understanding Road Layout From Videos as a Whole", 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.4413-4422, 2020.
20.
Ziyu Jiang, Buyu Liu, Samuel Schulter, Zhangyang Wang, Manmohan Chandraker, "Peek-a-Boo: Occlusion Reasoning in Indoor Scenes With Plane Representations", 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.110-118, 2020.
21.
Xiaohang Zhan, Xingang Pan, Bo Dai, Ziwei Liu, Dahua Lin, Chen Change Loy, "Self-Supervised Scene De-Occlusion", 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.3783-3791, 2020.
22.
Shuang Qiu, Yao Zhao, Jianbo Jiao, Yunchao Wei, Shikui Wei, "Referring Image Segmentation by Generative Adversarial Learning", IEEE Transactions on Multimedia, vol.22, no.5, pp.1333-1344, 2020.
23.
Helisa Dhamo, Nassir Navab, Federico Tombari, "Object-Driven Multi-Layer Scene Decomposition From a Single Image", 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp.5368-5377, 2019.
24.
Xiaosheng Yan, Yuanlong Yu, Feigege Wang, Wenxi Liu, Shengfeng He, Jia Pan, "Visualizing the Invisible: Occluded Vehicle Segmentation and Recovery", 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp.7617-7626, 2019.
25.
Alexander Kirillov, Ross Girshick, Kaiming He, Piotr Dollár, "Panoptic Feature Pyramid Networks", 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.6392-6401, 2019.
26.
Lorenzo Porzi, Samuel Rota Bulò, Aleksander Colovic, Peter Kontschieder, "Seamless Scene Segmentation", 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.8269-8278, 2019.
27.
Chen Liu, Kihwan Kim, Jinwei Gu, Yasutaka Furukawa, Jan Kautz, "PlaneRCNN: 3D Plane Detection and Reconstruction From a Single Image", 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.4445-4454, 2019.
28.
Alexander Kirillov, Kaiming He, Ross Girshick, Carsten Rother, Piotr Dollár, "Panoptic Segmentation", 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.9396-9405, 2019.
29.
Ziyan Wang, Buyu Liu, Samuel Schulter, Manmohan Chandraker, "A Parametric Top-View Representation of Complex Road Scenes", 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.10317-10325, 2019.
30.
Yuan-Ting Hu, Hong-Shuo Chen, Kexin Hui, Jia-Bin Huang, Alexander G. Schwing, "SAIL-VOS: Semantic Amodal Instance Level Video Object Segmentation – A Synthetic Dataset and Baselines", 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.3100-3110, 2019.

Cites in Papers - Other Publishers (28)

1.
Congying An, Jingjing Wu, Huanlong Zhang, "Occlusion-aware segmentation via RCF-Pix2Pix generative network", The Visual Computer, 2024.
2.
Sarfaraz Ahmed Mohammed, Anca L. Ralescu, "Insights into Image Understanding: Segmentation Methods for Object Recognition and Scene Classification", Algorithms, vol.17, no.5, pp.189, 2024.
3.
Taichi Fukuda, Kotaro Hasegawa, Shinya Ishizaki, Shohei Nobuhara, Ko Nishino, "BlindSpotNet: Seeing Where We Cannot See", Computer Vision – ECCV 2022 Workshops, vol.13801, pp.554, 2023.
4.
Libo Weng, Yingjie Wang, Fei Gao, "Traffic Scene Perception Based on Joint Object Detection and Semantic Segmentation", Neural Processing Letters, vol.54, no.6, pp.5333, 2022.
5.
Alexy Bhowmick, Sarat Saharia, Shyamanta M. Hazarika, "Non-parametric scene parsing: Label transfer methods and datasets", Computer Vision and Image Understanding, vol.219, pp.103418, 2022.
6.
Chuanxia Zheng, Duy-Son Dao, Guoxian Song, Tat-Jen Cham, Jianfei Cai, "Visiting the Invisible: Layer-by-Layer Completed Scene Decomposition", International Journal of Computer Vision, vol.129, no.12, pp.3195, 2021.
7.
Mert Kalfa, Mehmetcan Gok, Arda Atalik, Busra Tegin, Tolga M. Duman, Orhan Arikan, "Towards goal-oriented semantic signal processing: Applications and future challenges", Digital Signal Processing, vol.119, pp.103134, 2021.
8.
Ujwal Bonde, Pablo F. Alcantarilla, Stefan Leutenegger, "Towards Bounding-Box Free Panoptic Segmentation", Pattern Recognition, vol.12544, pp.316, 2021.
9.
Rohit Mohan, Abhinav Valada, "EfficientPS: Efficient Panoptic Segmentation", International Journal of Computer Vision, vol.129, no.5, pp.1551, 2021.
10.
Takuya Kamitani, Yuta Yamaguchi, Masashi Nishiyama, Yoshio Iwai, "Identification of People by Measuring Body Sway with Self-Occlusion using Top-View Camera", IEEJ Transactions on Electronics, Information and Systems, vol.140, no.6, pp.629, 2020.
11.
Takuya Kamitani, Yuta Yamaguchi, Masashi Nishiyama, Yoshio Iwai, "Identifying People Using Body Sway in Case of Self-occlusion", Frontiers of Computer Vision, vol.1212, pp.136, 2020.
12.
Melani Sanchez-Garcia, Ruben Martinez-Cantin, Jose J. Guerrero, "Semantic and structural image segmentation for prosthetic vision", PLOS ONE, vol.15, no.1, pp.e0227677, 2020.
13.
Ziheng Zhang, Anpei Chen, Ling Xie, Jingyi Yu, Shenghua Gao, "Learning Semantics-aware Distance Map with Semantics Layering Network for Amodal Instance Segmentation", Proceedings of the 27th ACM International Conference on Multimedia, pp.2124, 2019.
14.
Guoxia Wang, Xiaochuan Wang, Frederick W. B. Li, Xiaohui Liang, "DOOBNet: Deep Object Occlusion Boundary Detection from an Image", Computer Vision ? ACCV 2018, vol.11366, pp.686, 2019.
15.
Sergey Kosov, Kimiaki Shirahama, Marcin Grzegorzek, "Labeling of partially occluded regions via the multi-layer CRF", Multimedia Tools and Applications, vol.78, no.2, pp.2551, 2019.
16.
David Novotny, Samuel Albanie, Diane Larlus, Andrea Vedaldi, "Semi-convolutional Operators for Instance Segmentation", Computer Vision ? ECCV 2018, vol.11205, pp.89, 2018.
17.
Rabia Tahir, Sheikh Ziauddin, Ahmad R. Shahid, Asad Safi, "Outdoor Scene Labeling Using ALE and LSC Superpixels", Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing, pp.1, 2017.
18.
Fuhai Chen, Rongrong Ji, Jinsong Su, Yongjian Wu, Yunsheng Wu, "StructCap", Proceedings of the 25th ACM international conference on Multimedia, pp.46, 2017.
19.
Jan van den Brand, Matthias Ochs, Rudolf Mester, "Instance-Level Segmentation of Vehicles by Deep Contours", Computer Vision ? ACCV 2016 Workshops, vol.10116, pp.477, 2017.
20.
Holger Caesar, Jasper Uijlings, Vittorio Ferrari, Computer Vision ? ECCV 2016, vol.9905, pp.381, 2016.
21.
Ke Li, Jitendra Malik, Computer Vision ? ECCV 2016, vol.9906, pp.677, 2016.
22.
Bernardino Romera-Paredes, Philip Hilaire Sean Torr, Computer Vision ? ECCV 2016, vol.9910, pp.312, 2016.
23.
Jonas Uhrig, Marius Cordts, Uwe Franke, Thomas Brox, Pattern Recognition, vol.9796, pp.14, 2016.
24.
Hongyuan Zhu, Fanman Meng, Jianfei Cai, Shijian Lu, "Beyond pixels: A comprehensive survey from bottom-up to semantic image segmentation and cosegmentation", Journal of Visual Communication and Image Representation, vol.34, pp.12, 2016.
25.
Marco Gori, Marco Lippi, Marco Maggini, Stefano Melacci, Marcello Pelillo, Image Analysis and Processing ? ICIAP 2015, vol.9280, pp.697, 2015.
26.
Andreas Geiger, Chaohui Wang, Pattern Recognition, vol.9358, pp.183, 2015.
27.
Joseph Tighe, Marc Niethammer, Svetlana Lazebnik, "Scene Parsing with Object Instance Inference Using Regions and Per-exemplar Detectors", International Journal of Computer Vision, vol.112, no.2, pp.150, 2015.
28.
Saurabh Gupta, Ross Girshick, Pablo Arbelaez, Jitendra Malik, Computer Vision ? ECCV 2014, vol.8695, pp.345, 2014.
Contact IEEE to Subscribe

References

References is not available for this document.