Self-Supervised Scene De-Occlusion | IEEE Conference Publication | IEEE Xplore

Self-Supervised Scene De-Occlusion


Abstract:

Natural scene understanding is a challenging task, particularly when encountering images of multiple objects that are partially occluded. This obstacle is given rise by v...Show More

Abstract:

Natural scene understanding is a challenging task, particularly when encountering images of multiple objects that are partially occluded. This obstacle is given rise by varying object ordering and positioning. Existing scene understanding paradigms are able to parse only the visible parts, resulting in incomplete and unstructured scene interpretation. In this paper, we investigate the problem of scene de-occlusion, which aims to recover the underlying occlusion ordering and complete the invisible parts of occluded objects. We make the first attempt to address the problem through a novel and unified framework that recovers hidden scene structures without ordering and amodal annotations as supervisions. This is achieved via Partial Completion Network (PCNet)-mask (M) and -content (C), that learn to recover fractions of object masks and contents, respectively, in a self-supervised manner. Based on PCNet-M and PCNet-C, we devise a novel inference scheme to accomplish scene de-occlusion, via progressive ordering recovery, amodal completion and content completion. Extensive experiments on real-world scenes demonstrate the superior performance of our approach to other alternatives. Remarkably, our approach that is trained in a self-supervised manner achieves comparable results to fully-supervised methods. The proposed scene de-occlusion framework benefits many applications, including high-quality and controllable image manipulation and scene recomposition (see Fig. 1), as well as the conversion of existing modal mask annotations to amodal mask annotations.
Date of Conference: 13-19 June 2020
Date Added to IEEE Xplore: 05 August 2020
ISBN Information:

ISSN Information:

Conference Location: Seattle, WA, USA
Citations are not available for this document.

1. Introduction

Scene understanding is one of the foundations of machine perception. A real-world scene, regardless of its context, often comprises multiple objects of varying ordering and positioning, with one or more object(s) being occluded by other object(s). Hence, scene understanding systems should be able to process modal perception, i.e., parsing the directly visible regions, as well as amodal perception [1]–[3], i.e., perceiving the intact structures of entities including invisible parts. The advent of advanced deep networks along with large-scale annotated datasets has facilitated many scene understanding tasks, e.g., object detection [4]–[7], scene parsing [8]–[10], and instance segmentation [11]–[14]. Nonetheless, these tasks mainly concentrate on modal perception, while amodal perception remains rarely explored to date.

Cites in Papers - |

Cites in Papers - IEEE (51)

Select All
1.
Hai Wang, Shilin Zhu, Long Chen, Yicheng Li, Yingfeng Cai, "OccludedInst: An Efficient Instance Segmentation Network for Automatic Driving Occlusion Scenes", IEEE Transactions on Emerging Topics in Computational Intelligence, vol.9, no.1, pp.253-270, 2025.
2.
Kaziwa Saleh, Sándor Szénási, Zoltán Vámossy, "Mask Guided Gated Convolution for Amodal Content Completion", 2024 IEEE 22nd Jubilee International Symposium on Intelligent Systems and Informatics (SISY), pp.000321-000326, 2024.
3.
Seong-Uk Jo, Du Yeol Lee, Chae Eun Rhee, "Occlusion-Aware Amodal Depth Estimation for Enhancing 3D Reconstruction From a Single Image", IEEE Access, vol.12, pp.106524-106536, 2024.
4.
Minh Tran, Winston Bounsavy, Khoa Vo, Anh Nguyen, Tri Nguyen, Ngan Le, "ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation", 2024 International Joint Conference on Neural Networks (IJCNN), pp.1-8, 2024.
5.
Katherine Xu, Lingzhi Zhang, Jianbo Shi, "Amodal Completion via Progressive Mixed Context Diffusion", 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.9099-9109, 2024.
6.
Guanqi Zhan, Chuanxia Zheng, Weidi Xie, Andrew Zisserman, "Amodal Ground Truth and Completion in the Wild", 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.28003-28013, 2024.
7.
Petru-Daniel Tudosiu, Yongxin Yang, Shifeng Zhang, Fei Chen, Steven McDonagh, Gerasimos Lampouras, Ignacio Iacobacci, Sarah Parisot, "MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation", 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.22413-22422, 2024.
8.
Ege Ozguroglu, Ruoshi Liu, Dídac Surís, Dian Chen, Achal Dave, Pavel Tokmakov, Carl Vondrick, "pix2gestalt: Amodal Segmentation by Synthesizing Wholes", 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.3931-3940, 2024.
9.
Jie Gui, Tuo Chen, Jing Zhang, Qiong Cao, Zhenan Sun, Hao Luo, Dacheng Tao, "A Survey on Self-Supervised Learning: Algorithms, Applications, and Future Trends", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.46, no.12, pp.9052-9071, 2024.
10.
Qiankun Liu, Yuqi Jiang, Zhentao Tan, Dongdong Chen, Ying Fu, Qi Chu, Gang Hua, Nenghai Yu, "Transformer Based Pluralistic Image Completion With Reduced Information Loss", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.46, no.10, pp.6652-6668, 2024.
11.
Jiayang Ao, Qiuhong Ke, Krista A. Ehinger, "Amodal Intra-class Instance Segmentation: Synthetic Datasets and Benchmark", 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp.280-289, 2024.
12.
Zhixuan Li, Weining Ye, Tingting Jiang, Tiejun Huang, "GIN: Generative INvariant Shape Prior for Amodal Instance Segmentation", IEEE Transactions on Multimedia, vol.26, pp.3924-3936, 2024.
13.
Xin Li, Wenjie Pei, Yaowei Wang, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang, "Self-Supervised Tracking via Target-Aware Data Synthesis", IEEE Transactions on Neural Networks and Learning Systems, vol.35, no.7, pp.9186-9197, 2024.
14.
Sihao Qi, Jiexin Xie, Haitao Yan, Shijie Guo, "DenseXFormer: An Effective Occluded Human Instance Segmentation Network based on Density Map for Nursing Robot", 2023 IEEE International Conference on Robotics and Biomimetics (ROBIO), pp.1-6, 2023.
15.
Wenxuan Ma, Liming Zheng, Yinghao Cai, Tao Lu, Shuo Wang, "Multi-view Self-supervised Object Segmentation", 2023 IEEE International Conference on Robotics and Biomimetics (ROBIO), pp.1-8, 2023.
16.
Bangning Ji, Gang He, "A Novel Diffusion-Model-Based Bone Scan Image Inpainting Algorithm", 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp.4907-4909, 2023.
17.
Felipe A. Costa de Oliveira, Borja Carrillo-Perez, Alberto García-Ortiz, Frank Sill Torres, "Integrity Assessment of Maritime Object Detection Impacted by Partial Camera Obstruction", 2023 7th International Conference on System Reliability and Safety (ICSRS), pp.474-480, 2023.
18.
Yi Zhao, Jinping Zhai, Xiaohui Li, "OpCNet: Endowing vehicles with perspective vision: Clairvoyance of occluded Pedestrian crossing in complex driving scenes", 2023 IEEE 98th Vehicular Technology Conference (VTC2023-Fall), pp.1-7, 2023.
19.
Jianxiong Gao, Xuelin Qian, Yikai Wang, Tianjun Xiao, Tong He, Zheng Zhang, Yanwei Fu, "Coarse-to-Fine Amodal Segmentation with Shape Prior", 2023 IEEE/CVF International Conference on Computer Vision (ICCV), pp.1262-1271, 2023.
20.
Ke Fan, Jingshi Lei, Xuelin Qian, Miaopeng Yu, Tianjun Xiao, Tong He, Zheng Zhang, Yanwei Fu, "Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation", 2023 IEEE/CVF International Conference on Computer Vision (ICCV), pp.1272-1281, 2023.
21.
Zhicheng Zhang, Shengzhe Liu, Jufeng Yang, "Multiple Planar Object Tracking", 2023 IEEE/CVF International Conference on Computer Vision (ICCV), pp.23403-23413, 2023.
22.
Zhixuan Li, Weining Ye, Juan Terven, Zachary Bennett, Ying Zheng, Tingting Jiang, Tiejun Huang, "MUVA: A New Large-Scale Benchmark for Multi-view Amodal Instance Segmentation in the Shopping Scenario", 2023 IEEE/CVF International Conference on Computer Vision (ICCV), pp.23447-23456, 2023.
23.
Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Philip H.S. Torr, Song Bai, "MOSE: A New Dataset for Video Object Segmentation in Complex Scenes", 2023 IEEE/CVF International Conference on Computer Vision (ICCV), pp.20167-20177, 2023.
24.
S. Raghavendra, Divya Rao, S. K. Abhilash, Venu Madhav Nookala, Praveen Gurunath Bharathi, "Elevating Amodal Segmentation Using ASH-Net Architecture for Accurate Object Boundary Estimation", IEEE Access, vol.11, pp.83377-83389, 2023.
25.
Ziling Wu, Armaghan Moemeni, Simon Castle-Green, Praminda Caleb-Solly, "Robustness of Deep Learning Methods for Occluded Object Detection - A Study Introducing a Novel Occlusion Dataset", 2023 International Joint Conference on Neural Networks (IJCNN), pp.1-10, 2023.
26.
Zhixuan Li, Ruohua Shi, Tiejun Huang, Tingting Jiang, "OAFormer: Learning Occlusion Distinguishable Feature for Amodal Instance Segmentation", ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.1-5, 2023.
27.
Jianming Xu, Yunfei Li, Qian Shi, Lin He, "Occluded Scene Classification via Cascade Supervised Contrastive Learning", IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol.16, pp.4565-4578, 2023.
28.
Lei Ke, Yu-Wing Tai, Chi-Keung Tang, "Occlusion-Aware Instance Segmentation Via BiLayer Network Architectures", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.45, no.8, pp.10197-10211, 2023.
29.
Zhijing Yang, Junyang Chen, Yukai Shi, Hao Li, Tianshui Chen, Liang Lin, "OccluMix: Towards De-Occlusion Virtual Try-on by Semantically-Guided Mixup", IEEE Transactions on Multimedia, vol.25, pp.1477-1488, 2023.
30.
Ruisong Zhang, Weize Quan, Yong Zhang, Jue Wang, Dong-Ming Yan, "W-Net: Structure and Texture Interaction for Image Inpainting", IEEE Transactions on Multimedia, vol.25, pp.7299-7310, 2023.

Cites in Papers - Other Publishers (29)

1.
Basile Van Hoorick, Rundi Wu, Ege Ozguroglu, Kyle Sargent, Ruoshi Liu, Pavel Tokmakov, Achal Dave, Changxi Zheng, Carl Vondrick, "Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis", Computer Vision – ECCV 2024, vol.15082, pp.313, 2025.
2.
Fernando Hermosillo-Reynoso, Deni Torres-Roman, "A Tensor Space for Multi-View and Multitask Learning Based on Einstein and Hadamard Products: A Case Study on Vehicle Traffic Surveillance Systems", Sensors, vol.24, no.23, pp.7463, 2024.
3.
Li Wei, Huang Ya, Zhang Xinyuan, Han Guijin, "Contour refinement instance segmentation for occluded objects", Journal of Image and Graphics, vol.29, no.5, pp.1221, 2024.
4.
Márton Szemenyei, Nándor Kőfaragó, "NeRF-YOLO: Detecting Occluded Objects via Multi-view Geometric Aggregation", 25th International Symposium on Measurements and Control in Robotics, vol.154, pp.13, 2024.
5.
Ivana Petrovska, Boris Jutzi, "Vision through Obstacles—3D Geometric Reconstruction and Evaluation of Neural Radiance Fields (NeRFs)", Remote Sensing, vol.16, no.7, pp.1188, 2024.
6.
Haiming Gan, Francesca Menegon, Aoshen Sun, Annalisa Scollo, Qingyan Jiang, Yueju Xue, Tomas Norton, "Peeking into the unseen: Occlusion-resistant segmentation for preweaning piglets under crushing events", Computers and Electronics in Agriculture, vol.219, pp.108683, 2024.
7.
Miguel Luna, Philip Chikontwe, Siwoo Nam, Sang Hyun Park, "Attention guided multi-scale cluster refinement with extended field of view for amodal nuclei segmentation", Computers in Biology and Medicine, pp.108015, 2024.
8.
Tharindu Kaluarachchi, Shamane Siriwardhana, Elliott Wen, Suranga Nanayakkara, "A Corneal Surface Reflections-Based Intelligent System for Lifelogging Applications", International Journal of Human?Computer Interaction, vol.39, no.9, pp.1963, 2023.
9.
Fengnan Quan, Bo Lang, "GIGAN: Self?supervised GAN for generating the invisible using cycle transformation and conditional normalization", IET Image Processing, 2023.
10.
Endai Huang, Zheng He, Axiu Mao, Maria Camila Ceballos, Thomas D. Parsons, Kai Liu, "A semi-supervised generative adversarial network for amodal instance segmentation of piglets in farrowing pens", Computers and Electronics in Agriculture, vol.209, pp.107839, 2023.
11.
Kaziwa Saleh, Sandor Szenasi, Zoltan Vamossy, "Generative Adversarial Network for Overcoming Occlusion in Images: A Survey", Algorithms, vol.16, no.3, pp.175, 2023.
12.
Marton Szemenyei, Matyas Szanto, "Occlusion Avoidance in a Simulated Environment Using Reinforcement Learning", Applied Sciences, vol.13, no.5, pp.3090, 2023.
13.
Jiayang Ao, Qiuhong Ke, Krista A. Ehinger, "Image amodal completion: A survey", Computer Vision and Image Understanding, vol.229, pp.103661, 2023.
14.
Qian Zhang, Qiyao Liang, Hong Liang, Ying Yang, "Removal and Recovery of the Human Invisible Region", Symmetry, vol.14, no.3, pp.531, 2022.
15.
Bo Wang, Qian Li, Zheng You, "Self-supervised Learning based Transformer and Convolution Hybrid Network for one-shot Organ Segmentation", Neurocomputing, 2022.
16.
Jingyu Wu, Zejian Li, Shengyuan Zhang, Lingyun Sun, "Amodal Layout Completion in Complex Outdoor Scenes", Artificial Intelligence, vol.13604, pp.30, 2022.
17.
Xiu Chen, Yujie Li, Yun Li, "Multi-feature fusion point cloud completion network", World Wide Web, vol.25, no.4, pp.1551, 2022.
18.
Hao Meng, Sheng Jin, Wentao Liu, Chen Qian, Mengxiang Lin, Wanli Ouyang, Ping Luo, "3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal", Computer Vision ? ECCV 2022, vol.13666, pp.380, 2022.
19.
Yifan Wang, Yongping Xie, "PCNetOP: Partial Completion Network with Order Prediction", The International Conference on Image, Vision and Intelligent Systems (ICIVIS 2021), vol.813, pp.1037, 2022.
20.
Wen Zhang, Zhonghua Miao, Nan Li, Chuangxin He, Teng Sun, "Review of Current Robotic Approaches for Precision Weed Management", Current Robotics Reports, vol.3, no.3, pp.139, 2022.
21.
Jia Gu, Fangzheng Tian, Il-Seok Oh, "Retinal vessel segmentation based on self-distillation and implicit neural representation", Applied Intelligence, 2022.
22.
Sang-Min Park, Young-Gab Kim, "Visual language navigation: a survey and open challenges", Artificial Intelligence Review, 2022.
23.
Bangbang Yang, Yinda Zhang, Yijin Li, Zhaopeng Cui, Sean Fanello, Hujun Bao, Guofeng Zhang, "Neural rendering in a room", ACM Transactions on Graphics, vol.41, no.4, pp.1, 2022.
24.
Jiyang Qi, Yan Gao, Yao Hu, Xinggang Wang, Xiaoyu Liu, Xiang Bai, Serge Belongie, Alan Yuille, Philip H. S. Torr, Song Bai, "Occluded Video Instance Segmentation: A Benchmark", International Journal of Computer Vision, vol.130, no.8, pp.2022, 2022.
25.
Matthias Korschens, Paul Bodesheim, Christine Romermann, Solveig Franziska Bucher, Mirco Migliavacca, Josephine Ulrich, Joachim Denzler, "Weakly Supervised Segmentation Pretraining for Plant Cover Prediction", Pattern Recognition, vol.13024, pp.589, 2021.
26.
Chuanxia Zheng, Duy-Son Dao, Guoxian Song, Tat-Jen Cham, Jianfei Cai, "Visiting the Invisible: Layer-by-Layer Completed Scene Decomposition", International Journal of Computer Vision, vol.129, no.12, pp.3195, 2021.
27.
Jiaqi Wang, Wenwei Zhang, Yuhang Cao, Kai Chen, Jiangmiao Pang, Tao Gong, Jianping Shi, Chen Change Loy, Dahua Lin, "Side-Aware Boundary Localization for More Precise Object Detection", Computer Vision ? ECCV 2020, vol.12349, pp.403, 2020.
28.
Matthias Korschens, Paul Bodesheim, Christine Romermann, Solveig Franziska Bucher, Josephine Ulrich, Joachim Denzler, "Towards Confirmable Automated Plant Cover Determination", Computer Vision ? ECCV 2020 Workshops, vol.12540, pp.312, 2020.
29.
Siqi Liu, Libiao Jin, Fang Miao, "Textual restoration of occluded Tibetan document pages based on side-enhanced U-Net", Journal of Electronic Imaging, vol.29, no.06, 2020.
Contact IEEE to Subscribe

References

References is not available for this document.