Loading [a11y]/accessibility-menu.js
Calibrated RGB-D Salient Object Detection | IEEE Conference Publication | IEEE Xplore

Calibrated RGB-D Salient Object Detection


Abstract:

Complex backgrounds and similar appearances between objects and their surroundings are generally recognized as challenging scenarios in Salient Object Detection (SOD). Th...Show More

Abstract:

Complex backgrounds and similar appearances between objects and their surroundings are generally recognized as challenging scenarios in Salient Object Detection (SOD). This naturally leads to the incorporation of depth information in addition to the conventional RGB image as input, known as RGB-D SOD or depth-aware SOD. Meanwhile, this emerging line of research has been considerably hindered by the noise and ambiguity that prevail in raw depth images. To address the aforementioned issues, we propose a Depth Calibration and Fusion (DCF) framework that contains two novel components: 1) a learning strategy to calibrate the latent bias in the original depth maps towards boosting the SOD performance; 2) a simple yet effective cross reference module to fuse features from both RGB and depth modalities. Extensive empirical experiments demonstrate that the proposed approach achieves superior performance against 27 state-of-the-art methods. Moreover, our depth calibration strategy alone can work as a preprocessing step; empirically it results in noticeable improvements when being applied to existing cutting-edge RGB-D SOD models. Source code is available at https://github.com/jiwei0921/DCF.
Date of Conference: 20-25 June 2021
Date Added to IEEE Xplore: 02 November 2021
ISBN Information:

ISSN Information:

Conference Location: Nashville, TN, USA

Funding Agency:

References is not available for this document.

1. Introduction

Salient Object Detection (SOD) is an important computer vision problem that aims to identify and segment the most prominent object in a scene. It has found successful applications in a variety of tasks such as object recognition [59], image retrieval [38], [61], SLAM [37] and video analysis [25], [19], [14]. To tackle the innate challenges in addressing difficult scenes with low texture contrast or in the presence of cluttered backgrounds, depth information has been incorporated as a complementary input source. The growing interests in the development of RGB-D SOD methods [12], [42], [48] are especially boosted by the rapid progress and flourish of varied 3D imaging sensors [29], ranging from the traditional stereo imaging that produces disparity maps, to the more recent structured lighting [76], [30], time-of-flight, light field [63], [71], [72] and LIDAR cameras that directly generate depth images. As showcased by the recent cross-modality fusion schemes [7], [10], [44], adding depth-map on top of RGB image as an extra input leads to superior performance in localizing salient objects on challenging scenes.

Select All
1.
Radhakrishna Achanta, Sheila Hemami, Francisco Estrada and Sabine Susstrunk, "Frequency-tuned salient region detection", CVPR, pp. 1597-1604, 2009.
2.
Robert S Allison, Barbara J Gillam and Elia Vecellio, "Binocular depth discrimination and estimation beyond interaction space", Journal of Vision, vol. 9, no. 1, pp. 10-10, 2009.
3.
Dinkar N Bhat and Shree K Nayar, "Stereo in the presence of specular reflection", ICCV, pp. 1086-1092, 1995.
4.
Ali Borji, Ming-Ming Cheng, Huaizu Jiang and Jia Li, "Salient object detection: A benchmark", IEEE Transactions on Image Processing, vol. 24, no. 12, pp. 5706-5722, 2015.
5.
Chenglizhao Chen, Jipeng Wei, Chong Peng and Hong Qin, "Depth-quality-aware salient object detection", IEEE Transactions on Image Processing, vol. 30, pp. 2350-2363, 2021.
6.
Chenglizhao Chen, Jipeng Wei, Chong Peng, Weizhong Zhang and Hong Qin, "Improved saliency detection in RGB-D images using two-phase depth estimation and selective deep fusion", IEEE Transactions on Image Processing, vol. 29, pp. 4296-4307, 2020.
7.
Hao Chen, Yongjian Deng, Youfu Li, Tzu-Yi Hung and Guosheng Lin, "RGBD salient object detection via disentangled cross-modal fusion", IEEE Transactions on Image Processing, vol. 29, pp. 8407-8416, 2020.
8.
Hao Chen and Youfu Li, "Progressively complementarity-aware fusion network for RGB-D salient object detection", CVPR, pp. 3051-3060, 2018.
9.
Hao Chen and Youfu Li, "Three-stream attention-aware network for RGB-D salient object detection", IEEE Transactions on Image Processing, vol. 28, no. 6, pp. 2825-2835, 2019.
10.
Hao Chen, Youfu Li and Dan Su, "Multi-modal fusion network with multi-scale multi-path and cross-modal interactions for RGB-D salient object detection", Pattern Recognition, vol. 86, pp. 376-385, 2019.
11.
Shuhan Chen and Yun Fu, "Progressively guided alternate refinement network for RGB-D salient object detection", ECCV, 2020.
12.
Zuyao Chen, Runmin Cong, Qianqian Xu and Qingming Huang, "Depth potentiality-aware gated attention network for RGB-D salient object detection", IEEE Transactions on Image Processing, 2020.
13.
Yupeng Cheng, Huazhu Fu, Xingxing Wei, Jiangjian Xiao and Xiaochun Cao, "Depth enhanced saliency detection method", Proceedings of International Conference on Internet Multimedia Computing and Service, pp. 23-27, 2014.
14.
Runmin Cong, Jianjun Lei, Huazhu Fu, Ming-Ming Cheng, Weisi Lin and Qingming Huang, "Review of visual saliency detection with comprehensive information", IEEE Transactions on circuits and Systems for Video Technology, vol. 29, no. 10, pp. 2941-2959, 2018.
15.
Runmin Cong, Jianjun Lei, Huazhu Fu, Junhui Hou, Qingming Huang and Sam Kwong, "Going from RGB to RGBD saliency: A depth-guided transformation model", IEEE Transactions on Systems Man and Cybernetics, vol. 50, no. 8, pp. 3627-3639, 2020.
16.
Runmin Cong, Jianjun Lei, Huazhu Fu, Qingming Huang, Xiaochun Cao and Chunping Hou, "Co-saliency detection for RGBD images based on multi-constraint feature matching and cross label propagation", IEEE Transactions on Image Processing, vol. 27, no. 2, pp. 568-579, 2018.
17.
Runmin Cong, Jianjun Lei, Huazhu Fu, Qingming Huang, Xiaochun Cao and Nam Ling, "HSCS: Hierarchical sparsity based co-saliency detection for RGBD images", IEEE Transactions on Multimedia, vol. 21, no. 7, pp. 1660-1671, 2018.
18.
Runmin Cong, Jianjun Lei, Huazhu Fu, Weisi Lin, Qingming Huang, Xiaochun Cao, et al., "An iterative co-saliency framework for RGBD images", IEEE Transactions on Cybernetics, vol. 49, no. 1, pp. 233-246, 2019.
19.
Runmin Cong, Jianjun Lei, Huazhu Fu, Fatih Porikli, Qingming Huang and Chunping Hou, "Video saliency detection via sparsity-based reconstruction and propagation", IEEE Transactions on Image Processing, vol. 28, no. 10, pp. 4819-4931, 2019.
20.
Runmin Cong, Jianjun Lei, Changqing Zhang, Qingming Huang, Xiaochun Cao and Chunping Hou, "Saliency detection for stereoscopic images based on depth confidence analysis and multiple cues fusion", IEEE Signal Processing Letters, vol. 23, no. 6, pp. 819-823, 2016.
21.
Karthik Desingh, K Madhava Krishna, Deepu Rajan and CV Jawahar, "Depth really matters: Improving visual salient region detection with depth", BMVC, 2013.
22.
Deng-Ping Fan, Ming-Ming Cheng, Jiang-Jiang Liu, Shang-Hua Gao, Qibin Hou and Ali Borji, "Salient objects in clutter: Bringing salient object detection to the foreground", ECCV, pp. 186-202, 2018.
23.
Deng-Ping Fan, Cheng Gong, Yang Cao, Bo Ren, Ming-Ming Cheng and Ali Borji, "Enhanced-alignment measure for binary foreground map evaluation", IJCAI, pp. 698-704, 2018.
24.
Deng-Ping Fan, Zheng Lin, Zhao Zhang, Menglong Zhu and Ming-Ming Cheng, "Rethinking RGB-D salient object detection: Models data sets and large-scale benchmarks", IEEE Transactions on Neural Networks and Learning Systems, 2020.
25.
Deng-Ping Fan, Wenguan Wang, Ming-Ming Cheng and Jianbing Shen, "Shifting more attention to video salient object detection", CVPR, pp. 8554-8564, 2019.
26.
Deng-Ping Fan, Yingjie Zhai, Ali Borji, Jufeng Yang and Ling Shao, "BBS-Net: RGB-D salient object detection with a bifurcated backbone strategy network", ECCV, 2020.
27.
David Feng, Nick Barnes, Shaodi You and Chris McCarthy, "Local background enclosure for RGB-D salient object detection", CVPR, pp. 2343-2350, 2016.
28.
Keren Fu, Deng-Ping Fan, Ge-Peng Ji and Qijun Zhao, "JL-DCF: Joint learning and densely-cooperative fusion framework for RGB-D salient object detection", CVPR, pp. 3052-3062, 2020.
29.
Silvio Giancola, Matteo Valenti and Remo Sala, A Survey on 3D Cameras: Metrological Comparison of Time-of-Flight Structured-Light and Active Stereoscopy Technologies, Springer, 2018.
30.
Chunle Guo, Chongyi Li, Jichang Guo, Chen Change Loy, Junhui Hou, Sam Kwong, et al., "Zero-reference deep curve estimation for low-light image enhancement", CVPR, pp. 1780-1789, 2020.
Contact IEEE to Subscribe

References

References is not available for this document.