I. Introduction
Salient object detection (SOD) is an essential and important task in computer vision. The goal of SOD is to detect and highlight the most salient objects in visual input, such as color images, RGB-D images and videos. It has been applied to many other computer vision tasks, such as visual tracking [1], image captioning [2], weakly supervised learning [3], object segmentation [4], [5], etc. Several surveys on color image SOD [6]–[8], RGB-D SOD [9], [10] and video SOD [11], [12] summarize recent developments of SOD in detail. Since the distance-to-camera cues of depth maps naturally supplement appearance information from RGB images for SOD, RGB-D SOD has recently attracted increasing amount of research attention, especially considering the popularity of affordable RGB-D sensors. Numerous RGB-D SOD methods [13]–[41] have been proposed for this purpose and substantial advancements have been achieved.