I. Introduction
As a fundamental image understanding technique, salient object detection (SOD) aims at segmenting the most eye-attracting objects in a natural image. Although recent SOD approaches [1]–[5] have achieved much success, their generated saliency maps cannot discriminate different salient instances, which has prevented many applications from applying SOD for instance-level image understanding [6]. Motivated by [7], in this paper, we tackle the more challenging case of SOD, called salient instance segmentation (SIS). SIS segments salient objects from an image and discriminates salient instances by associating each instance with a different label. SIS can facilitate more advanced tasks than SOD, such as image captioning [8], weakly-supervised semantic/instance segmentation [9], [10], and visual tracking [11].