I. Introduction
In Clinical diagnosis, the core of medical image segmentation is to delineate the objects of interest from the complex background on various biomedical images [1], [2], [3], such as X-ray, Computerized Tomography (CT), Magnetic Resonance Imaging (MRI), and Ultrasound. It is useful for quantitative diagnosis and morphological analysis of specific lesions in human organs and tissues. As shown in Fig. 1, it requires enormous effort and patience to handle the complex contours and textures. However, traditional manual annotation heavily relies on clinical experiences. Measurements based on manual annotations by clinicians might be highly accurate, but they can also be labor-intensive under typical clinical settings. Therefore, it is in great demand to develop accurate medical image segmentation methods.