I. Introduction
Medical image classification aims to distinguish medical images based on clinical pathologies or imaging modalities [1], and medical image segmentation can locate the position of lesions and assist clinicians to quantitatively evaluate the effect in preoperative and postoperative examination [2]. In recent years, deep learning methods, especially deep convolutional neural networks (DCNNs), have gained significant achievements in medical image classification and segmentation tasks. However, they have not achieved the same success on medical images as they have done on natural images [3], [4]. In this work, two challenges in medical image classification and segmentation are exploited and considered.
Lack of feature diversity: Different from natural color images, most medical images are gray-scale images and the lesion area usually has low intensity contrast with the neighboring normal tissues [5], [6]. As shown in Fig. 1(a) of one malignant ultrasound image for breast, the lesion area is blocked with yellow dotted line, which is shown to have very low intensity contrast with the surrounding normal tissues. Hence, it is difficult to extract as diverse features as that extracted from natural color images through convolution operation only [7];
Neglect of small lesions: The key information such as the lesion area in medical images often occupies much fewer pixels than normal tissue and the lesion area is different from normal tissue on particulars [8], [9]. In other words, the particulars in medical images are not so unimportant as in natural images. As shown in Fig. 1(b) of one AMD (Age-related Macular Degeneration) fundus images. The lesion areas are blocked with yellow dotted line, which are too small and inconspicuous when compared to normal tissues. Hence, it is difficult to learn discriminant features from the small and inconspicuous lesions through convolution operation only due to the neglect of small lesions in high level feature maps [10].
Illustration of medical image examples: (a) a malignant ultrasound image; and (b) an AMD fundus image.