I. Introduction
Hyperspectral imaging has advantages for acquiring two-dimensional images across a wide range of electromagnetic spectrums, providing more discriminative information. There are various domains where hyperspectral imaging has been used [1], [2], [3], [4], [5]. Due to the absorption of different wavelength bands by different biological tissues, hyperspectral imaging has tremendous promise for noninvasive illness detection and surgical guiding [6]. Because medical hyperspectral images (MHSIs) have hundreds of bands to examine, computer-aided algorithms are necessary to alleviate the load of diagnosis on physicians. More recently, deep learning-based algorithms have shown promise in fields as CAD of MHSIs [7], [8], [9], and 3DCNN is the most frequently used deep learning method for hyperspectral data due to its ability to utilize both spatial and spectral domains simultaneously [10], [11]. However, the performances of these data-hungry algorithms are limited for the difficulty to realize large-scale annotation of MHSIs.