I. Introduction
Due to the production of massive remote sensing data, many image interpretation tasks have developed rapidly in recent years, such as image classification, image retrieval, and semantic segmentation [1]–[3]. Among them, high-resolution remote sensing (HRRS) image scene classification has also become an active topic, and it aims at refining semantic-level information and allocating a class label to the given image [4]–[6].