I. Introduction
With the continuous development in both hardware reliability and control strategy, UAVs have been widely employed in more and more practical applications [1], such as atmosphere monitoring [2], tracking and surveillance [3], and land cover and land use monitoring [4]. However, limited by the physical characteristics of the UAV such as the flight altitude and load capacity, sometimes, it is impractical to continuously obtain HR images through UAV, especially for those applications that need large-scale and long-duration UAV video data [5]. Super-resolution is a promising solution to alleviate this dilemma, which can reconstruct HR images from LR observations [6]. In fact, the super-resolution technology particularly the deep-learning-based methods has already been widely applied for the processing of various kinds of images, such as natural images [7], [8], medical images [9], [10], and remote sensing images [11], [12], [13]. These works can not only enhance the image quality but also can further facilitate downstream applications [14], [15].