I. Introduction
Single image super-resolution (SR), the task of increasing the spatial resolution and details of a given image, has attracted great attention from the computer vision research communities in the last few decades [1], [2]. In recent years, the state of the art of SR has been set and reset by various deep convolutional neural network (DCNN) based techniques [3]–[5]. In addition to the significant improvement in SR performance, these DCNN techniques also provide some important insights on the designs of DCNN architec- tures [6]–[9] and loss functions [4], [10]–[12]. However, for many real-world problems, the efficacy of a machine learning technique relies not only on the design of the technique itself but also, sometimes even more critically, on the quality and representativeness of the training data.