1 Introduction
Single image super resolution aims at reconstructing/generating a high-resolution (HR) image from a low-resolution (LR) observation. Thanks to the strong learning capability, Convolutional Neural Networks (CNNs) have demonstrated superior performance [1], [2], [3], [4], [5], [6], [7], [8] to the conventional example-based [9], [10], [11], [12], [13] and interpolation-based [14], [15], [16] algorithms. Recent CNN-based methods can be divided into two groups. The first one regards SR as a reconstruction problem and adopts mean squared error (MSE) as the loss function to achieve high PSNR values. However, due to the conflict between the reconstruction accuracy and visual quality, they tend to produce overly smoothed/sharpened images. To favour better visual quality, the second group casts SR as an image generation problem [17]. By incorporating the perceptual loss [18], [19] and adversarial learning [17], these perceptual SR methods have the potential to generate realistic textures and details, thus attracted increasing attention in recent years.