I. Introduction
Face is of central importance for human identity recognition. The low-resolution (LR) face images captured by camera sensors would largely degrade the corresponding identity information. Face super-resolution (SR) aims to estimate high-resolution (HR) face images from LR ones, to improve the image quality and performance of subsequent identity recognition tasks [1], [2], [3]. This task is very challenging upon complex real-world scenarios, where the degradation kernel is usually unknown. Traditional face SR methods can be roughly divided into local patch-based methods [4], [5], [6], global image-based methods [7], [8], [9], and hybrid methods taking advantage of global image consistency and local patch sparsity [10], [11], [12], [13]. However, these hand-crafted methods could hardly achieve satisfactory results upon diverse real-world scenarios [14].