I. Introduction
High-resolution face images contain rich information crucial for many computer vision tasks, such as face swipe payment, criminal investigation, and identity auditing. However, due to the limitations of physical devices, environment, and imaging conditions in real-life scenarios, the obtained facial images are often of low resolution, making it difficult to achieve the expected results in face-related tasks [1], [2], [3], [4], [5]. Therefore, many vision tasks [6], [7], [8], [9], [10], [11], [12], [13] urgently need to infer credible and clear content from low-resolution face images.