I. Introduction
Image quality assessment (IQA) is a long-standing challenge in computer vision, and it is vital to many image processing problems [1]–[11], including image acquisition, compression, enhancement, generation, and retrieval. In the past decades, a great number of IQA metrics have been proposed [12]–[27], which can be divided into full-reference (FR), reduced-reference (RR), and no-reference (NR) [1], [5]. Since reference images are not needed, NR-IQA, or called blind IQA (BIQA), has the widest applications in real-world scenarios. With the development of deep learning, nowaday’s BIQA metrics are putting more emphasis on authentic distortions. Although significant advances have been achieved, deep BIQA metrics are far from ideal in terms of both prediction accuracy and generalization ability.