1. Introduction
Generative models that produce high-quality face images, such as human avatar generation, face super-resolution, and face swapping, require high-resolution, high-quality training data sets. Low-resolution images neg-atively affect the resulting models' output. It is therefore crucial to identify and remove these images prior to training. This seemingly simple task is challenging, however, since high-resolution does not always imply high-quality, so users must often resort to visual inspection.