1. Introduction
With the proliferation of generative modeling techniques, such as Generative Adversarial Networks (GANs) [24], accurately discerning which methods perform better has become a critical aspect of the field. For visual data, metrics such as Inception Score (IS) [59], Kernel Inception Distance (KID) [4], and the ubiquitously-used Fréchet Inception Distance (FID) [26] have become standard practice for developing and adopting models. Under the hood, these methods evaluate the discrepancy between generated and natural images in a deep feature space, in order to capture relevant statistics of the two distributions. After all, at its core, generative modeling involves learning and mimicking the high-order, complex statistics of visual data.
Downsampling a circle. We resize an input image (left) by a factor of 8, using different image processing libraries. The Lanczos, bicubic, and bilinear implementations in PIL (top row) adjust the antialiasing filter width by the downsampling factor. Other implementations (including those used by pytorch-fid and tensorflow-fid) use fixed filter widths, introducing aliasing artifacts and resembling naive nearest-neighbor subsampling. Aliasing artifacts induce inconsistencies in the calculation of downstream metrics such as Fréchet Inception Distance (FID) [26], KID [4], IS [59], and PPL [33]. Note that an antialias flag is available in TensorFlow 2, but it is set to false (its default value) for the FID calculation.
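The difference between factor-adaptive and fixed-width filtering can be illustrated with a minimal sketch. The example below is an assumption-laden stand-in: it uses a box prefilter whose width matches the downsampling factor (PIL's actual Lanczos/bicubic kernels are more sophisticated, but are likewise scaled by the factor), and compares it against naive nearest subsampling on a synthetic binary circle.

```python
import numpy as np

# Synthetic 256x256 binary circle (a stand-in for the paper's input image).
size, factor = 256, 8
yy, xx = np.mgrid[0:size, 0:size]
circle = ((xx - size / 2) ** 2 + (yy - size / 2) ** 2) < (size / 3) ** 2
img = circle.astype(np.float64) * 255

# Antialiased downsampling: average each (factor x factor) block, i.e. a
# box prefilter whose support grows with the downsampling factor.
blocks = img.reshape(size // factor, factor, size // factor, factor)
antialiased = blocks.mean(axis=(1, 3))

# Naive nearest subsampling: keep every 8th pixel with no prefilter,
# which is what a fixed (too-narrow) filter degenerates toward.
nearest = img[::factor, ::factor]

print(np.unique(nearest).size)      # 2: only {0, 255} survive (jagged edge)
print(np.unique(antialiased).size)  # > 2: intermediate gray edge values
```

The antialiased result renders the circle's boundary with smooth intermediate intensities, while nearest subsampling keeps only the original two values, producing the jagged, aliased edge that perturbs downstream feature statistics.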