Abstract:
We present a novel framework for generating adversarial benchmarks to evaluate the robustness of image classification models. Our framework lets users customize the types of distortions that are optimally applied to images, so the benchmark can target the distortions most relevant to a given deployment. The benchmark can generate datasets at various distortion levels to assess the robustness of different image classifiers. Our results show that adversarial samples generated by our framework against any of several image classification models, such as ResNet-50, Inception-V3, and VGG16, are effective and transfer to other models, causing them to fail. These failures persist even when the models are adversarially retrained using state-of-the-art techniques, demonstrating the generalizability of our adversarial samples. We achieve net L2 distortion competitive with state-of-the-art benchmark techniques on CIFAR-10 and ImageNet; however, our framework achieves these results with simple distortions such as Gaussian noise, without introducing unnatural artifacts or color bleeds. This is made possible by a model-based reinforcement learning (RL) agent and a technique that reduces a deep tree search over the image for model sensitivity to perturbations to a one-level analysis and action. The flexibility of choosing distortions and setting classification probability thresholds for multiple classes makes our framework suitable for algorithmic audits.
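To make the distortion-level benchmarking idea concrete, the sketch below shows one way such a sweep could look: Gaussian noise is applied at increasing levels, net L2 distortion is tracked, and the first noise level at which a classifier's confidence in the true class drops below a chosen probability threshold is reported. This is an illustrative sketch only, not the paper's implementation; the `classify` callable, the `sigmas` schedule, and the `threshold` value are all assumptions introduced here.

```python
import numpy as np

def l2_distortion(x: np.ndarray, x_adv: np.ndarray) -> float:
    """Net L2 distortion between the clean and perturbed image."""
    return float(np.linalg.norm((x_adv - x).ravel()))

def gaussian_benchmark(x: np.ndarray,
                       classify,            # assumed black box: image -> class-probability vector
                       true_label: int,
                       sigmas=(0.01, 0.02, 0.05, 0.1),  # assumed distortion schedule
                       threshold: float = 0.5,          # assumed probability threshold
                       seed: int = 0):
    """Sweep Gaussian-noise levels; return the first level at which the
    classifier's confidence in the true class falls below `threshold`,
    along with the resulting net L2 distortion and confidence."""
    rng = np.random.default_rng(seed)
    for sigma in sigmas:
        # Perturb with zero-mean Gaussian noise and keep pixels in [0, 1].
        x_adv = np.clip(x + rng.normal(0.0, sigma, size=x.shape), 0.0, 1.0)
        p_true = classify(x_adv)[true_label]
        if p_true < threshold:
            return sigma, l2_distortion(x, x_adv), p_true
    return None  # classifier stayed above threshold at every tested level

# Usage with a hypothetical stand-in classifier (10 classes, uniform output):
dummy = lambda img: np.full(10, 0.1)
result = gaussian_benchmark(np.random.rand(32, 32, 3), dummy, true_label=3)
```

The paper's actual framework selects distortions and levels via a model-based RL agent rather than a fixed schedule; the fixed `sigmas` sweep here only illustrates the benchmark-generation interface that the abstract describes.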
Date of Conference: 03-08 January 2024