I. Introduction
While computer vision models have been applied to different areas of research to this date, developing models that can perform well when perturbations are present in the input image is still a challenging quest. To use computer vision in practice, we need reliable models that are robust against major and minor image perturbations. Nevertheless, even the most commonly used models fail to perform well when minor perturbations in light, color, etc. are present in the input image [1]–[4]. Fig. 1 illustrates some examples that can easily fool a trained ResNet-50 [5] model with 89.06% in-domain accuracy. To push the boundaries of computer vision, more work still needs to be done to make models reliable enough to be used even in use cases, such as in autonomous vehicles, where a faulty decision can lead to disastrous consequences.