I. Introduction
Deep learning models have exhibited remarkable performance in various domains, such as medical image analysis [1], [2], remote sensing image recognition [3], [4], denoising [5], and captioning [6], [7], [8]. However, Szegedy et al. [9] discovered that these high-performing models can be vulnerable to carefully crafted perturbations, leading to erroneous predictions. This vulnerability has sparked concerns about the robustness of artificial intelligence systems.