I. Introduction
Recent advances in deep learning have achieved remarkable successes in various challenging tasks, including computer vision [1]–[3], natural language processing [4], [5] and speech [6], [7]. In practice, deep learning has been routinely applied on large-scale datasets containing data collected from daily life, which inevitably contain large amounts of noise including adversarial examples and corruption [8], [9]. Unfortunately, while such noise is imperceptible to human beings, it is highly misleading to deep neural networks, which presents potential security threats for practical machine learning applications in both the digital and physical world [10]–[14].