Loading [MathJax]/extensions/MathMenu.js
C2AD: Dual Consistency Learning for Zero-Shot Anomaly Detection | IEEE Conference Publication | IEEE Xplore

C2AD: Dual Consistency Learning for Zero-Shot Anomaly Detection


Abstract:

Zero-shot anomaly detection (ZSAD) is dedicated to detecting anomalies without having any seen normal or abnormal samples for the target set. Existing approaches utilize ...Show More

Abstract:

Zero-shot anomaly detection (ZSAD) is dedicated to detecting anomalies without having any seen normal or abnormal samples for the target set. Existing approaches utilize the pre-trained CLIP to assess normality/abnormality by exploiting the similarity between images and text with the frozen visual encoder. However, the frozen CLIP visual encoder impedes performance improvements. Additionally, their representations of anomalies are sensitive to contextual variations, leading to poor localization of unseen abnormalities. Therefore, this paper introduces the Dual Consistency Learning for Zero-Shot Anomaly Detection (C2AD), comprising two components: semantic and contextual consistency. Semantic consistency enhances generalization by maintaining correlational semantic consistency, while contextual consistency encourages representations to be robust to contextual changes. C2AD improves the model training without adding extra computational overhead during inference. Comprehensive experiments demonstrate that C2AD can boost the performance of ZSAD in anomaly detection and localization, achieving state-of-the-art results.
Date of Conference: 06-11 April 2025
Date Added to IEEE Xplore: 07 March 2025
ISBN Information:

ISSN Information:

Conference Location: Hyderabad, India

Funding Agency:


I. Introduction

Anomaly detection plays a crucial role in various industrial applications, ensuring product quality and system reliability [1] [2] [3] [4] [5]. Recently, zero-shot anomaly detection methods, such as those based on the CLIP (Contrastive Language-Image Pre-training) model [6] [7], have gained significant attention due to their ability to detect anomalies without requiring labeled anomaly data during training. However, directly applying CLIP to industrial anomaly detection tasks can be challenging [8] [9], as the model was trained on a vast dataset of natural images that may not fully capture the unique characteristics of industrial settings [10].

Contact IEEE to Subscribe

References

References is not available for this document.