1. INTRODUCTION
As advancements in deepfake technologies lead to heightened realism in facial manipulation images [1], [2], the field of face forgery detection has become an increasingly captivating research area. Many existing methods primarily concentrate on within-dataset detection [3] –[5], where forgery methods are predefined and known in both the training and testing sets. Due to the continual emergence of forgery methods in practical scenarios, methods with high within-dataset detection accuracy usually encounter a significant performance degradation in cross-dataset scenario, thereby constraining their broader applicability. When a neural network is retrained to learn from a new dataset, it tends to forget the knowledge it acquired during the initial training on the previous task, a phenomenon referred to as the catastrophic forgetting problem [6].