I. Introduction
In the field of visual recognition, the data distributions of the training data and the testing data are usually quite different, in which the training set (resp., the testing set) is referred to as the source domain (resp., the target domain). Recently, abundant domain adaptation approaches [1]–[14] were proposed to reduce the data distribution mismatch between the source domain and the target domain explicitly. Nevertheless, the target domain samples are often unavailable during the training procedure and this problem is named domain generalization [15]. In comparison with domain adaptation, domain generalization aims to learn robust classifiers that can generalize well to arbitrary target domain. More recently, several domain generalization approaches [15]–[18] were also developed to enhance the generalization capability of the classifiers learnt on the source domain. For more details about domain generalization and adaptation, please refer to Section II.