I. Introduction
Transfer learning aims to store a solution model of an existing problem and apply it to other related but different problems [1] –[3]. According to the advantages of transfer learning, many researchers have proposed the transfer fault diagnosis methods for rolling bearings. With this method, the diagnosis model learned from one kind of labeled fault dataset of rolling bearings (source domain dataset) can be applied to another related but different unlabeled fault dataset of rolling bearings (target domain dataset). And this method breaks the barrier that different domain datasets have different probability distribution by the way of domain alignment [4–5]. Many researchers have proposed different transfer diagnosis model for rolling bearings based on above ideas. For example, Li et al. [6] proposed an adversarial transfer fault diagnosis method based on fault knowledge mapping for rolling bearing to achieve across domain fault diagnosis. Wu et al. [7] proposed an adaptive depth transfer method to achieve domain alignment by introducing long-short term memory recurrent neural networks and joint distribution adaptation methods. Lei et al. [8] proposed a transfer fault diagnosis method by introducing the polynomial kernel maximum mean difference method to improve the performance of domain adaptation.