I. Introduction
Power transformers are the crucial equipment required for power transmission and conversion so that whether they function well has a considerable impact on the stability of the entire power system [1]. At present, a large majority of the transformers in operation are oil-immersed transformers. When a transformer fails, it produces various gases that dissolve in the transformer oil, including hydrogen (H2), methane (CH4), ethane (C2H6), ethylene (C2H4), acetylene (C2H2), and many more. There is a close correlation between the gas content in the oil and the type of transformer failure [2], [3]. For this reason, the gas concentration in oil can be detected by means of dissolved gas analysis (DGA) to determine the types of fault [4]. With the advancement of artificial intelligence, the integration of deep learning, machine learning, neural networks, and other intelligent algorithms has been widely practiced for the detection of faults in oil-immersed transformers, which has been shown to significantly improve the accuracy of fault identification [5], [6]. To ensure the high accuracy of prediction results, sufficient and balanced training data are required for the process of AI model fitting. However, this imbalance impairs the generalization performance of the training network significantly, thus leading to a significant deviation in the training model [7].