I. INTRODUCTION
The data imbalance is a widely existed and challenging problem in numerous real-world applications, such as medical diagnosis [1], fraud detection [2], fault detection [3] and others. The performance of most classification algorithms may significantly deteriorate under imbalanced data conditions, which results from the bias in the majority data class [4]. The common strategy to mitigate problems of this type is to resample the data before training models, including oversampling, under-sampling, and some kinds of combination of both. Among them, the work in paper [5] verifies that the oversampling techniques is the most effective way to handle the class imbalance for the Convolutional Neural Networks (CNN) model on image classification.