I. Introduction
High-speed trains have become more and more important in traffic and transportation due to their high speed, stability, and convenience [1], [2]. However, a bullet train is a complex system consisting of numerous close-coupled components [3], [4], whose safety requirements are extremely high. Therefore, it is essential to diagnose faults for newly deployed trains in time based on massive monitoring data [5]. However, since the collected monitoring data of new trains are limited [6], [7], especially the faulty data are rare, it is difficult to develop an accurate enough fault diagnosis model [8]. By now, a proven solution is to identify the monitoring data online and absorb the new information into the model for updating it synchronously.