I. Introduction
Software systems support daily work in many aspects of life. During their operation, however, performance faults such as slow response times are inevitable. Once such faults occur, they can significantly degrade a system's availability and reliability, resulting in financial losses. For example, according to a recent survey [1], the average cost per hour of server downtime is between $301,000 and $400,000. One way to minimize the losses caused by performance faults is to remediate them after they occur [2]–[7]. However, predicting and identifying potential risks before faults happen, and taking preventive measures, can avert service unavailability in the first place. Therefore, many engineers have conducted research in this area.