Performance and Adaptability Testing of Machine Learning Models for Power Transmission Network Fault Diagnosis With Renewable Energy Sources Integration

Schematic Illustration of Proposed Three-facet Analysis: (1) ML Based Conventional Power System Fault Diagnosis, (2) Impact Analysis of RES Integration on ML Models Perfo...

Society Section: IEEE Power & Energy Society Section

Abstract:

Numerous research works establish the high efficacy of Machine Learning (ML) based power system fault diagnosis over conventional analytical methods. The ongoing integration of renewable energy sources (RES) into existing transmission networks alters the system topology, potentially resulting in significant changes in fault signatures depending on the size of the newly added RES. However, there is a notable absence of studies in the literature analyzing the impact of new RES integration on the fault diagnosis performance of ML models. Therefore, to assess the fault classification and localization performance of potential ML models, this paper proposes to analyze two practical scenarios arising from new RES integrations: 1) when no fault data is available for the changed system and 2) when the changed system data becomes available over time. The proposed performance and adaptability testing of potential ML models has been conducted by optimally integrating different sizes of RES into the IEEE 9-Bus System. The integrated solar-based RES has been modeled incorporating standard temperature and irradiance variations. A diverse fault database was generated considering actual field variations of fault attributes. Impact analysis revealed significant degradation in the fault diagnosis performance of all tested models after RES integration. The adaptability testing was performed through extensive analysis of the learning trends of ML models with gradual data availability. The proposed Bayesian ridge regression emerged as the fastest learning model for locating transmission line faults, whereas the XGBoost, Extra Tree, and Random Forest classifiers gave comparable results for fault classification.

Published in: IEEE Access (Volume: 12)

Page(s): 94092 - 94115

Date of Publication: 08 July 2024

Electronic ISSN: 2169-3536

DOI: 10.1109/ACCESS.2024.3425057

CCBY - IEEE is not the copyright holder of this material. Please follow the instructions via https://creativecommons.org/licenses/by/4.0/ to obtain full-text articles and stipulations in the API documentation.

SECTION I.

Introduction

The power system consists of remotely located generating units whose bulk power is transferred through high-voltage transmission networks to city centers and distributed to consumers through low-voltage distribution networks [1]. Electric transmission networks are susceptible to different kinds of faults, which can occur at any location on the interconnected high-voltage lines, resulting in full or partial disconnection of the transmission lines. Therefore, fast and accurate identification of transmission line faults and their locations is of significant importance for preventing cascading failures that lead to blackouts and for ensuring the secure and uninterrupted transfer of bulk electric energy over long distances [2]. Heating due to high temperatures, insulation breakdown, sudden load changes, switching actions, and line breakage due to natural calamities can lead to permanent faults [3]. Once a permanent fault occurs, it is necessary to classify the fault (to analyze the extent of repair needed) and locate it for quick maintenance. There are five types of permanent faults in a power system: Single-line-to-ground (SLG), Double-line-to-ground (DLG), Line-to-line (LL), Line-to-line-to-line-to-ground (LLLG), and Line-to-line-to-line (LLL) faults. Traditionally, manual inspections or model-based methods, namely impedance-based and traveling wave-based methods, were used to locate faults in power systems [4], [5]. However, impedance-based methods require complex mathematical modeling, domain expertise, time, and several assumptions in their implementation, making them inaccurate and less reliable for modern power systems [6]. While traveling wave-based methods do not require complex mathematical modeling, they require manual operation or installation of costly devices at each end of transmission lines. Further, they are not suitable for lines with tapping, such as three-terminal lines or lines with inserted loads and sources [4], [7], [8].

The availability of synchronized data measurement devices and data loggers has paved the way for learning-based methods such as Machine Learning (ML) models for the fault diagnosis of power systems, leading to considerable research on data-driven power system monitoring, control, and maintenance [9]. Several works on ML-based power system fault detection, classification, and localization are reported in the literature. Various models, including Logistic Regression (LR), K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Neural Network (NN), Naïve Bayesian (NB), Decision Tree (DT), Gaussian Process Regression (GPR), and Ensemble methods have been proposed for power systems fault diagnosis [5].

LR has been used as a base classifier model for fault detection and classification [10]. KNN has found application as a classifier for fault detection in microgrids [11], fault classification in two-bus systems [12], fault classification in the IEEE 9 Bus System [13], and faulty zone detection in the IEEE 9 Bus System [14]. Many researchers have incorporated NB as a base classifier model for fault classification in two-bus systems [15] and microgrid fault detection and classification [16]. DT has been utilized as a base model for both fault detection and classification [17], [18], and for faulty zone detection [14] in two-bus systems. A regression tree-based approach was implemented in [19] for fault localization in the IEEE 39 Bus System. SVM, a prevalent classification and regression model in the literature, has been extensively used for power systems fault diagnosis. SVM was applied for high-impedance fault detection in the IEEE 13 Bus System [20] and distribution feeders [21]. SVM is found to be an excellent classifier for fault classification in different transmission networks [16], [22], [23], [24], [25]. Similarly, the works in [19], [26], [27], and [28] utilized SVM regression (SVR) for fault localization in various power networks. GPR has also been employed for fault localization [29], [30], [31]. Interest in ensemble methods is growing in the literature, with numerous works being published on one or more ensemble methods. Random Forest (RF) has been utilized for fault classification [19], classification and localization [32], and fault location detection [14]. Other ensemble models such as Bagging, Boosting, and AdaBoost are also present in the literature for fault classification in transmission and distribution networks [13], [33], and for classification and faulty branch identification [34]. Many studies on fault classification and faulty line/zone detection have demonstrated outstanding performance on the studied power system networks.
However, fewer studies on fault localization using ML regression are available in the literature. Moreover, power system fault data, comprising line currents and voltages measured at different buses or network nodes, exhibits some degree of correlation despite variations in fault attributes. In the literature, it has been observed that Bayesian Ridge regression is an ideal technique for datasets with multicollinearity [35]. Therefore, it has been proposed and compared with Extra Tree (ET) regressor and other potential ML regression models in this paper for the localization of transmission network faults. XGBoost is a new and advanced ML ensemble technique, while Bayesian ridge regression and ET have been utilized in literature in other fields for the past few years. However, they have not yet been applied to power system fault localization. Furthermore, the exceptional performance of RF in power system fault classification has motivated exploration into other potential ensemble techniques, such as ET and XGBoost, which have not been widely used in power system fault diagnosis. XGBoost has been noted as a fast ensemble technique due to multithreading parallel computing [36]. Compared with RF, ET has been found to perform equally well and has lower complexity [37].
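
The multicollinearity argument above can be made concrete with a small worked example. The sketch below is not the paper's model or data: it uses a hand-built toy dataset with two nearly identical features (standing in for correlated bus measurements) and a plain ridge penalty fixed at an assumed value of 1.0. Bayesian ridge regression goes one step further and infers the penalty strength from the data; that step is omitted here to keep the example in the standard library. The function name `fit_linear` and all numbers are illustrative assumptions.

```python
# Toy illustration (not the paper's model or data): two nearly identical
# features and a target y = x1 + x2 plus small noise that happens to align
# with the tiny difference between the features.

def fit_linear(x1, x2, y, penalty=0.0):
    """Solve (X^T X + penalty * I) w = X^T y for two features, no intercept."""
    s11 = sum(a * a for a in x1) + penalty
    s22 = sum(b * b for b in x2) + penalty
    s12 = sum(a * b for a, b in zip(x1, x2))
    b1 = sum(a * t for a, t in zip(x1, y))
    b2 = sum(b * t for b, t in zip(x2, y))
    det = s11 * s22 - s12 * s12          # near zero when features are collinear
    return ((b1 * s22 - s12 * b2) / det, (s11 * b2 - s12 * b1) / det)

x1 = [1.0, 2.0, 3.0, 4.0, 5.0]
diff = [0.01, -0.01, 0.01, -0.01, 0.01]  # tiny gap between the two features
x2 = [a + d for a, d in zip(x1, diff)]
noise = [10.0 * d for d in diff]         # noise magnitude is only 0.1
y = [a + b + e for a, b, e in zip(x1, x2, noise)]

w_ols = fit_linear(x1, x2, y)                 # ordinary least squares
w_ridge = fit_linear(x1, x2, y, penalty=1.0)  # fixed penalty (assumed value)

print("OLS coefficients:  ", w_ols)
print("Ridge coefficients:", w_ridge)
```

On this toy data the unpenalized fit lands near (-9, 11) even though the underlying relation is y = x1 + x2, while the penalized fit stays close to (1, 1). In practice a library implementation such as scikit-learn's `BayesianRidge` would replace the hand-picked penalty.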

The integration of solar and wind energy-based electrical power plants into distribution and transmission networks has been driven by the scarcity of conventional energy resources and their adverse environmental impacts [38]. These renewable energy plants/units can vary widely in capacity, ranging from small-scale units in kilowatts to large-scale installations in megawatts. Typically, large-scale renewable energy generating units are integrated at the transmission level, while smaller-scale units are connected to distribution networks. Over the past decades, numerous large-scale renewable energy sources (RES) have been integrated into grids worldwide [39], with many more expected to follow. This integration reduces reliance on conventional coal-based power plants, thereby reducing greenhouse gas emissions [40], [41], [42]. While RES-based power plants offer environmental benefits, their integration presents challenges to grid operations. The complexity of power systems increases with RES integration, heightening system vulnerability [42]. Ensuring stability and effective power management in RES integrated power networks, balancing supply and demand, and setting protection devices appropriately become particularly challenging. Although power flow in transmission lines is typically unidirectional, integrating RES at different locations introduces tapping points and enables bi-directional power flow in the lines [43]. Additionally, RES can feed fault currents during faults, which results in increased fault currents in lines [44]. Consequently, the signature of a fault occurring at a given location will change with the inclusion of a new RES unit, even if the fault attributes remain the same. However, only a few works have been reported in the literature on ML-based power system fault diagnosis with RES integration [45].
Existing works include Support Vector Data Description-based faulty region identification for distributed energy resources (DER) integration [46] and SLG fault detection for varying levels of DER penetrations in distribution networks [9]. A fault classification study has also been conducted for distribution networks with two distributed generation (DG) units using a convolutional neural network (CNN) [47]. Recently, a faulty line identification approach for RES integrated system utilizing a deep learning framework, with CNN layers for feature extraction from voltage and current waveforms, has been proposed [8].

Based on the literature survey above, it can be deduced that ML-based fault classifiers perform satisfactorily for conventional power system networks [5]. However, the performance of these classification schemes has not been sufficiently tested with RES integrated power systems. Furthermore, studies have yet to explore how the integration of a new RES into a conventional power network impacts the performance of these ML classifiers. Therefore, given the growing trend of RES integration into transmission networks [38], there is a pressing need for performance analysis of potential ML models post RES integration before their implementation in actual power systems. Additionally, it is noted that most fault localization schemes are aimed at identifying faulty lines or sections of transmission and distribution networks using classification approaches [14], [34]. However, pinpointing the exact fault location on a line using ML regressors holds greater value for expedited maintenance of transmission networks. Notably, there is no existing literature on identifying the precise fault location on transmission lines in RES integrated transmission systems using either ML-based classification or regression techniques. Moreover, the wide variability in power generation from RES plants due to weather conditions, particularly temperature and irradiance fluctuations throughout the day and year, directly impacts the power output from solar PV-based RES [48]. Consequently, the power fed into the grid fluctuates, leading to variations in fault current levels [49]. Therefore, it is imperative to consider temperature and irradiance variations while analyzing solar PV-based RES integrated transmission networks [50]. Nevertheless, existing literature on ML-based power system fault diagnosis overlooks these critical issues.

The integration of RES into an existing transmission network can significantly change the power system topology and fault characteristics, depending on the size of the added RES unit [43]. Large fluctuations in power generation from RES can result in substantial deviations of fault currents from their normal values [49], potentially leading to higher misclassification rates of ML models and significant errors in fault location estimation. However, to date, no reported study in the literature examines the performance of ML models for transmission line fault diagnosis following the integration of new RES of varying sizes. Therefore, further research is needed to investigate how ML models utilized for fault diagnosis in existing power systems are affected by the integration of new RES of diverse sizes. Additionally, the literature review suggests that only limited studies are available on ML-based fault diagnosis of RES integrated power systems. Furthermore, these studies assumed that RES had been integrated long ago and that sufficient fault data representing diverse fault variations were available for training ML models. However, this is not true when a new RES is integrated into a real-world power network. Transmission line faults statistically occur infrequently in real-world power systems [51]. Gathering diverse fault data, including various fault locations, types, and attributes, typically requires several years. Therefore, there is a very low probability of different types of faults occurring with significant variations in fault attributes shortly after RES integration. Hence, it is crucial to analyze the performance of ML models while considering these practical issues to ensure uninterrupted power transfer through lines, meeting current and future needs of power system protection and maintenance [43]. 
Therefore, this paper aims to fill this research gap by analyzing a power system in which the system topology changes due to the integration of RES of varying sizes, and fault data for the altered system is unavailable for ML-based transmission line fault classification and localization. An adaptability analysis of the considered ML models for power system fault diagnosis post RES integrations has also been conducted. This analysis will assist in selecting appropriate ML models based on their learning capabilities for the changed system topology under the practical condition of minimal fault data availability over time. The main objectives and contributions of the research work presented in this paper are as follows:

1) To analyze and compare the performance of XGBoost, Extra Tree, and Bayesian ridge regression with that of other potential ML models used to classify and locate power system faults, considering diverse fault attributes and the impact of temperature and irradiance on power generation from different sizes of solar PV RES.
2) To analyze and compare the impact of RES integration on the transmission line fault classification and localization performance of various ML models.
3) To test the adaptability of the ML models for fault classification and localization after RES integration, considering real-world scenarios in which fault data becomes available over time, in order to identify models capable of rapid learning from minimal samples of new fault data.

To achieve these objectives, fault data has been acquired for the standard IEEE 9 Bus system and for the IEEE 9 Bus system integrated with various sizes of RES. Subsequently, the proposed analyses were conducted using three practical power system scenarios. ML-based fault classification and localization performance has been analyzed and compared: first, for the conventional power system; second, after integrating RES, to assess the impact on the ML models; and third, through an adaptability analysis of the ML models that considers the availability of fault data for the changed system over time.
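
The three scenarios can be sketched as an evaluation loop. The example below is a minimal illustration under strong assumptions, not the paper's setup: each fault is reduced to a single synthetic feature, RES integration is mimicked by a constant shift of that feature, and the classifier is a toy 1-nearest-neighbor rule written by hand. The names `make_faults`, `predict_1nn`, and `accuracy`, the shift of 7.0, and the sample counts are all hypothetical.

```python
import random

def make_faults(n_per_class, shift, rng):
    """Synthetic 1-D fault signatures; `shift` mimics the effect of RES integration."""
    data = []
    for label, center in ((0, 0.0), (1, 10.0)):
        for _ in range(n_per_class):
            data.append((center + shift + rng.uniform(-0.5, 0.5), label))
    return data

def predict_1nn(train, x):
    """Return the label of the training sample whose feature is closest to x."""
    return min(train, key=lambda s: abs(s[0] - x))[1]

def accuracy(train, test):
    """Fraction of test samples classified correctly by the 1-NN rule."""
    return sum(predict_1nn(train, x) == y for x, y in test) / len(test)

rng = random.Random(0)
old_train = make_faults(50, shift=0.0, rng=rng)  # conventional system
old_test = make_faults(20, shift=0.0, rng=rng)
new_pool = make_faults(5, shift=7.0, rng=rng)    # scarce post-RES fault data
new_test = make_faults(20, shift=7.0, rng=rng)

# Scenario 1: train and test on the conventional system.
scenario1 = accuracy(old_train, old_test)

# Scenario 2: model trained on the old system, tested on the changed system.
scenario2 = accuracy(old_train, new_test)

# Scenario 3: retrain as new-system samples become available, k per class.
new_by_class = {c: [s for s in new_pool if s[1] == c] for c in (0, 1)}
scenario3 = {}
for k in (1, 3, 5):
    extra = new_by_class[0][:k] + new_by_class[1][:k]
    scenario3[k] = accuracy(old_train + extra, new_test)

print(scenario1, scenario2, scenario3)
```

With this construction the old model drops to chance level on the shifted data and recovers as soon as a sample of each class from the changed system is added. That immediate recovery is an artifact of the toy data; the point of the sketch is only the structure of the protocol, into which any classifier or regressor could be substituted.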

The remaining sections of the paper are organized as follows: Section II describes the test system and ML models used in the study. Section III presents the methodology adopted to conduct the proposed study and the scenarios under examination. Section IV comprehensively describes fault data generation for the standard IEEE 9 Bus system and the IEEE 9 Bus system with RES integrations. This section also explains the various fault attributes considered during the creation of the fault database. Section V presents the results and discussion of transmission line fault classification and localization conducted for all three scenarios. This section also presents the analysis of adaptability trends exhibited by machine learning classifiers and regressors in response to the incremental availability of fault data post RES integrations. Finally, the conclusions derived from this study are stated in Section VI.

SECTION II.

An Overview of the Test System and ML Models

This section describes the test system and the machine learning models chosen for the proposed analysis. It also discusses the rationale for selecting the transmission network and RES integration cases, which were considered for studying the impact and adaptability performance of the ML models. Additionally, this section provides an overview of the key features of the selected ML models, along with the values of the hyperparameters used for training the models in the proposed study.

A. IEEE 9 Bus System

Numerous ML-based fault diagnosis analyses in the power system literature, particularly fault localization studies, have been conducted on simple networks, such as two-bus transmission lines of varying voltage levels and lengths. However, standard IEEE transmission and distribution systems have been preferred as they represent real power systems. Several studies on power system fault diagnosis have utilized standard IEEE systems, including the IEEE 9 Bus [14], 14 Bus [52], 39 Bus [53], and 68 Bus [54] in the transmission network, and IEEE 4 Bus [55], 13 Bus [56], 33 Bus [57], and 34 Bus [58] in the distribution network. Furthermore, combined transmission and distribution networks such as IEEE 123 Bus [59] have been widely referenced. Generally, these studies were conducted on large systems, considering only two or three network lines. The standard IEEE 9 Bus System comprises six transmission lines. Therefore, instead of selecting one or two transmission lines from a larger transmission network, the IEEE 9 Bus system was chosen to facilitate the analysis of RES integration's impact on the complete network. The IEEE 9 Bus system comprises nine buses, three synchronous generating units, six transmission lines, three two-winding transformers, and three loads [60], [61]. The lengths of the six interconnected transmission lines are as follows: Line 4–5 is 116.798 km, Line 4–6 is 115.131 km, Line 7–5 is 211.955 km, Line 7–8 is 98.907 km, Line 8–9 is 138.603 km, and Line 9–6 is 235.579 km at a frequency of 50 Hz. The single-line representation of the IEEE 9 Bus system is shown in Figure 1.

FIGURE 1.

Single line diagram of IEEE 9 bus system.
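
For readers who want to work with the network numerically, the six line lengths listed above can be kept in a small lookup table. The snippet below is only a convenience sketch: the lengths are those quoted in this section, while the helper `fault_distance_km` and the choice to express a fault position as a fraction of the line length are assumptions made for illustration.

```python
# Line lengths in km, as quoted in this section for the IEEE 9 Bus system.
LINE_LENGTH_KM = {
    "4-5": 116.798,
    "4-6": 115.131,
    "7-5": 211.955,
    "7-8": 98.907,
    "8-9": 138.603,
    "9-6": 235.579,
}

def fault_distance_km(line, fraction):
    """Hypothetical helper: convert a fractional position in [0, 1] to km along the line."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must lie between 0 and 1")
    return LINE_LENGTH_KM[line] * fraction

# Total length of all six lines in the network.
total_km = sum(LINE_LENGTH_KM.values())

print(f"Total line length: {total_km:.3f} km")
print(f"Midpoint of Line 4-5: {fault_distance_km('4-5', 0.5):.3f} km")
```

A regression target for fault localization could be expressed either in kilometers or as such a fraction; which one the study uses is not stated in this section.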
