Journals & Magazines >IEEE Transactions on Power Sy... >Volume: 33 Issue: 5

Wide-Area Monitoring of Power Systems Using Principal Component Analysis and $k$ -Nearest Neighbor Analysis

Abstract:

Wide-area monitoring of power systems is important for system security and stability. It involves the detection and localization of power system disturbances. However, th...Show More

Metadata

Abstract:

Wide-area monitoring of power systems is important for system security and stability. It involves the detection and localization of power system disturbances. However, the oscillatory trends and noise in electrical measurements often mask disturbances, making wide-area monitoring a challenging task. This paper presents a wide-area monitoring method to detect and locate power system disturbances by combining multivariate analysis known as Principal Component Analysis (PCA) and time series analysis known as

$k$ -Nearest Neighbor (

$k{\text{NN}}$ ) analysis. Advantages of this method are that it can not only analyze a large number of wide-area variables in real time but also can reduce the masking effect of the oscillatory trends and noise on disturbances. Case studies conducted on data from a four-variable numerical model and the New England power system model demonstrate the effectiveness of this method.

Published in: IEEE Transactions on Power Systems ( Volume: 33, Issue: 5, September 2018)

Page(s): 4913 - 4923

Date of Publication: 30 January 2018

ISSN Information:

DOI: 10.1109/TPWRS.2017.2783242

Funding Agency:

Contents

CCBY - IEEE is not the copyright holder of this material. Please follow the instructions via https://creativecommons.org/licenses/by/4.0/ to obtain full-text articles and stipulations in the API documentation.

Nomenclature

$A{I_{{T^2}}}$	Monitoring statistic built by applying $k{\text{NN}}$ on ${T^2}$.
$AI_{{T^2}}^\alpha $	Detection threshold with confidence level $\alpha $ for $A{I_{{T^2}}}$.
$AI_{{T^2},p}^ \circ $	$p$th value of $A{I_{{T^2}}}$ calculated online.
$A{I_Q}$	Monitoring statistic built by applying $k{\text{NN}}$ on $Q$.
$AI_Q^\alpha $	Detection threshold with confidence level $\alpha $ for $A{I_Q}$.
$A{I_{Q,r}}$	$r$th value of $A{I_Q}$ calculated offline.
$AI_{Q,p}^ \circ $	$p$th value of $A{I_Q}$ calculated online.
$a$	Number of principal components.
$\boldsymbol{C}$	Covariance matrix of normalized variables.
${{\bf con}}_{A{I_Q},p}^ \circ $	Vector of contributions of variables to $A{I_Q}$ at the $p$th sampling time point online.
${{\bf con}}_{A{I_{{T^2}}},\ p}^ \circ $	Vector of contributions of variables to $A{I_{{T^2}}}$ at the $p$th sampling time point online.
${\rm{CPV}}(a)$	Ratio percentage of sum of ${\lambda _1},{\lambda _2}, \cdots,{\lambda _a}$ over sum of ${\lambda _1},{\lambda _2}, \cdots,{\lambda _m}$.
${D^2}$	Square of Euclidean distance between two windows.
$d$	Derivative operator.
$\boldsymbol{e}$	Vector of residual variables obtained by PCA.
${e_i}$	$i$th residual variable.
$g$	Number for a data window formulated offline.
$\boldsymbol{h}$	Vector of principal components obtained by PCA.
${h_i}$	$i$th principal component.
${{{\bf I}}_m}$	Identity matrix with dimension as $m \times m$.
$k$	Parameter for $k{\text{NN}}$.
$L$	Length of data window.
$l$	Temporary variable for counting from 1 to $L$.
$m$	Number of measured variables.
$N$	Size of modelling dataset.
$n$	Sampling time point for offline data.
$p$	Sampling time point for online data.
$Q$	Squared Prediction Error (SPE) statistic calculated based on residual variables.
${Q_n}$	$n$th value of $Q$ calculated offline.
$Q_p^ \circ $	$p$th value of $Q$ calculated online.
$r$	Number for a data window formulated offline, and $r \ne g$.
${{\rm{r}}_1},{{\rm{r}}_2}$	Specific values of $r$ (constants).
${s_i}$	$i$th sinusoidal signal.
${s_{i,t}}$	Value of ${s_i}$ at the time of $t$.
${T^2}$	Hotelling's statistic calculated based on principal components.
${T^2}_n$	$n$th value of ${T^2}$ calculated offline.
${T^2}_p^ \circ $	$p$th value of ${T^2}$ calculated online.
$t$	Continuous time.
$\boldsymbol{U}$	Matrix with columns as ${\boldsymbol{u}_1}\ {\boldsymbol{u}_2}\ \cdots \ {\boldsymbol{u}_m}$.
${\boldsymbol{U}_{1:a}}$	Matrix with columns as ${\boldsymbol{u}_1}\ {\boldsymbol{u}_2}\ \cdots \ {\boldsymbol{u}_a}$.
${\boldsymbol{u}_i}$	$i$th eigenvector of $\boldsymbol{C}$.
$\boldsymbol{x}$	Vector of measured variables.
${\boldsymbol{x}_n}$	$n$th vector value of $\boldsymbol{x}$ for offline modelling.
${x_i}$	$i$th measured variable.
${x_{i,n}}$	$n$th value of ${x_i}$ for offline modelling.
${x_{i,t}}$	Value of ${x_i}$ at the time of $t$.
$\tilde{\boldsymbol{x}}$	Vector of normalized variables.
${\tilde{\boldsymbol{x}}_n}$	$n$th vector value of $\tilde{\boldsymbol{x}}$ calculated offline.
$\tilde{\boldsymbol{x}}_p^ \circ $	$p$th vector value of $\tilde{\boldsymbol{x}}$ calculated online.
${\tilde{x}_i}$	$i$th normalized variable.
${\tilde{x}_{i,n}}$	$n$th value of ${\tilde{x}_i}$ calculated offline.
$\boldsymbol{Z}$	Embedding matrix of $Q$ formulated offline.
${\boldsymbol{z}_r}$	$r$th data window formulated offline ( $r$th row of $\boldsymbol{Z}$).
${\boldsymbol{z}_g}$	$g$th data window formulated offline ( $g$th row of $\boldsymbol{Z}$).
$\boldsymbol{z}_p^ \circ $	$p$th data window formulated online.
$\alpha $	Confidence level for detection thresholds.
$\boldsymbol{\Lambda }$	Diagonal matrix with diagonal elements as ${\lambda _1},{\lambda _2}, \cdots,{\lambda _m}$.
${\lambda _i}$	$i$th eigenvalue of $\boldsymbol{C}$.
$\boldsymbol{\Omega }$	Diagonal matrix with diagonal elements as $\lambda _1^{ - 1},\lambda _2^{ - 1}, \cdots,\lambda _a^{ - 1}$.

SECTION I.

Introduction

WIDE-AREA monitoring of power systems plays a crucial role in understanding the system behavior and improving the system operating stability margin. It usually places much emphasis on the detection and localization of disturbances, because disturbances pose an increasingly severe threat to the system security and stability [1].

Generally, disturbances deteriorate the system health by making a power system deviate from the normal operating status. With more and more advanced measuring devices such as Phasor Measurement Units (PMUs) spreading across power systems, abundant measurements containing the information of the system operating status are available for analysis. How to extract such information from the measured data for disturbance detection and localization is an important issue for power system researchers [2]. Generally, the existing data-driven methods can be divided into three categories according to the applications: (1) for the protection of power system equipment, e.g., the wavelet coefficient energy based method [3] and the hidden Markov model based method [4]; (2) for the analysis of power quality especially the waveform of alternate voltage, e.g., the Hilbert-Huang transform based method [5] and the power quality state estimation based method [6]; (3) for the assessment of the system security and stability, typically by multivariate statistical analysis based methods [7]–[10].

Usually, the first two categories of methods take a univariate approach to analyze electrical variables separately. In contrast, the third category of methods use a multivariate approach to handle variables together, particularly suitable for wide-area monitoring of power systems where many variables need to be analyzed simultaneously. This work focuses on the latter.

Principal Component Analysis (PCA), one of the classical multivariate statistical analysis techniques, is well-known for its capability of compressing high-dimensional and correlated data without significant loss of information. It obtains Principal Components (PCs) that are uncorrelated and Residual Variables (RVs) by projecting physical variables onto a low-dimensional subspace that retains most of the variances of the projected variables [11]. To measure the variation of PCs within the PCA model and the variation of RVs not accounted for by the PCA model, two popular monitoring statistics were used respectively, that is, the Hotelling's ${T^2}$ statistic calculated as the sum of the squares of normalized PCs and the companion Squared Prediction Error (SPE or $Q$) statistic calculated as the sum of the squares of RVs [11]. The PCA model together with the ${T^2}$ and $Q$ statistics, known as the PCA-based statistical monitoring method, have been widely applied for process monitoring in the chemical industry [11].

In 2013, Barocio et al. [7] introduced the PCA-based statistical monitoring method for the detection and visualization of power system disturbances and discussed its potential for wide-area monitoring of power systems. Subsequently, Liu et al. [8] focused on the geometric interpretation of ${T^2}$ and $Q$ , and showed that by using frequency measurements ${T^2}$ detects generation mismatch events and $Q$ detects islanding events. Recently, Rafferty et al. [9] considered the changing nature of frequency in a power system and developed a moving window PCA based statistical monitoring method updating the PCA model as well as ${T^2}$ and $Q$ after obtaining a new window of frequency measurements. Although the existing works have led to some success in wide-area monitoring of power systems, one issue that affects the monitoring has not been considered.

Specifically, the above works require the amplitude of electrical measurements recorded before and after disturbances to be markedly different so that the amplitude of the ${T^2}$ values and that of the $Q$ values calculated before and after disturbances can also be distinct and thus can be made use of to detect disturbances at the system-wide level. In practice, such a requirement cannot be met all the time, especially for the cases in power systems where electrical measurements often have oscillatory trends and noise [12], [13]. As exemplified in [14], the oscillatory trends and noise in measurements often mask disturbances, making the difference in the amplitude of measurements recorded before and after disturbances not distinguishable. As a result, there is also not much difference in the amplitude of the ${T^2}$ values and that of the $Q$ values calculated before and after disturbances, and therefore it is difficult for ${T^2}$ and $Q$ to detect disturbances using electrical measurements with oscillatory trends and noise.

$k$-Nearest Neighbor $(k{\text{NN)}}$ analysis is a time series analysis method for the detection of anomalous data windows [15]–[18]. As stated in [14], $k{\text{NN}}$ does not require the amplitude of measurements recorded before and after anomalies to be distinct while detecting anomalies. In a recent paper of the authors [19], $k{\text{NN}}$ was introduced and adapted for real-time detection of power system disturbances. However, the method presented in [19] operates in a univariate manner to analyze variables separately and the online computational burden increases with the number of variables increasing.

Against this background, the motivation of this work is to integrate $k{\text{NN}}$ with the PCA-based statistical monitoring method in order that a large number of variables can be analyzed in real time for wide-area monitoring of power systems, and at the same time, the masking effect of the oscillatory trends and noise in electrical measurements on disturbances can be reduced. More specifically, $k{\text{NN}}$ is applied on ${T^2}$ and $Q$ to obtain two new monitoring statistics for detecting disturbances. This paper will show that a $k{\text{NN}}$ analysis in real time of ${T^2}$ and $Q$ leads to more rapid detection of disturbances. The real-time implementation of $k{\text{NN}}$ is achieved by building a recursive calculation strategy for the distance measure of $k{\text{NN}}$ and a fast selection strategy for the $k$th smallest distance value. Finally, disturbance localization is performed by developing a contribution plot strategy which can quantify the contributions of variables to the new monitoring statistics. Case studies conducted on a four-variable numerical model and the New England power system model are used to demonstrate the effectiveness of the proposed method. It is worth noting that the proposed method is not relevant to protective relays since they fall into different categories, as stated previously.

The paper is organized as follows. Section II gives a brief description of wide-area monitoring based on PCA. Section III presents the wide-area monitoring method based on PCA and $k{\text{NN}}$. The application results and analysis of the two case studies are provided in Section IV. Discussions about the proposed method are given in Section V, while our conclusions are drawn in Section VI.

The following notational conventions are used throughout this contribution. Boldface capital and lower-case letters stand for matrices and column vectors respectively, while $\mathbb{R}$ denotes the field of real numbers. The transpose and inverse operators are denoted by ${(\cdot)^{\rm{T}}}$ and ${(\cdot)^{ - 1}}$ respectively.

SECTION II.

Wide-Area Monitoring Based on PCA

In this section, wide-area monitoring based on PCA [7]–[9], referred to as WAM-PCA here, is briefly introduced.

The symbol ${\boldsymbol{x}^{\rm{T}}} = [ {{x_1}\ {x_2}\ \cdots \ {x_m}\ } ]\ $ denotes a vector of $m$ electrical variables measured for monitoring, e.g., frequency, voltage amplitude, active power, reactive power. Historical measurements from the ambient condition are used to form the modelling data $\{ {{\boldsymbol{x}_n}} \}_{n\ = 1}^N$, where $N$ denotes the dataset size and ${\boldsymbol{x}_n}$ denotes the $n$th vector value of $\boldsymbol{x}$. In what follows, PCA is used to analyze the measured variables together and to obtain PCs and RVs through multivariate analysis.

Firstly, the variables in the vector $\boldsymbol{x}$ are normalized with the sample means and sample variances calculated from $\{ {{\boldsymbol{x}_n}} \}_{n\ = 1}^N$ to make the obtained variables independent of their engineering units. The symbol ${\tilde{\boldsymbol{x}}^{\rm{T}}} = [ {{{\tilde{x}}_1}\ {{\tilde{x}}_2}\ \cdots \ {{\tilde{x}}_m}} ]\ $ denotes a vector of the $m$ normalized variables. The covariance matrix of ${\tilde{\boldsymbol{x}}^{\rm{T}}}$ can be estimated based on the normalized data $\{ {\tilde{\boldsymbol{ x}_n}} \}_{n\ = 1}^N$ and the eigenvalue decomposition of $\boldsymbol{C}$ can be implemented as: \begin{equation} \boldsymbol{C} \ = \frac{1}{{N-1}}\ \ \sum\limits_{n= 1}^N {{{\tilde{\boldsymbol{x}}}_n}{{\tilde{\boldsymbol{x}}}_n}^{\rm{T}}} = \ \boldsymbol{U}{\Lambda \ }{\boldsymbol{U}^{\rm{T}}} = \sum\limits_{i= 1}^m {{\lambda _i}{\boldsymbol{u}_i}{\boldsymbol{u}_i}^{\rm{T}}} \end{equation}View Sourcewhere $\boldsymbol{\Lambda } \in {\mathbb{R}^{m \times m}}$ is a diagonal matrix with diagonal elements as the eigenvalues ${\lambda _1},{\lambda _2}, \cdots,{\lambda _m}$ of $\boldsymbol{C}$ in the descending order, while $\boldsymbol{U} \in {\mathbb{R}^{m \times m}}$ is the eigenvector matrix with the column vectors as the eigenvectors ${\boldsymbol{u}_1} \ {\boldsymbol{u}_2} \ \cdots \ {\boldsymbol{u}_m}$ of $\boldsymbol{C}$.

Then, a vector of PCs can be obtained by: \begin{equation} {\boldsymbol{h}^{\rm{T}}} = \left[ {{h_1} \ {h_2} \ \cdots \ {h_a}} \right] \ = {\left({{\boldsymbol{U}_{1:a}}^{\rm{T}}\tilde{\boldsymbol{x}}} \right)^{\rm{T}}} \end{equation}View Sourcewhere $a$ is the number of PCs satisfying $a < m$, and ${\boldsymbol{U}_{1:a}}^{\rm{T}} = {[ {{\boldsymbol{u}_1}\ {\boldsymbol{u}_2}\ \cdots \ {\boldsymbol{u}_a}} ]^{\rm{T}}} \in {\mathbb{R}^{a \times m}}$ is called loading matrix. The sample covariance matrix of PCs is a diagonal matrix with the diagonal elements as ${\lambda _1}\ {\lambda _2}\ \cdots \ {\lambda _a}$.

Concurrently, a vector of RVs can be obtained by: \begin{equation} {\boldsymbol{e}^{\rm{T}}} = \left[ {{e_1} \ {e_2} \ \cdots \ {e_m}} \right] \ = {\left({\tilde{\boldsymbol{x}} - {\boldsymbol{U}_{1:a}}{\boldsymbol{U}_{1:a}}^{\rm{T}}\tilde{\boldsymbol{x}}} \right)^{\rm{T}}} \end{equation}View Source

The variation of PCs within the PCA model can be measured by the ${T^2}$ statistic: \begin{equation} {T^2} = {\boldsymbol{h}^{\rm{T}}} \ \boldsymbol{\Omega h} \ = \sum\limits_{i= 1}^a {{{\left({{h_i}/\sqrt {{\lambda _i}} } \right)}^2}} \end{equation}View Sourcewhere $\boldsymbol{\Omega } \in {\mathbb{R}^{a \times a}}$ is a diagonal matrix with the diagonal elements as $\lambda _1^{ - 1},\lambda _2^{ - 1}, \cdots,\lambda _a^{ - 1}$.

Moreover, the variation of RVs not accounted for by the PCA model can be measured by the $Q$ statistic: \begin{equation} Q\ = {\boldsymbol{e}^{\rm{T}}} \ \boldsymbol{e} \ = \sum\limits_{i= 1}^m {e_i^2} \end{equation}View Source

SECTION III.

Wide-Area Monitoring Based on PCA and KNN

Both ${T^2}$ and $Q$ make use of the difference in their amplitude before and after disturbances for disturbance detection, requiring the amplitude of electrical measurements recorded before and after disturbances to be distinct. However, the oscillatory trends and noise in electrical measurements often have a masking effect on disturbances, making it difficult to satisfy this requirement. Thus, the detection performance of ${T^2}$ and $Q$ will be adversely affected. In this section, $k{\text{NN}}$ is introduced and applied on ${T^2}$ and $Q$ to build two new monitoring statistics for improving the detection performance. The reason why $k{\text{NN}}$ gives the improvement is because $k{\text{NN}}$ does not require the amplitude of a time series before and after disturbances to be distinct [14]. Then, disturbance localization is performed by quantifying the contributions of variables to the new monitoring statistics. Both disturbance detection and localization constitute the subject of wide-area monitoring based on PCA and $k{\text{NN}}$, referred to as WAM-PCA kNN here. In the following, WAM-PCAkNN is presented in detail.

A. Disturbance Detection of WAM-PCAkNN

$k{\text{NN}}$ adopts a certain type of distance measure to assess the similarity of two data windows in a time series, where a data window refers to a segment of data with the fixed length. Data windows with similar sequences of samples are called near neighbors. The similarity assessment is achieved by defining an Anomaly Index (AI) for each data window. Following the definition of AI in [14]–[16], this paper uses the distance of a data window to its $k$th nearest neighbor as AI of that data window. Anomalous data windows are those distinct from the underlying trend of the time series and the AI value for an anomalous data window will be much higher than that of any normal data window, which is the reason why $k{\text{NN}}$ can be used for anomaly detection. A common distance measure to assess the similarity between data windows is Euclidean Distance (ED) [14]–[19], which can be written as: \begin{equation} D\left({\boldsymbol{\varphi },\phi } \right) \buildrel \Delta \over = \sqrt {\sum\nolimits_{j= 1}^L {{{\left({{\varphi _j} - {\phi _j}} \right)}^2}} } \geq 0 \end{equation}View Sourcewhere ${\boldsymbol{\varphi }^{\rm{T}}} = [ {{\varphi _1}\ {\varphi _2}\ \cdots \ {\varphi _L}} ]\ $ and ${\phi ^{\rm{T}}} = [ {{\phi _1}\ {\phi _2}\ \cdots \ {\phi _L}} ]\ $ denote two data windows with $L$ measurements in each one, $D\ ({\boldsymbol{\varphi },\phi }) = \ 0$ indicates the maximum similarity.

This paper also uses ED to assess the similarity of two data windows. The reason why ED is used here instead of other types of distance measures such as Mahalanobis Distance (MD) is because the calculation of ED is much simpler which can facilitate the recursive calculation for the online detection.

If the ${T^2}$ or $Q$ values obtained by ( 4) or (5) are viewed as a time series, the detection of power system disturbances can be achieved by detecting anomalous windows in this time series. Without loss of generality, $Q$ is taken to illustrate the detection process, which also applies to ${T^2}$ . The detection process includes: 1) the offline modelling; 2) the online detection.

1) The Offline Modelling

The offline modelling calculates a sequence of the AI values by using $k{\text{NN}}$ to analyze the $Q$ values. It then calculates a detection threshold based on the obtained AI values for determining whether disturbances occur or not.

Specifically, based on the modelling data $\{ {{\boldsymbol{x}_n}} \}_{n\ = 1}^N$, the $Q$ values $\{ {{Q_n}} \}_{n\ = \ 1}^N$ are calculated by (5) and a matrix $\boldsymbol{Z}$ is built as: \begin{equation} \boldsymbol{Z} \ = \left[ {\begin{array}{c} {{\boldsymbol{z}_1}^{\rm{T}}}\\ {{\boldsymbol{z}_2}^{\rm{T}}}\\ \vdots \\ {{\boldsymbol{z}_{N - L + 1}}^{\rm{T}}} \end{array}} \right]\ = \left[ {\begin{array}{cccc} {{Q_1}}&{{Q_2}}& \cdots &{{Q_L}}\\ {{Q_2}}&{{Q_3}}& \cdots &{{Q_{L+ 1}}}\\ \vdots & \vdots & \cdots & \vdots \\ {{Q_{N - L + 1}}}&{{Q_{N - L + 2}}}& \cdots &{{Q_N}} \end{array}} \right] \end{equation}View Sourcewhere $\boldsymbol{Z}$ is the embedding matrix of $Q$ , its row ${\boldsymbol{z}_r}^{\rm{T}}$ denotes the $r$th data window of $\{ {{Q_n}} \}_{n\ = \ 1}^N$, and $L$ denotes the window length. Two rows can be compared by the Square of ED (SED) as: \begin{equation} {D^2}\ \left({{\boldsymbol{z}_g},{\boldsymbol{z}_r}} \right) = \sum\limits_{l= 1}^L {{{\left({{Q_{g - l + L}} - {Q_{r - l + L}}} \right)}^2}} \end{equation}View Source

The reason for using SED instead of directly using ED is due to the consideration of the calculation efficiency. This can be observed later in Section III-A2. For the $r$th row ${\boldsymbol{z}_r}^{\rm{T}}$, its AI value is calculated as the $k$th smallest SED value between it and all other rows except its near-in-time rows. The near-in-time rows of ${\boldsymbol{z}_r}^{\rm{T}}$ are those having at least one sample in common with ${\boldsymbol{z}_r}^{\rm{T}}$, e.g., ${\boldsymbol{z}_L}^{\rm{T}}$ is the last near-in-time row of ${\boldsymbol{z}_1}^{\rm{T}}$. The exclusion of the SED values between ${\boldsymbol{z}_r}^{\rm{T}}$ and its near-in-time rows during the calculation of AI is to avoid treating such near-in-time rows as near neighbors of ${\boldsymbol{z}_r}^{\rm{T}}$.

When all rows of $\boldsymbol{Z}$ obtain their corresponding AI values, a threshold is needed for the online detection. Based on the sequence of the obtained AI values $\{ {A{I_{Q,r}}} \}_{r\ = \ 1}^{N - L + 1}$ where $A{I_Q}$ denotes the new monitoring statistic built by applying $k{\text{NN}}$ on the $Q$ statistic and $A{I_{Q,r}}$ denotes the $r$th value of $A{I_Q}$, a threshold $AI_Q^\alpha $ with the confidence level $\alpha $ can be calculated as the $\delta $th highest value of this sequence, where $\delta $ is the integer nearest to $({1 - \alpha })({N - L + 1})$ [11].

Similar with $A{I_Q}$, another new monitoring statistic $A{I_{{T^2}}}$ can be built by applying $k{\text{NN}}$ on the ${T^2}$ statistic and the related detection threshold $AI_{{T^2}}^\alpha $ can also be determined.

2) The Online Detection

Next is the online detection, for which real-time calculation of AI is required. To meet this requirement, strategies for recursively calculating SED and for fast selecting the $k$ th smallest SED value are built below.

The symbol $\boldsymbol{z}{_p^ {\circ {\rm{T}}}} = [ {\begin{array}{cccc} {Q_{p - L + 1}^ \circ }&{Q_{p - L + 2}^ \circ }& \cdots &{Q_p^ \circ } \end{array}} ]\ $ denotes the data window of the $L$ continuous $Q$ values calculated based on the new measurements, where $p$ denotes the sampling time point for the online data and the symbol “$ \circ $” is used to distinguish the online data from the offline data. Because all rows of $\boldsymbol{Z}$ in (7) are normal windows with the ambient characteristic, they can be taken as the reference data to test whether $\boldsymbol{z}{_p^ {\circ {\rm{T}}}}$ deviates from normal or not. If $\boldsymbol{z}{_p^ {\circ {\rm{T}}}}$ is anomalous, the SEDs between it and all rows of $\boldsymbol{Z}$ will be large. Accordingly, the AI value $AI_{Q,p}^ \circ $ for $\boldsymbol{z}{_p^ {\circ {\rm{T}}}}$ which is the $k$th smallest SED value will also be large and will go beyond the threshold $AI_Q^\alpha $. For the $r$th row ${\boldsymbol{z}_r}^{\rm{T}}$ of $\boldsymbol{Z}$, the SED between it and $\boldsymbol{z}{_p^ {\circ {\rm{T}}}}$ can be calculated as: \begin{equation} {D^2}\ \left({\boldsymbol{z}_p^ \circ,{\boldsymbol{z}_r}} \right) = \sum\limits_{l= 1}^L {{{\left({Q_{p - l + 1}^ \circ - {Q_{r - l + L}}} \right)}^2}} \end{equation}View Source

The calculation of (9) needs $({2L- 1})$ additions and $L$ multiplications. So, the online computation load relies largely on the window length $L$. To reduce the number of mathematical operations needed in (9), a recursive calculation strategy, called Strategy ${{\Gamma }}$ here, is built using the result previously calculated.

Strategy ${{\Gamma }}$ for recursively calculating SED

For the window $\boldsymbol{z}{_{p- 1}^ {\circ {\rm{T}}}} = [ {\begin{array}{cccc} {Q_{p- L}^ \circ }&{Q_{p - L + 1}^ \circ }& \cdots &{Q_{p- 1}^ \circ } \end{array}} ]\ $ obtained a sampling time point earlier than $\boldsymbol{z}{_p^ {\circ {\rm{T}}}}$, the SED between it and the row ${\boldsymbol{z}_{r- 1}}^{\rm{T}}$ of $\boldsymbol{Z}$ can be calculated as: \begin{equation} {D^2}\ \left({\boldsymbol{z}_{p- 1}^ \circ,{\boldsymbol{z}_{r- 1}}} \right) = \sum\limits_{l= 1}^L {{{\left({Q_{p- l}^ \circ - {Q_{r - l + L - 1}}} \right)}^2}} \end{equation}View Source

Using (9) and (10 ), a recursive equation can be obtained as: \begin{equation} {D^2}\ \left({\boldsymbol{z}_p^ \circ,{\boldsymbol{z}_r}} \right) = \left\{ {\begin{array}{l} {{D^2}\left({\boldsymbol{z}_{p- 1}^ \circ,{\boldsymbol{z}_{r- 1}}} \right) + {{\left({Q_p^ \circ - {Q_{r - 1 + L}}} \right)}^2}}\\ { - {{\left({Q_{p- L}^ \circ - {Q_{r- 1}}} \right)}^2},\ r > = 2}\\ {\sum\nolimits_{l= 1}^L {{{\left({Q_{p - l + 1}^ \circ - {Q_{r - l + L}}} \right)}^2}},\ r = 1} \end{array}} \right. \end{equation}View Source

In comparison to (9), the calculation of ${D^2}({\boldsymbol{z}_p^ \circ,{\boldsymbol{z}_r}})$ in ( 11) only requires four addition and two multiplication operations for $r > = 2$, which is beneficial to the real-time requirement. Here, the reason why SED instead of ED is used can be seen, which is due to the need of the recursive calculation.

Using (11), the sequence of the SED values $\lbrace {D^2}({\boldsymbol{z}_p^ \circ}, {{\boldsymbol{z}_r}}) \rbrace_{r\ = \ 1}^{N - L + 1}$ can be calculated more efficiently. Then, the AI value $AI_{Q,p}^ \circ $ for $\boldsymbol{z}{_p^ {\circ {\rm{T}}}}$ can be determined as the $k$th smallest SED value. A strategy for fast selection of the $k$th smallest element from a sequence is built and described below.

Strategy ${{\Gamma \Gamma }}$ for fast selection of the $k$th smallest SED

If $k$ elements from a sequence are smaller than the rest, the maximum one of these $k$ elements is the $k$th smallest element of the entire sequence. Strategy ${{\Gamma \Gamma }}$ is built based on such consideration. Firstly, the first $k$ elements of $\{ {{D^2}({\boldsymbol{z}_p^ \circ,{\boldsymbol{z}_r}})} \}_{r\ = \ 1}^{N - L + 1}$ are sorted in the ascending order, denoted as ${D^2}^{(1)},{D^2}^{(2)}, \cdots,{D^2}^{(k)}$. Then, the $({k+ 1})$th element, denoted as ${D^2}^{({{*}})}$, is compared with the $k$ elements. If ${D^2}^{({{*}})}$ is larger than ${D^2}^{(k)}$ , ${D^2}^{({{*}})}$ is removed and the $k$ elements remain unchanged; otherwise, ${D^2}^{(k)}$ is removed and ${D^2}^{({{*}})}$ is put into ${D^2}^{(1)},{D^2}^{(2)}, \cdots,{D^2}^{({k- 1})}$ ensuring the reserved $k$ elements are still in the ascending order. After each element of $\{ {{D^2}({\boldsymbol{z}_p^ \circ,{\boldsymbol{z}_r}})} \}_{r\ = \ k + 1}^{N - L + 1}$ is handled by such comparison, the maximum one of the ultimately reserved $k$ elements is the $k$ th smallest element of the entire SED sequence.

For the best case, ${D^2}^{({{*}})}$ only needs to be compared with ${D^2}^{(k)}$. For the worst case, ${D^2}^{({{*}})}$ needs to be compared with all $k$ elements, e.g., ${D^2}^{({k- 1})} < = {D^2}^{(*)}\ < \ = {D^2}^{(k)}\ $ and ${D^2}^{(*)}$ is compared with ${D^2}^{(1)},{D^2}^{(2)},{D^2}^{(3)}, \cdots,{D^2}^{({k- 1})}$ in turn besides ${D^2}^{(k)}$. To reduce the number of comparisons, the binary search is introduced to search the target position for ${D^2}^{(*)}$. It begins by comparing ${D^2}^{(*)}$ with the middle one of the $k$ elements. If ${D^2}^{(*)}$ is not larger than the middle one, the search continues on the former half of the $k$ elements; otherwise, the search continues on the latter half. The search continues, eliminating half of the elements, and comparing ${D^2}^{(*)}$ to the middle one of the remaining elements, until the target position is found. The number of comparisons is ${\log _2}(k)$ at most, smaller than $k$.

In addition, the binary search is also used to sort the first $k$ elements of the SED sequence in the ascending order by putting them into target positions one by one. The only difference is that, when one of the first $k$ elements is put into the target position, the maximum element does not need to be removed. The number of comparisons is ${\log _2}({k!})$ at most. Thus, the total number of comparisons for Strategy ${{\Gamma \Gamma }}$ is ${\log _2}({k!}) + ({N - L + 1 - k}) \cdot {\log _2}(k)$ at most. Through this strategy, the AI value $AI_{Q,p}^ \circ $ for $\boldsymbol{z}{_p^ {\circ {\rm{T}}}}$ can be obtained as the maximum one of the ultimately reserved $k$ elements and can be compared with the threshold $AI_Q^\alpha $ for the online detection.

Similarly, the AI value $AI_{{T^2},p}^ \circ $ can also be obtained by Strategy ${{\Gamma }}$ and Strategy ${{\Gamma \Gamma }}$, and can be compared with the threshold $AI_{{T^2}}^\alpha $. Thus, disturbance detection of WAM-PCAkNN has been developed, which is summarized in Fig. 1.

Fig. 1.

Disturbance detection of WAM-PCAkNN.

Wide-Area Monitoring of Power Systems Using Principal Component Analysis and kk-Nearest Neighbor Analysis

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

Nomenclature

Introduction

Wide-Area Monitoring Based on PCA

Wide-Area Monitoring Based on PCA and KNN

A. Disturbance Detection of WAM-PCAkNN

1) The Offline Modelling

2) The Online Detection

B. Disturbance Localization of WAM-PCAkNN

C. Parameter Settings for WAM-PCAkNN

1) Parameter ${k}$ and Window Length ${L}$

2) Number of PCs

Case Studies

A. Four-Variable Numerical Model

B. New England Power System Model

Discussions

Conclusion

ACKNOWLEDGMENT

Authors

Figures

References

Citations

Keywords

Metrics

References

Wide-Area Monitoring of Power Systems Using Principal Component Analysis and $k$ -Nearest Neighbor Analysis