Introduction
Distribution grid line outage occurrence detection and localization is essential for efficient system monitoring and sustainable system operation [1]. A timely identification of the line outage effectively reduces potential financial loss. According to the U.S. Energy Information Administration, customers had an average of 1.3 outages and went without power for four hours during 2016 [2]. The frequency and severity of line outages caused by extreme weather events and power supply shortages have also increased in recent years.
The traditional line outage identification in distribution grids relies on passive feedback from customer reporting [3] or the “last gasp” message from smart meters [4], which is a notification automatically transmitted to the utility when power to the meter is lost. However, the performance of these methods will degrade while the transmission of the “last gasp” signal is not assured [5]. For instance, as the growth of distributed energy resources (DERs) penetration in distribution grids, customer can still receive power from the rooftop solar panels, battery storage, and electrical vehicles when there is no power flow in the distribution circuit connecting to the customer. So the smart meter at the customer premises cannot report a power outage. Moreover, some secondary distribution grids are mesh networks in urban areas. In this scenario, a single line outage caused by circuit faults and human activities may not cause a power outage due to alternative paths for power supply. In this second case, we will also observe smart meters measuring power injections without sending the “last gasp” notification for reporting outages.
While alternative power sources make the “last gasp” notification fail to report outages, can we still find the line outage time and location? Answering this question, recent literature aimed at collecting additional information for smarter decisions. For example, power measurements, such as phasor angles from phasor measurement units (PMUs), were modeled in [6] as a Gaussian Markov random field to track the grid topology change. Other power measurements, like power flows and load estimates, were also utilized in a compressive system [7] and hypothesis-test-based detection method [8]. Non-power measurements were explored as well, such as human network information from social media [9] and the weather information from environment [10]. In distribution grids, obtaining measurements such as micro-PMUs and accurate power flow data can be challenging and costly, as they are not commonly deployed in households. To address this limitation, our earlier research [3] demonstrated that utilizing readily available voltage magnitudes could still yield accurate outage identification outcomes. However, an in-depth examination of the probability distribution of voltage data and a theoretical guarantee for learning this distribution were not included in our previous work. These aspects are crucial for understanding the outage identification procedure. Besides, the method in [3] has feasibility and accuracy issues when learning the probability distribution. In this work, we fill the above gaps via a novel approach with theoretical guarantees.
To utilize the aforementioned measurements, both deterministic and probabilistic methods were proposed. Deterministic methods usually set up a threshold and declared the outage when the change of data exceeds the threshold. Such methods are simple to apply but cannot accurately discern data change in complex or large-scale grids. Probabilistic methods analyzed the data spatially or temporally. For spatial analysis, [11] studied graph spectral to assess the grid topology for line outage detection. However, such methods required the grid topology as a prior. For temporal analysis, tracing the probability distribution change of the time-series measurements is a common approach [3]. This is usually studied in the change point detection framework, which aims to find the distribution change of measurements as quickly as possible under the constraint of false alarm tolerance [12]. Such framework has been used in line outage and fault detection in transmission grids [13], [14] and DC micro-grids [15]. Although the change point detection framework assures optimal performance [16], it typically necessitates knowledge of both distributions before and after the change. Nevertheless, in distribution grids, this requirement is not practical as the post-outage distribution is unpredictable due to the large number of possible outage patterns, whereas the pre-outage distribution can be learned from historical measurements.
For removing the impractical requirement discussed above, methods were proposed to provide approximation or simplification of the unknown post-change distribution in change point detection. For instance, an approximated maximum likelihood estimation of unknown distribution parameters was proposed in [3]. A convexified estimation of the unknown distribution approach was introduced in [17]. [18], [19] bypassed the requirement in restricted distribution cases with partially unknown information (e.g., scalar Gaussian with unknown means and known variances). While these methods may mitigate the incompleteness of post-outage information, they have limitations on detection performance and parameter estimation.
In this article, we propose a practical and straightforward method for utilities to identify line outages with unknown outage patterns. To address the challenge of limited data availability, our approach relies solely on voltage magnitudes obtained from smart meters. This is advantageous compared to expensive phase angle measurements and accurate power flow data, as voltage magnitudes are more readily accessible in typical distribution grids [20]. For the utilize of voltage magnitudes, we have made distinctive contributions. We demonstrate that the increment of voltage magnitudes before and after a line outage follows two distinct multivariate Gaussian distributions, where the distribution parameters are influenced by grid connectivity. Moreover, we provide theoretical guarantees for learning the unknown probability distribution parameters based on voltage magnitude data. By effectively utilizing voltage magnitudes and incorporating theoretical guarantees, we address the limitations posed by the absence of precise phase-angle data. Through the detection of changes in the learned Gaussian distributions, we can successfully identify line outages.
The second challenge is the unavailability of post-outage distribution parameters as analyzed earlier. To address this issue, we propose a data-driven method that directly learns these unknown parameters using Projected Gradient Descent (PGD). While Gradient Descent (GD) is susceptible to feasibility issues in parameter estimation, the iterative nature of GD allows us to control the parameter updating trajectory. Specifically, we formulate the distribution parameter learning problem as a projection optimization problem constrained by the Bergman divergence [21]. This not only resolves the feasibility issue but also leads to accurate parameter estimation with theoretical guarantees. By accurately learning the parameters, our approach can effectively detect and localize line outages, even in large grids.
In addition to accuracy, utilities are also concerned with timely operation. By utilizing the statistical and physical characteristics of voltage data, we can limit the search space of unknown parameters to a convex set, which allows for fast and accurate recovery of the post-outage distribution. We have demonstrated that PGD can achieve optimal parameter learning with a polynomial-time convergence guarantee. Furthermore, we have developed an efficient implementation of the PGD algorithm, which reduces computational time by 75% and makes it particularly well-suited for timely grid operations.
In summary, our proposed method offers several contributions. Firstly, it only requires simple data but have theoretical guarantees. Secondly, it does not require prior knowledge of the outage pattern. Thirdly, it enables timely operation. Furthermore, our approach comes with performance guarantees and does not rely on knowledge of the distribution grid's topology, nor does it require all households to have smart meters data. The method is validated using four distribution grids and real-world load profiles with 17 outage configurations. In the following, Section II models the problem of line outage identification. Section III discusses the voltage data and identification procedure. Section IV extends to identification with unknown outage pattern. Section V provides performance guarantees on timely operation. Section VI evaluates our method. Section VIII concludes the article.
System Model
For showing our probabilistic design for change point detection and localization, we define variables on a graph probabilistically. Specifically, we model the distribution grid as a graph
In the distribution grid
Based on the modeling, the problem of identifying the distribution grid line outage is formally defined as follows.
Given: Voltage increments
from the smart meters.\Delta \mathbf {v}^{1:N} Find: The line outage time as soon as possible and the out-of-service branch as accurate as possible.
Outage Identification Via Voltage Magnitude
While the expensive phasor angles and accurate power flows are hard to obtain in distribution grids, [3] showed that the easier-to-acquire voltage magnitude could be utilized to identify the line outage. The authors found that although voltage data do not follow a regular distribution, the incremental change of voltage follows Gaussian distribution. However, two things were missing in [3]: a clear formula of the distribution and an elaborate analysis of how such distribution is affected by line outages. They are the key to understanding the procedure and performance of identifying the line outage, which will be discussed in detail in the following subsection.
A. Gaussian Distribution of Voltage Increment
For answering the missing question, in this subsection, we elaborately prove that the increment of voltage data
To study the distribution of
\begin{equation*}
\boldsymbol{Y}_\mathcal {G}= \boldsymbol{A}_{\mathcal {E}, \mathcal {G}}^\top \boldsymbol{Y}_\mathcal {E}\boldsymbol{A}_{\mathcal {E}, \mathcal {G}}+ \boldsymbol{Y}_\mathcal {G}^{s}. \tag{1}
\end{equation*}
In the above equation,
By representing
Lemma 1:
In a connected distribution grid
In the following, we consider the eliminated admittance matrix and keep the notation unchanged for convenience. Based on Lemma 1, the relationship between voltage increments
\begin{equation*}
\Delta \mathbf {V}_{\mathcal {G}} = \boldsymbol{Z}_\mathcal {G}\Delta \mathbf {I}_{\mathcal {G}}, \quad \text{where} \quad \boldsymbol{Z}_\mathcal {G}=\boldsymbol{Y}_\mathcal {G}^{-1}. \tag{2}
\end{equation*}
To derive the distribution of
\begin{equation*}
\Delta I_{i}\bot \Delta I_{k}, \ i\ne k \quad \text{and} \quad \Delta I_{k}\sim \mathcal {N}(\mu _{k},\sigma ^{2}_{k}), \ k\in \mathcal {G}. \tag{3}
\end{equation*}
This statement is adopted and validated by real data in many works [23], [24], [25], where the authors computed the mutual information between current injections to justify the independence. The empirical histogram in Fig. 2 also suggests that
Overview of the distribution grid line outage detection problem: we collect voltage magnitudes from smart meters installed at households and use the posterior probability ratio computed in (6) to detect the change in the underlying distribution of voltage increments.
Theorem 1:
Provided that (3) hold,
follows a multivariate Gaussian distribution,
still follows a multivariate Gaussian distribution (with different mean and covariance) after grid topology changes.
Proof:
With (2),
\begin{equation*}
\Delta \mathbf {V}_{\mathcal {G}} \sim \mathcal {N}(\boldsymbol{\mu}, \boldsymbol{\Sigma}), \tag{4}
\end{equation*}
When the grid topology is changed (e.g., due to a line outage), the incidence matrix
The grid network is still connected (which is our focus in this article). In this case, the changed admittance matrix
is still invertible, which results in a varied\widetilde{\boldsymbol{Y}}_\mathcal {G}=\widetilde{\boldsymbol{A}}_{\mathcal {E}, \mathcal {G}}^\top \boldsymbol{Y}_\mathcal {E}\widetilde{\boldsymbol{A}}_{\mathcal {E}, \mathcal {G}}+ \boldsymbol{Y}_\mathcal {G}^{s} . It implies that\widetilde{\boldsymbol{Z}}_\mathcal {G}=\widetilde{\boldsymbol{Y}}_\mathcal {G}^{-1} still follows a multivariate Gaussian distribution, only with different mean\Delta \mathbf {V}_{\mathcal {G}} and different covariance\widetilde{\boldsymbol{\mu}} calculated according to\widetilde{\boldsymbol{\Sigma}} .\widetilde{\boldsymbol{Z}}_\mathcal {G} The grid network is disconnected. In this case, we view the network as disjoint islands where each part is a connected sub-network, e.g.,
. By doing so, we can write the incidence matrix in block format, e.g.,\mathcal {G}=\mathcal {G}_{1}\cup \mathcal {G}_{2},\mathcal {G}_{1}\cap \mathcal {G}_{2}=\emptyset . According to the first case, voltage increments in each sub-network follow a multivariate Gaussian distribution, and so does their joint distribution. In this scenario, since some houses lose power connection and will have zero voltages, the outage time and location can be more easily found via our approach.\widetilde{\boldsymbol{A}}_{\mathcal {E}, \mathcal {G}}=\left(\begin{matrix}\widetilde{\boldsymbol{A}}_{\mathcal {E}_{1}, \mathcal {G}_{1}}&\boldsymbol{0}\\ \boldsymbol{0}&\widetilde{\boldsymbol{A}}_{\mathcal {E}_{2}, \mathcal {G}_{2}}\end{matrix}\right) \hfill\square
Suppose the outage occurs at time
\begin{align*}
\left\lbrace \begin{array}{ll}\Delta \mathbf {v}[n] \stackrel{i.i.d}{\sim } g:\mathcal {N}(\boldsymbol{\mu}_{0}, \boldsymbol{\Sigma}_{0}), \quad n=&1,2,\ldots,\lambda -1,\\
\Delta \mathbf {v}[n] \stackrel{i.i.d}{\sim } f:\mathcal {N}(\boldsymbol{\mu}_{1}, \boldsymbol{\Sigma}_{1}), \quad n=&\lambda,\lambda +1,\ldots,N, \end{array} \right. \tag{5}
\end{align*}
B. Outage Identification Via Distribution Change
Before proposing our novel solution to unknown outage pattern, we present the commonly used framework to find the outage time
To identify the outage time
\begin{align*}
&\Lambda (\Delta \mathbf {v}^{1:N})=\frac{\mathbb{P}(\lambda \leq N|\Delta \mathbf {v}^{1:N})}{\mathbb{P}(\lambda >N|\Delta \mathbf {v}^{1:N})} \\
&\quad= \frac{\sum _{k=1}^{N}\pi (k)\prod _{n=1}^{k-1}g(\Delta \mathbf {v}[n])\prod _{n=k}^{N}f(\Delta \mathbf {v}[n])}{\sum _{k=N+1}^{\infty }\pi (k)\prod _{n=1}^{N}g(\Delta \mathbf {v}[n])}, \tag{6}
\end{align*}
Theorem 2 (Line Outage Detection):
When
\begin{equation*}
\tau =\inf \lbrace N\in \mathbb{N}:\Lambda (\Delta \mathbf {v}^{1:N})\geq B_{\rho,\alpha }\rbrace, \tag{7}
\end{equation*}
\begin{align*}
\mathbb{E}[\tau -\lambda |\tau \geq \lambda ] &= \frac{|\log \alpha |}{-\log (1-\rho)+D_{KL}(f||g)}\\
&= {\textstyle \inf }_{\mathbb{P}(\tau ^{\ast }\leq \lambda)\leq \alpha }\mathbb{E}[\tau ^{\ast }-\lambda |\tau ^{\ast }\geq \lambda ], \tag{8}
\end{align*}
One notable feature of the detection procedure described above is its ability to function effectively without requiring knowledge of the grid topology. Additionally, it can handle non-Gaussian distributions for
Once the line outage occurrence is detected, localizing the out-of-service branch is also critical for system recovery. In [3], the authors proposed an accurate outage localization method by proving that the voltage increments of two disconnected buses are conditionally independent. They computed the conditional correlation of every possible pair of buses in the grid and checked if the value changes from non-zero to zero. This approach differs from the utilization of nodal electric circuit matrices [26], [27] for estimating fault location, while our approach has also been effective (as shown in Section VI-D) and capitalizes on the learned covariance matrix in scenarios where the post-outage distribution is unknown.
To estimate the conditional correlation between bus
\begin{equation*}
\rho _{ik}(\boldsymbol{\Sigma}) = \frac{\boldsymbol{\Sigma}_{\mathcal {I}|\mathcal {K}}(1,2)}{\sqrt{\boldsymbol{\Sigma}_{\mathcal {I}|\mathcal {K}}(1,1)\boldsymbol{\Sigma}_{\mathcal {I}|\mathcal {K}}(2,2)}}, \tag{9}
\end{equation*}
Theorem 3 (Line Outage Localization):
The conditional correlation is calculated based on (9) for every pair of
\begin{equation*}
\underbrace{\rho _{ik}^{-} = \rho _{ik}(\boldsymbol{\Sigma}_{0})}_{{before\ outage}}\quad \text{and}\quad \underbrace{\rho _{ik}^{+} = \rho _{ik}(\widehat{\boldsymbol{\Sigma}}_{1}) }_{{after\ outage}}. \tag{10}
\end{equation*}
According to Theorem 3, we track the change of covariance matrices to localize the out-of-service branch. Specifically, an out-of-service branch between bus
Outage Identification With Unknown Pattern
The detection and localization procedure in Section III requires knowing all the parameters of
To resolve such issue, we propose a data-driven framework to learn the post-outage distribution parameters
\begin{align*}
(\widehat{\boldsymbol{\mu}}_{1}, \widehat{\boldsymbol{\Sigma}}_{1}) &= \arg \min _{(\boldsymbol{\mu}_{1},\boldsymbol{\Sigma}_{1})} L(\boldsymbol{\mu}_{1},\boldsymbol{\Sigma}_{1}), \tag{11}
\end{align*}
\begin{align*}
-\sum _{k=1}^{N}\pi (k)\prod _{n=1}^{k-1}g(\Delta \mathbf {v}[n])\prod _{n=k}^{N}f(\Delta \mathbf {v}[n]|\boldsymbol{\mu}_{1},\boldsymbol{\Sigma}_{1}). \tag{12}
\end{align*}
To address the non-convex nature of the likelihood expressed in (12), the authors in [3] proposed a convex approximation using Jensen's inequality and derived closed-form solutions for (11). However, the use of Jensen's inequality can introduce inaccuracies in the resulting closed-form solutions, particularly in determining the minimum point. Furthermore, the estimated covariance matrix may not always be feasible. Specifically, a feasible covariance matrix must be positive definite, i.e.,
An alternative approach is using the Gradient Descent (GD) to find the solution to (11). While the vanilla GD also can not ensure the aforementioned feasibility of the parameters, the iterative learning nature in GD enables us to control the updating trajectory of parameters.
A. Unknown Parameters Estimation Via Projected Gradient Descent With Bregman Divergence Constraint
To guarantee that the estimation of parameters
\begin{equation*}
\boldsymbol{\theta}_{i}^{(e+1)}= \arg \min _{\boldsymbol{\theta}_{i}} \underbrace{\Delta _{\Phi } (\boldsymbol{\theta}_{i}, \boldsymbol{\theta}_{-i}^{(e)})}_{\text{Bregman divergence}} + \ \eta L(\boldsymbol{\theta}_{i}, \boldsymbol{\theta}_{-i}^{(e)}). \tag{13}
\end{equation*}
In the above equation,
\begin{align*}
\Delta _\Phi (\boldsymbol{\theta}_{i}, \boldsymbol{\theta}_{i}^{(e)}):= \Phi (\boldsymbol{\theta}_{i}) - \Phi (\boldsymbol{\theta}_{i}^{(e)}) - \text{tr}\left((\boldsymbol{\theta}_{i}-\boldsymbol{\theta}_{i}^{(e)})\Phi (\boldsymbol{\theta}_{i}^{(e)})^\top \right)
\end{align*}
Finding the solution to (13) relies on one characteristic of Bregman divergence: its gradient with respect to
Lemma 2:
The optimization problem in (13) is solved as
\begin{equation*}
\boldsymbol{\theta}_{i}^{(e+1)}= \dot{\Phi }^{-1}\left(\dot{\Phi }(\boldsymbol{\theta}_{i}^{(e)}) + \eta \nabla _{\boldsymbol{\theta}_{i}}L(\boldsymbol{\theta}^{(e)})\right). \tag{14}
\end{equation*}
With Lemma 2, we propose the important result of our article: the learning scheme of unknown parameters with feasibility guarantee. We will further show that this learning scheme is accurate and has convergence guarantees.
Theorem 4 (Projected Gradient Descent of Learning \boldsymbol{\mu}_{1}, \boldsymbol{\Sigma}_{1} ):
With careful customization of Bregman divergence (i.e., choosing the appropriate function
:\boldsymbol{\mu}_{1}\in \mathcal {R}^{M} and\Phi (\boldsymbol{\mu}_{1}) = \frac{1}{2}\Vert \boldsymbol{\mu}_{1}\Vert _{2}^{2} . The learning scheme is\dot{\Phi }^{-1}(\boldsymbol{\mu}_{1}) =\boldsymbol{\mu}_{1} \begin{equation*} \boldsymbol{\mu}_{1}^{(e+1)}= \boldsymbol{\mu}_{1}^{(e)}- \eta \nabla _{\boldsymbol{\mu}_{1}}L(\boldsymbol{\mu}_{1}^{(e)},\boldsymbol{\Sigma}_{1}^{(e)}). \tag{15} \end{equation*} View Source\begin{equation*} \boldsymbol{\mu}_{1}^{(e+1)}= \boldsymbol{\mu}_{1}^{(e)}- \eta \nabla _{\boldsymbol{\mu}_{1}}L(\boldsymbol{\mu}_{1}^{(e)},\boldsymbol{\Sigma}_{1}^{(e)}). \tag{15} \end{equation*}
:\boldsymbol{\Sigma}_{1}\succ 0 and\Phi (\boldsymbol{\Sigma}_{1}) = \text{tr}(\boldsymbol{\Sigma}_{1}\log \boldsymbol{\Sigma}_{1}-\boldsymbol{\Sigma}_{1}) . The learning scheme is\dot{\Phi }^{-1}(\boldsymbol{\Sigma}_{1})=\exp \boldsymbol{\Sigma}_{1} \begin{equation*} \boldsymbol{\Sigma}_{1}^{(e+1)}= \exp \left(\log \boldsymbol{\Sigma}_{1}^{(e)}- \eta \nabla _{\boldsymbol{\Sigma}_{1}}L(\boldsymbol{\mu}_{1}^{(e)},\boldsymbol{\Sigma}_{1}^{(e)})\right). \tag{16} \end{equation*} View Source\begin{equation*} \boldsymbol{\Sigma}_{1}^{(e+1)}= \exp \left(\log \boldsymbol{\Sigma}_{1}^{(e)}- \eta \nabla _{\boldsymbol{\Sigma}_{1}}L(\boldsymbol{\mu}_{1}^{(e)},\boldsymbol{\Sigma}_{1}^{(e)})\right). \tag{16} \end{equation*}
The learning process of
Besides the statistical properties (e.g., the covariance matrix is positive definite), smart meter data have physical properties as well due to grid operation. For example, because the standard range of voltage magnitude is between
\begin{align*}
\Phi (\boldsymbol{\mu}_{1}) =& \sum _{i=1}^{M}[(\mu _{i}+1.1)\log (\mu _{i}+1.1) \\
&+ (1.1-\mu _{i})\log (1.1-\mu _{i}) + \mu _{i}], \tag{17}
\end{align*}
To conclude, when post-outage distribution parameters
Timely Outage Identification With Performance Guarantee
In addition to the feasibility issue that has already been addressed in Theorem 4, the accuracy and computation time of the proposed learning scheme are two other concerns when implementing such a method for real-world outage identification. In this section, we demonstrate that our proposed method can achieve the optimal parameter solution with a guaranteed convergence. Furthermore, we present an efficient implementation for timely operation.
A. Restricted Convexity for Convergence Guarantee
While the non-convexity of likelihood
Definition 1:
A continuously differentiable function
Then, we show that
Theorem 5:
Using PGD to iteratively update
\begin{equation*}
L(\boldsymbol{\theta}_{i}^{\text{best}}) \leq L(\boldsymbol{\theta}_{i}^{\ast }) + \varepsilon \quad \text{and}\quad L(\boldsymbol{\theta}_{i}^{\text{avg}}) \leq L(\boldsymbol{\theta}_{i}^{\ast }) + \varepsilon,
\end{equation*}
The proof is in Appendix B. Moreover, since the best update converges faster as shown in Section VI, we choose it as the output of the learning scheme in Algorithm 1. To better visualize how the parameters are updated in the restricted convex area via Projected Gradient Descent (PGD), we provide Fig. 4. As we see, although the likelihood function
Visualization of learning parameters in
B. Acceleration for Timely Operation
Theorem 5 shows that our proposed method can find the optimal parameters with polynomial-time complexity, which enables quick operation. In this subsection, we provide an efficient implementation of the learning scheme to further accelerate the algorithm for timely outage identification. To achieve so, we notice that while the matrix exponential and logarithm operations in (16) provide good properties of covariance estimation, it is very time-consuming when calculating them. The calculation is time-consuming because it is often based on their infinite Taylor series. To accelerate these operations, we propose to use finite terms of their Taylor series to approximate the operations.
The matrix exponential is given by the power series in (18), and can be approximated by its first
Lemma 3:
The matrix exponential can be approximated as
\begin{align*}
\exp (\boldsymbol{X}) &= {\textstyle \sum }_{k=0}^{\infty }\frac{1}{k!}\ \boldsymbol{X}^{k} \approx {\textstyle \sum }_{k=0}^{K_{\exp }} \frac{1}{k!}\ \boldsymbol{X}^{k} := \widehat{\exp }(\boldsymbol{X}), \tag{18}
\end{align*}
Similarly, the matrix logarithm is given by the power series in (19), and can be approximated by the first
Lemma 4:
The matrix logarithm can be approximated as
\begin{align*}
\log (\boldsymbol{X}) \approx {\textstyle \sum \limits }_{k=1}^{K_{\log }} \frac{1}{k} (-1)^{k+1} (\boldsymbol{X} - \boldsymbol{I})^{k} := \widehat{\log }(\boldsymbol{X}). \tag{19}
\end{align*}
In order to make Theorem 4 still hold true when we use the approximated operation
In summary, the proposed two approximation operations
Validate on Extensive Outage Scenarios With Real-World Data
This section shows how our proposed method performs in various distribution grids with real-world data. To evaluate our method in systems with different sizes and environments, we design extensive experiments on IEEE 8-bus, IEEE 123-bus networks [31], as well as two European representative distribution systems: medium voltage (MV) network in the urban area and low voltage (LV) network in the suburban area [32]. In each network, bus 1 is selected as the slack bus.
To account for more complex outage scenarios in real-world distribution grids, we examine situations where alternative power sources are available after a line outage. In these scenarios, the “last gasp” notification is ineffective, making it more difficult to detect the line outage. We simulated the following two representative scenarios to replicate this complex scenario. It should also be noted that if certain buses are disconnected from the main grid and experience a voltage magnitude of zero following an outage, our method can accurately and quickly identify the out-of-service line. This is a simpler case compared to the ones we simulated below.
Mesh networks where most buses have non-zero voltages after the outage since they can receive power from alternative branches. Mesh network often depicts the outage scenario in urban areas. For simulating mesh networks, we add loops in each aforementioned network to ensure it is still connected after line outages, following the study in [3] and [25].
Radial networks with DERs where some buses still receive power from DERs though they are isolated from the main grid after an outage (see Fig. 1(a)). This outage scenario is typical in residential areas. To simulate DERs, we select multiple buses to have solar power panels with batteries as the storage. For the solar panel, we use the power generation profile computed by PVWatts Calculator [33].
For simulating more realistic data, we use the real residential power profile from Duquesne Light Company (DLC) in Pittsburgh, PA, USA. This profile contains anonymized and secure hourly (and 15-minute) smart meter readings of active power over more than 5,000 houses in the year 2016. The basic statistics of this dataset are summarized in Table I.
The time-series voltage data are simulated by the MATLAB Power System Simulation Package (MATPOWER) in MATLAB R2022b. In every distribution network, we assign active power
Due to the limited deployment of PMU in reality, the voltage phase angles are hard to obtain. Hence, as mentioned in Section II, we only use the voltage magnitude in the following experiments even though we model the voltage data in its phasor form. Another concern of data is the high dimensionality in large-scale grids. To resolve this computational issue, we apply the whitening transformation to our data as
In the subsequent experiments, we compare our proposed method with various baselines. When full knowledge of post-outage distribution
For more robust evaluation, each experiment will be conducted by the Monte Carlo simulation with over 1000 replications. In every replication, we randomly simulate outage time
A. Parameters Estimation With Accuracy and Convergence
Prior to demonstrating the accurate identification of outages with unknown post-outage distribution parameters, we must first verify that our method can learn the optimal parameters with a guaranteed convergence. Throughout the parameter learning iterations, we plot the Euclidean distance between the best update and the ground truth in Fig. 5. The plot indicates that our learning process converges to the ground truth, thereby verifying the convergence conclusion stated in Theorem 5.
B. Outage Detection With Small Delay and Rare False Alarm
After evaluating the effectiveness of using PGD to learn the unknown parameters, we then verify the performance of outage detection using such learned parameters.
The first criterion to evaluate our detection procedure is the average detection delay. To validate the asymptotic optimality of the detection delay in Theorem 2, in Fig. 6, we plot the average delay
Plots of the slope
The detection rule in Theorem 2 can also restrict the false alarm rate below maximum tolerance
Plots of the empirical false alarm rate against the theoretical probability of false alarm
In Table II, we present a summary of our proposed method's performance in various grid systems under different outage configurations. Our method demonstrates the ability to handle diverse outage scenarios in both mesh and radial networks with DERs penetration. Specifically, when
To compare with more relevant methods in the literature, we provide in Table III the detection performance of our proposed method and other methods. The comparison of average detection delay and false alarm rate shows that our method is only slightly degraded from the benchmark even though we have incomplete information, and outperforms other methods that also has incomplete information. The reason for this is our performance guarantee, as stated in Theorem 5, which ensures the accurate estimation of unknown post-outage distribution parameters. Furthermore, upon comparing our approach (PGD) with the machine-learning-based method (DCQ), we notice that the latter displays a greater variance in the average detection delay and false alarm rate. This can be attributed to the fact that the neural network's parameters are randomly initialized during training, leading to a more varied estimation of the unknown post-outage distribution parameters.
C. Analysis of Execution Time for Timely Operation
In addition to detecting delay and false alarm rate, the execution time of the proposed method is also critical for timely detection. Table IV presents the execution time of Algorithm 1 on various grid systems with different sampling rates.
From the records, less than 3 seconds per sample is needed to obtain the outage detection result when we receive a new sample, even for grid systems with more than 100 buses. This execution time can be negligible compared to the normal smart meter sampling interval, which ranges from 1 minute to 1 h. Moreover, since the most time-consuming part of our algorithm is the matrix exponential and matrix logarithm operation, we can accelerate the algorithm by approximating these operations based on their Taylor series expansion, as discussed in Section V-B.
To maintain the detection performance, we select an appropriate level of approximation with near-zero errors incurred. In Fig. 8, we choose
Ratio of saved execution time versus the ratio of error incurred by the operation
Table IV exhibits another phenomenon: as the sampling rate increases, the processing time for accumulated data
D. Outage Branch Localization With Accuracy
After detecting an outage occurrence, we further compute the conditional correlation between buses to localize the out-of-service branch, following Theorem 3. Here, Fig. 9 demonstrates the absolute conditional correlation of every pair of buses in the loopy 8-bus system before and after a line outage at branch 4-7. Since the value in the red box changes from a non-zero value before the outage (
Absolute conditional correlation of the loopy 8-bus system before and after an outage in branches 4-7. We choose
Table V demonstrates the accuracy rate of localization in 1,000 experiments. As shown, our proposed method can accurately localize over 90% of the outage branches, even without the post-outage distribution parameters.
E. Sensitivity Analysis to Data Noise and Data Coverage
Smart-meter data can be noisy and corrupted. Besides, smart-meter data may not be accessible in every household of the distribution grid. Thus, an analysis of our proposed method under different levels of data noise and data coverage is critical to gain a better understanding of its effectiveness in real-world outage scenarios.
In the U.S., ANSI C12.20 standard permits the utility smart meters to have an error within
Another concern regarding the smart meter data is that it may not be accessible for every household in the distribution grid, particularly in certain situations. For instance, 1) in rural areas, some households may not have installed smart meters, 2) the voltage data for certain households may be lost due to technical issues, and 3) some households may refuse to provide their voltage data due to privacy concerns. Although the new generation of smart meters is developing very fast, an analysis of incomplete coverage of smart meters data is needed to evaluate our algorithm. We first emphasize that our proposed method does not rely on the assumption of 100% coverage of smart meters data in the grid. In fact, a power line outage will influence almost all buses in the system, while the degree of influence depends on the distance between a bus and the source of the outage. Hence, we can reveal the outage by detecting the distribution change of some (not necessarily all) voltage data collected nearby the outage source.
According to [40], over 107 million smart meters were deployed by 2021, covering 75% of U.S. households. Hence, we simulate this scenario where only a ratio of buses is randomly selected to provide its voltage measurements in the grid system to detect the outage. Fig. 10 demonstrates both the average detection delay and the false alarm rate of our method under different levels of coverage ratio. In comparison to the scenario where voltage data is available for all buses, the detection delay increases by 1.2 units of time step. This means that an extra 1.2 samples of data are needed to detect the outage in the 75% data coverage scenario. Similarly, when the data coverage ratio drops to 50%, an additional 6.9 samples are required for detection. Furthermore, as the data coverage ratio decreases to only 50%, the false alarm rate increases from 0.7% to 21.9%.
Average detection delay (unit) and false alarm rate (%) under different levels of data coverage in loopy 123-bus system,
F. Sensitivity Analysis to Hyper-Parameters
Our detection procedure involves certain hyper-parameters that have the potential to influence the detection performance, such as the geometric distribution parameter
During our experiments, we randomly simulated the outage time
Average detection delay (unit), false alarm rate (%), and localization accuracy (%) under different levels of
Limitations
While this article has some performance guarantee, we also encounter some of the limitations that we look forward to address in the future. For instance, while the proposed approach requires only voltage magnitude data, it may be limited by the quality and availability of this data. As shown in Section VI-E, noise or incomplete data will lead to additional detection delay. Future research could investigate how to leverage additional types of data to improve outage detection and localization. Another aspect worth investigating is the ability to withstand diverse outage scenarios. For instance, if an outage occurs in an insignificant branch of the grid, resulting in minimal fluctuations in voltage data, detecting such subtle outages remains a challenge. Hence, further research is necessary to improve the detection performance in such cases. Lastly, although sensor readings facilitate line outage detection, they pose privacy concerns since they can disclose sensitive information like household occupancy and economic status to potential adversaries. An open problem is how to identify outages accurately without compromising the customer's data.
Conclusion
This article resolves three challenges in the line outage identification problem: data availability, unknown outage pattern, and timely operation. Our approach for detecting and localizing line outages only utilizes voltage magnitude. To handle unknown outage patterns, we propose a Projected Gradient Descent framework that can learn the unknown post-outage distribution parameters with a feasibility guarantee. We demonstrate the convergence guarantee of our method and further accelerate the proposed algorithm for timely operation, resulting in a reduction of more than 75% of execution time with minimal errors. Empirical results on representative grid systems confirm that our proposed method is suitable for timely outage detection and localization, even in the absence of prior knowledge about outage patterns.
Proof of Lemma 1
Proof:
In (1), the incidence matrix
\begin{equation*}
\boldsymbol{A}_{\mathcal {E}, \mathcal {G}}^\top \boldsymbol{Y}_\mathcal {E}\boldsymbol{A}_{\mathcal {E}, \mathcal {G}}= \boldsymbol{A}_{\mathcal {E}, \mathcal {G}}^\top \boldsymbol{B}^\top \boldsymbol{B} \boldsymbol{A}_{\mathcal {E}, \mathcal {G}}= (\boldsymbol{B} \boldsymbol{A}_{\mathcal {E}, \mathcal {G}})^\top (\boldsymbol{B} \boldsymbol{A}_{\mathcal {E}, \mathcal {G}}).
\end{equation*}
Proof of Theorem 5
Proof:
Since
\begin{equation*}
L_{e} = L(\boldsymbol{\mu}_{1}^{(e)}) - L(\boldsymbol{\mu}_{1}^{\ast }) \leq \langle \nabla L(\boldsymbol{\mu}_{1}^{(e)}), \boldsymbol{\mu}_{1}^{(e)}- \boldsymbol{\mu}_{1}^{\ast } \rangle. \tag{20}
\end{equation*}
\begin{align*}
&\frac{1}{2\eta }\left(\Vert \boldsymbol{\mu}_{1}^{(e)}- \boldsymbol{\mu}_{1}^{\ast }\Vert _{2}^{2} + \eta ^{2}\;U^{2} - \Vert \boldsymbol{\mu}_{1}^{e+1} - \boldsymbol{\mu}_{1}^{\ast }\Vert _{2}^{2} \right), \tag{21}
\end{align*}
\begin{equation*}
L_{e} \leq \frac{1}{2\eta }\left(\Vert \boldsymbol{\mu}_{1}^{(e)}- \boldsymbol{\mu}_{1}^{\ast }\Vert _{2}^{2} - \Vert \boldsymbol{\mu}_{1}^{e+1} - \boldsymbol{\mu}_{1}^{\ast }\Vert _{2}^{2} \right) + \frac{\eta U^{2}}{2},
\end{equation*}
Then, we prove that the averaged and best update both converge to