Introduction
Electronic devices and systems are subjected to thermal, electrical and mechanical stresses on the field and hence, the reliability of these devices is of utmost concern. Prognostics and health management (PHM) for electronic systems aims to detect, isolate and predict the onset and source of system degradation as well as the time to system failure [1]. In general, prognostic algorithms are categorized as model-based and data-driven methods. Model-based or physics-based methods (as they are often referred to interchangeably) use an accurate degradation model curated for the specific system/component under pre-defined environmental and operating conditions. The prediction accuracy of the degradation model gets compromised due to the variability in the environment/operating conditions or if an individual system/component tends to follow a significantly different degradation trend compared to the rest of the lot due to intrinsic or extrinsic factors that are not fully attributable. Commonly used model-based methods include Kalman filters (KF) [2], particle filters (PF) [3] and adaptations of the aforementioned methods [4]–[7].
Data-driven methods are alternate approaches to model-based methods as they identify degradation trends in the available degradation (start to end failure) data and use it for prognosis. Typical data-driven methods include support vector machine (SVM) [8], relevance vector machine (RVM) [9], neural networks (NN) [10]–[12], Gaussian process regression (GPR) [13]–[15] etc. Though a powerful approach, data-driven methods elevate computational complexity as they require large amount of failure data incorporating all the possible failure modes and operating conditions.
With emerging technologies and advancement in manufacturing processes, new devices/components are being developed to cater to these emerging needs. The major challenges inhibiting reliability studies on such newly developed devices/systems are the lack of sufficient failure data and also, the lack of full-fledged physics-of-failure models. To address the generalization problem, hybrid approaches combining model-based and data-driven methods are being widely used in the recent past. Hybrid prognostic approaches have an upper edge on such new devices as they neither require an accurate degradation model nor a large amount of training data for the purpose of remaining useful life (RUL) estimation. In other words, they make best use of partial knowledge and sparse data available for the new device/component under prognostic investigation.
Hybrid/Fusion Prognostics has become a research hotspot in the recent past. Wang et al. [16] used RVM for sparse representation of the degradation data along with the use of an empirical degradation model for predicting the RUL of the rolling element bearings. Chang and Fang [17] used RVM to determine the measurement noise in lithium-ion battery capacity degradation dataset and apply it for parameter estimation in a PF algorithm. Similarly, Song et al. [18] combined an autoregression (AR) model with PF for RUL estimation on spacecraft lithium-ion battery dataset from the NASA repository. Sun et al. [19] used the extreme learning machine (ELM) approach to construct a degradation model for battery degradation data and used it as the “measurement function” in PF state-space formulation algorithm for RUL estimation.
From the above-mentioned methods, it is evident that PF based approaches are promising for the purpose of RUL estimation. The reason being the ease of its applicability to highly non-linear systems along with non-Gaussian noise present in it. There have also been attempts to use a suitable surrogate data-driven model as the state transition function in PF algorithm over empirical/phenomenological models. The surrogate model formulation can be done using statistical curve fitting methods such as auto regressive integrated moving average (ARIMA). However, ARIMA methods are more suited for linear trends and for short-time horizon predictions. On the other hand, artificial neural networks outperform other statistical approaches due to its versatility to map complex input-output relationships and its ability to identify hidden degradation patterns in the system failure data. One of the first attempts to combine PF and NN for the purpose of RUL estimation was proposed by Baraldi et al. [20] back in 2013. The authors developed an ensemble NN model which creates a large amount of training datasets. These data are then converted into analytical models by data mining techniques and substituted into a PF algorithm as the measurement and state transition functions. The authors succeeded in overcoming the need of using an empirical model in the PF algorithm though the computational complexity and load was enormous compared to conventional methods. Also, the accuracy of the proposed method primarily depended on the amount of training data simulated by the ensemble NN model.
Sbarufatti et al. [21] proposed a self-learning adaptive algorithm to improve the approach proposed in Ref. [20]. The authors proposed a hybrid prognostic algorithm where the radial bias function (RBF) neural network was used as the degradation model for prognosis of lithium-ion batteries. The kernel parameters of the neural network model were estimated through the PF algorithm. Further, the proposed damage model was projected in future to predict the battery capacity and the remaining useful life. The discharge curves of 5 batteries from NASA’s repository were used in this work. The data with the longest curve corresponding to a pristine battery was used for training. Even though the prediction results of the proposed method were promising, the algorithm was tested on a dataset with a slow but progressive aging dynamics. This puts doubt on the adaptability of the algorithm on degradation data due to accelerated aging and unforeseen sudden changes in the degradation behavior.
Since the RBF networks are restricted to one hidden layer and linear output activation functions, the authors further extended their work to adopt a multilayer perceptron (MLP) NN model and tested the approach on NASA and CALCE datasets for estimating the RUL [22]. The authors used an MLP network with 5 hidden neurons to construct the degradation model. The NN degradation model was used in the PF algorithm for predicting capacity degradation. The authors tested the same method on fatigue crack growth data [23] as well. For crack growth data, the algorithm was trained with constant-amplitude degradation curve and tested on changing-amplitude degradation data. The prediction accuracy was poor in the early degradation stages and improved when more than 50% of degradation data was available for prognosis. Wu et al. [24] introduced a bat algorithm for resampling the particles in PF algorithm to improve the performance of MLP + PF hybrid approach. In all of the above-mentioned methods, NN model was used as the damage evolution model in the PF framework and the network parameters were solely dependent on the training dataset. No information from the test dataset was plugged into the algorithm and hence a large amount of data from the test data was required for the algorithm to identify the damage evolution trend.
In this work, we propose an intelligent and adaptive hybrid approach where PF based state estimation is used to warm-start a NN model. The informed stochastic parameter initialization helps to overcome the generalization issues faced by other hybrid approaches in literature. Secondly, the PF optimized NN model is trained with available information from the test dataset using a Levenberg Marquardt (LM) algorithm to find the optimal network parameter values which can be used for prediction of the future states and the RUL as well. The proposed approach was tested on NASA and CALCE lithium-ion batteries capacity degradation datasets and the RMSE values were used as the performance metrics. Our work moving forward is organized as follows. In Section II, the standard NN and PF approach for prognosis are explained and the prediction results using these conventional methods are presented in Section III. In Section IV, the proposed hybrid PF based NN approach is introduced and the prediction results for both NASA and CALCE dataset at different prediction starting points using the proposed hybrid framework are examined in Section V. Also, the impact of the network architecture is discussed as well. Finally, the conclusions of the study and possible scope for future work are summarized in Section VI.
Neural Network and Particle Filter Methodologies
A. Feedforward Neural Networks (FFNN)
In this work, the neural network architecture chosen is a Multi-Layer Perceptron (MLP) with \begin{equation*} h_{i}=\frac {1}{1+e^{-(w_{i}^{(1)}\ast k+b_{i}^{(1)})}}\tag{1}\end{equation*}
\begin{align*} g\left(\left ({w,b }\right),k\right)=f\bigg(\sum \limits _{i=1}^{M} {\left(h\left ({\left ({k\ast w_{i}^{\left ({1 }\right)}+b_{i}^{\left ({1 }\right)} }\right)w_{i}^{\left ({2 }\right)} }\right)+b_{i}^{\left ({2 }\right)}\right)} \\\tag{2}\end{align*}
B. Standard Particle Filter
Particle filters are sequential Monte Carlo (SMC) methods and work on the concept of recursive Bayesian method for state estimation. In this work, the PF algorithm is used for estimating the NN model parameters recursively for a given a set of observations/test data. Ideally, PF algorithm employs empirical or physics-based models for state estimation but in this work, we use the NN degradation model shown in Eqn. (2). Therefore, the state space representation for the system can be expressed as:\begin{align*} x_{k}=&x_{k-1}+\omega _{k-1} \tag{3}\\ z_{k}=&{g(x}_{k},k)+\varepsilon _{k}\tag{4}\end{align*}
\begin{equation*} L\left ({x_{k}\thinspace \vert \thinspace {\Theta }_{k}}\right) =\frac {1}{\sqrt {2\pi } \sigma _{j}}exp\left [{ \frac {-1}{2}\left ({\frac {z_{k}-x_{k}^{j}(\Theta _{k}^{j})}{\sigma _{k}^{j}} }\right)^{2} }\right]\tag{5}\end{equation*}
\begin{equation*} p\left ({x_{k}\thinspace \vert \thinspace z_{1:k}}\right) =\mathrm { }\sum \limits _{j=1}^{N_{s}} s_{k} \delta (x_{k}-x_{k}^{j})\tag{6}\end{equation*}
\begin{equation*} w_{k}=w_{k-1}p(z_{k}\vert x_{k}^{j})\tag{7}\end{equation*}
Comparison Between FFNN and Standard PF for RUL Estimation
A. Battery Degradation Datasets
Two different sets of battery degradation data from different laboratory setups are used in this work to evaluate the performance of the proposed hybrid algorithm.
1) CALCE Dataset
Four LiCoO2 prismatic cells with a rated capacity of 1.1Ah were subjected to degradation. These cells underwent a constant current/constant voltage charging protocol with a constant current rate of 0.5C until the voltage reached 4.2V. The batteries were sustained at 4.2V till the charging current dropped below 0.05A. The failure threshold for these batteries were set to be 0.88Ah. All four batteries showed similar capacity degradation trends; hence we chose CS-36 for training and CS-37 and CS-38 for the purpose of testing the proposed hybrid algorithm (HyA). The capacity degradation curve versus the charge/discharges cycles of the three batteries considered in this work are shown in Fig. 1(a).
The typical battery capacity degradation curves from the (a) CALCE dataset (b) NASA dataset for LiCoO2 cell technology.
2) NASA Dataset
For the second dataset, we chose to use the degradation data of 18650 LiCoO2 batteries from the NASA repository. The rated capacity of these batteries is 2.1Ah and unlike the CALCE dataset, these batteries were cycled under random currents rather than constant discharge currents. The battery capacities were measured after every 1500 periods and the failure threshold was set to be 1Ah. The capacity degradation curve versus the charge/discharges cycles is shown in Fig. 1(b). The labels of batteries used as training and test datasets from both CALCE and NASA repository are shown below in Table 1.
B. RUL Estimation Using FFNN or Standard Pf
In this section, the prediction results of the standard FFNN model and standard PF approach are compared. The battery CS-36 of CALCE dataset and RW9 of NASA dataset were used to train the neural network model with 3 hidden neurons. The number of hidden neurons greatly affects the performance of the NN model. However, there are not any definitive methods available in literature for the optimal selection of hidden neurons. The selection methods proposed in literature were either specific to the system/device considered or was specific to the type of NN architecture used for their study. Also, the number of NN weight and bias parameters for 5–6 hidden neurons scales up exponentially which can also result in overfitting of the time series degradation patterns and thereby lead to convergence issues and multiple local optima solutions. Hence, we adopted a trial-and-error method based on Bayesian Information Criterion (BIC) to fix the number of hidden neurons suitable for our study [25], [26].
The trained NN model was used to predict the capacity values of test data sets, RW10 and CS-37. The prediction results are shown in Fig. 2(b) and 2(d), respectively. The trained NN model was executed for 50 repetitions, of which only 3 repetitions were able to trace the actual degradation trend for RW10. A particular repetition was considered to be successful if the prediction trace lies within the
The prediction curves using standard PF algorithm for (a) Battery RW10 of NASA dataset and (c) Battery CS-37 of CALCE dataset. The prediction curves based on the standard feedforward NN model for (b) Battery RW10 of NASA dataset and (d) Battery CS-37 of CALCE dataset. The gray lines indicate the prediction traces for 5000 particles (in the case of PF) and 50 repetitions for NN.
To address the limitations of the FFNN, a PF algorithm with NN degradation model as the measurement equation was analyzed. This approach is similar to the hybrid approached used in [21]–[24], as described earlier in Section I. The curve fitting results for RW9 and CS-36 using NN model were used as the initial parameter guess for the PF algorithm. The prediction traces for RW10 and CS-37 for 5000 particles are shown in Figs. 2(a) and 2(c), respectively. The degradation model with 3 hidden neurons is expected to be capable of capturing the non-linearity in the degradation trend. However, PF algorithm is unable to track the actual degradation trend. The prognostic performance can be improved by using more complex NN architecture or by using meta-heuristic algorithms for resampling weighted particles in the PF algorithm. However, those approaches increase the computational load and complexity tremendously and employing such techniques in real-time would be challenging. This calls for the need of an adaptive hybrid approach for RUL estimation for systems wherein the network architecture is simple yet powerful enough to capture highly non-linear degradation trends.
Proposed Hybrid Prognostic Framework
The proposed hybrid particle filter trained neural network framework (HyA) is shown in Fig. 3. The proposed method can be split into three stages – (A) Degradation Model Formulation and Parameter Initialization, (B) Bayesian State Estimation and (C) Neural Network Prognosis with Bayesian Posterior Weights and Bias. To begin with in Stage A, one of the batteries run-to-failure data from each dataset was chosen as the training dataset. In this case, CS-36 and RW9 were chosen as the training datasets. The curve fitting results for NN model by varying the number of hidden neurons (2 to 6) were obtained. The Bayesian information criterion (BIC) was evaluated and used as the deciding factor for network model selection. This is because the BIC is a robust metric for model selection which penalizes the use of a model with too many fitting parameters for any given data set. The NN architecture model with the lowest BIC value was chosen for the purpose of this study. The BIC can be expressed as \begin{equation*} BIC=kln\left ({n }\right)-2\mathrm {ln}(\hat {L})\tag{8}\end{equation*}
The schematic of the proposed three-stage hybrid particle filter based neural network algorithm (HyA).
The prediction curves of RW10 for the proposed hybrid algorithm (HyA) with prediction starting point at (a) 30 cycles (b) 60 cycles and (c) 90 cycles. The gray lines represent the prediction traces for 50 repetitions of HyA.
In Stage B, the entire training dataset is fed into the PF algorithm as measurement data. The NN degradation model-based state transition function, shown in Eqn. (3) and Eqn. (4) are used to estimate the network parameters. The first two stages of the framework constitute the training phase. In Stage C, a FFNN model with 3 hidden neurons was configured using the test dataset. As mentioned earlier, the choice of using 3 hidden neurons was based on the BIC analysis. Batteries CS-37, CS-38 and RW10 were the test datasets used for state prediction and RUL estimation. The network parameters estimated by the PF algorithm in Stage B are used to configure the initial weights and biases of the FFNN model. This approach helps to warm start the neural network training for the test dataset. As warm starting the neural network helps to restrict the network parameters closer to the optimal values, the prediction accuracy is expected to be higher. The PF algorithm in the training phase is executed with 5000 particles and the predicted network parameter values with
The FFNN model in Stage C was executed for 50 repetitions with 50 different initial configuration of weights/bias values. The FFNN model is now further trained using the available test data (using the training data led PF based posterior weights and biases as the starting values of the NN) for the test dataset with 50 different initial weight/bias values. LM algorithm was opted for training the network parameters. The trained model was used to predict the future battery capacity till the end-of-life and hence estimate the remaining useful life of the batteries.
Results and Discussions
A. RUL Prediction for NASA Dataset Using HyA
The prediction results for battery RW10 at three different prediction starting points i.e., 30, 60 and 90 cycles are shown in Figs. 4(a), 4(b) and 4(c) respectively. At 30 cycles, with just few cycles of data available in the test data, the proposed HyA is able to capture the degradation trend including the inflection at the \begin{equation*} RMSE=\sqrt {\frac {1}{n}\sum \limits _{k=T}^{n} d_{k}^{2}}\tag{9}\end{equation*}
\begin{equation*} d_{k}=x_{predicted}-x_{true}\tag{10}\end{equation*}
The RUL distribution for Battery RW10 of NASA dataset at three different prediction starting points –
Comparison of the RMSE values (mean value of 50 repetitions) between the standard PF algorithm, FFNN and the proposed hybrid algorithm (HyA) are listed in Table 3. The RMSE values clearly show that the proposed hybrid algorithm performs better than the other two conventional methods. Despite the low success rate of 66% at 60 cycles due to the sharp inflection point, the RMSE value is better, clearly indicating the robustness of the proposed hybrid approach.
B. RUL Prediction for Calce Dataset Using HyA
For the CALCE dataset, battery CS-36 was used for training the network parameters. The failure threshold for these batteries were set at 0. 88Ah. The prediction results at
The prediction curves of CS-37 for the proposed hybrid algorithm (HyA) with prediction starting point at (a) 120 cycles (b) 185 cycles and (c) 250 cycles. The gray lines represent the prediction traces for 50 repetitions of HyA.
The prediction curves of CS-38 for the proposed hybrid algorithm (HyA) with prediction starting point at (a) 120 cycles (b) 185 cycles and (c) 250 cycles. The gray lines represent the prediction traces for 50 repetitions of HyA.
The RUL distribution for battery CS-37 & CS-38 of CALCE dataset at three different prediction starting points –
For CS-37, the width of RUL pdf for all three prediction starting points are about 240 cycles indicating very good performance of the proposed approach. At 250 cycles, 42 out of the 50 predicted traces did not cross the failure threshold as shown in Fig. 6(c). However, for the remaining 8 successful repetitions, the RUL error between the predicted RUL and true RUL was found to be less than 1%. On the other hand, the width of the RUL pdf for CS-38 at the
Similar results were observed for the fourth battery dataset as well (CS-39) but for the sake of brevity, the results are not discussed here. With the availability of a greater amount of test data, the prediction accuracy improves and the RUL pdf width at the
C. Choice of Network Architecture
A good choice of network architecture is essential for enhancing the prediction capabilities of the NN model. An optimal number of hidden neurons is essential for improved mapping of the complex input/output relations. In this work, the choice of hidden neurons was done by a trial-and-error procedure using BIC values. A lower number of neurons would inhibit the NN model to capture the nuances in the degradation trend and therefore, the analysis for a single hidden neuron was omitted as it would oversimplify the trend. The BIC results for different number of hidden neurons are shown in Table 2. The model with the minimum BIC value was considered as the best suited model for the purpose of this study. Choosing a very high number of hidden neurons may result in overfitting issues. The prediction results for 4, 5 and 6 hidden neurons are shown in Fig. 9(a), Fig. 9(b) and Fig. 9(c) respectively. It can be seen from Fig. 9(b) that the HyA tries to mimic the kink in the true data at 60 cycles and eventually distorts the prediction trend. Similar behavior was observed for 6 hidden neurons as well. The RMSE values as well as computational time for predictions against number of hidden neurons are listed in Table 5. From Table 5, it is evident that 3 hidden neurons (with minimum BIC value) is the optimal count for the handling the complexity in the considered battery degradation datasets in this study. Since 3 hidden neurons would require optimizing 9 parameters over 13 parameters for 4 hidden neurons, the computational load was found to be comparatively less. Thus, the choice of 3 hidden neurons was eventually found to be a fair compromise between low error values and less computational time as well. Moreover, if we went with the option of a higher neuronal count, then we would end up risking the chances of particle degeneracy or impoverishment in the PF as well.
The prediction curves of RW10 for the proposed hybrid algorithm (HyA) with (a) 4 hidden neurons (b) 5 hidden neurons and (c) 6 hidden neurons. The gray lines represent the prediction traces for 10 repetitions of HyA.
D. Computational Time
One of the major challenges with developing hybrid approaches is to make sure that the computational time is not compromised for better prediction accuracy. Hence, we chose to analyze the execution time for the proposed algorithm along with PF and FFNN approaches. The standard PF approach with 5000 particles took about 8.12 seconds for execution and a standard FFNN model took about 48.36 seconds for executing 50 repetitions. For the proposed hybrid algorithm (HyA), the execution time was found to be 53.14 seconds for 50 repetitions. The computational time for HyA is the cumulative time taken for executing Stage B and C shown in Fig. 3. Therefore, it can be concluded that the proposed hybrid approach is not only effective in terms of prediction accuracy but also in terms of computational load. It is to be noted that the simulations were executed in a standard DELL® desktop workstation (Model-Inspiron 14 – 5459) with 16GB RAM and Intel Core i5 processor.
E. Comparison of Prediction Results With Existing PF + NN Hybrid Prognostic Framework
There are few hybrid prognostic methodologies proposed in literature combining PF and NN. However, most of those research works were tested on datasets different from the battery degradation datasets considered in this work such as battery voltage discharge curves, crack propagation datasets etc. However, Ref. [24] shows the prediction results for CALCE battery CS-37 and NASA RW11 battery. In Ref. [24], Wu et al. have used the NN model equation as the state transition function in the PF framework and subsequently PF algorithm was used for the purpose of RUL estimation. The authors have performed curve fitting on each of the battery dataset considered for their study and the corresponding curve fitting parameters were fed into the PF framework as the initial parameter guess values. The prediction results comparing Ref. [24] and our proposed HyA are shown in Fig. 10(a) and Fig. 10(b) for CS-37 and RW11 respectively. It is to be noted that the blue curve representing HyA in Fig. 10 is the mean value of predictions for 50 repetitions. The extracted RMSE values for the prediction results of Ref. [24] was found to be 0.3686 and 0.2698 for RW11 and CS-37 respectively. This is much higher than the RMSE values for the same data set extracted using our framework as shown in Table 6 below. It is evident from the results that our proposed HyA framework outperforms the prediction results of Wu et al. [24] despite their impractical assumption that the entire test dataset information is available apriori for curve fitting. It can thus be concluded that our proposed method is robust, more realistic and highly adaptable.
Conclusion
In this work, a hybrid particle filter based neural network model is proposed for RUL prediction of lithium-ion batteries. A neural network model with one hidden layer containing 3 hidden neurons and a sigmoid activation function at the input side and linear activation function in the output side was used as the damage evolution model in the particle filter framework. The novelty of the proposed method lies in using PF algorithm for estimating the posterior distributions of the network weight and bias parameters corresponding to the training dataset and utilizing those parameters to warm start an MLP network model for the test dataset. The MLP model is further trained to optimize the network parameters for the test dataset. The trained MLP model is used to predict the future battery capacity values and remaining useful life. The proposed method was tested on NASA and CALCE datasets. The prediction results were compared with standard PF and FFNN methods. It was evident that the proposed approach was versatile enough to use the same NN architecture for both NASA and CALCE even though their degradation patterns are vastly different. The versatility of the proposed method makes it an appropriate choice for reliability studies on newer systems. Also, the proposed methods have very good prediction accuracy with very low RMSE values and is efficient in terms of computational time as well compared to the two conventional methods.
For future work, we intend to test the algorithm on LED luminosity degradation datasets where there are three distinct degradation phases and would be challenging for any prognostic algorithm to model and predict such highly non-monotonic degradation phenomena [27]. Also, the impact of different resampling strategies used in PF algorithm on RMSE values and computational time would be explored in future. The proposed approach can also be modified to include physics-informed surrogate models over FFNN models which would have the capability to encode underlying physical laws in a given dataset.