Introduction
As the “lifeblood of industry,” crude oil (CO) is the most significant strategic raw material of contemporary industrial society, a key to prosperity and national security, and a cornerstone of civilization [1]. It is closely tied to the growth of the global economy, and changes in oil prices significantly affect a wide range of economic indicators. Predicting how the price of crude oil will fluctuate in the future is therefore highly important. Scholars have used a range of research techniques to perform in-depth analyses and forecasts of global oil prices from diverse perspectives. According to the literature reviewed, these research methodologies fall roughly into three groups: econometric models, artificial intelligence models, and integrated forecasting models. For the econometric models, the authors in [2] examined how well various ARIMA-GARCH (Generalized Autoregressive Conditional Heteroscedasticity) models performed in simulating and predicting the conditional mean and volatility of weekly crude oil prices. The researchers in [3] compared the findings of the ARIMA model with those of the decomposition-based vector autoregressive model (DVAR), which is used to forecast monthly WTI crude oil price data. The authors in [4] examined the predictive power and impact of the Google index on the CO price by incorporating it into ARMA-GARCH and ARIMA models. By contrasting the suggested MMGARCH (Mixture Memory GARCH) model with other volatility models, Klein and Walther [5] extend the literature on modeling and forecasting CO price returns. Stelios et al. [6] compare the VAR model’s forecasting capacity with that of the Random Walk (RW) model and the AR model. The results are listed at the top of Table 1. Generally, econometric models assume that the data are stable, regular, and linear; under this assumption, they can accurately predict the CO price.
The international crude oil market, however, exhibits complex, non-linear, and multidimensional price movements. The intricate features concealed in crude oil prices may be too complicated for these conventional econometric methods to detect. With the rapid growth of artificial intelligence, widely used non-linear models such as support vector regression (SVR), Artificial Neural Networks (ANN), and Random Forests (RF) have been applied to CO price forecasting, successfully fitting the non-linear CO price series. For example, the researchers in [7] utilized a neural network and a genetic algorithm to predict the WTI CO price. Similarly, the authors in [8] utilized a neural network to forecast the price term structure of crude oil futures. Fan et al. [9] used an Imperialist Competitive Algorithm with Support Vector Regression (ICA-SVR) to forecast the crude oil price. Mostafa and El-Masry [10] projected the CO price using gene expression programming (GEP), and Gao and Yalin [11] forecast the CO price based on stream learning. In line with empirical research, artificial intelligence models outperform the conventional paradigm. An AI model can accurately predict non-linear and non-stationary sequences, but a single AI model cannot fully represent the dynamic changes of the complicated CO price time series that are responsible for its significant variations. The hybrid forecasting model, however, overcomes the drawbacks of time-series instability and nonlinearity and enhances CO price prediction accuracy by combining a range of methodologies. In the past few years, integrated models for predicting the CO price have developed quickly. Tao et al. [12] suggested a more effective EMD-SBM-FNN model that can capture the intricate dynamics of the crude oil price. Zhang et al. [13] introduced EEMD-PSO-LSSVM-GARCH, a novel hybrid approach to forecast CO prices.
Yu et al. [14] used the EEMD-DCD-LSSVR model to predict the CO price. The authors in [15] estimated the CO price using bootstrap aggregation (bagging) and Stacked Denoising Auto Encoders (SDAE). Similarly, the authors in [16] used the EEMD-RVFL model to predict the CO price, and the authors in [17] used the EEMD-EELM-ADD model as a unique decomposition-ensemble technique for predicting CO prices. Ding [18] created a hybrid EEMD-ANN-ADD model for predicting the CO price. The authors in [19] used the DFN-AI model to predict the CO price, and the authors in [20] used the VMD-ICA-ARIMA hybrid model for the same purpose. In the same way, Zhang et al. [21] suggested an iterated-combination algorithm to predict the CO price, and the authors in [22] combined RW and ARMA to predict the CO price. Zheng et al. [23] proposed EEMD with a Dynamic Artificial Neural Network (DANN) to forecast the CO price. The authors in [24] demonstrated load prediction based on the long short-term memory (LSTM) model combined with a Back Propagation Neural Network (BPNN) and Local Mean Decomposition (BPNN-LMD-LSTM); the design is based on a fixed-time consistency algorithm with random delay to predict the economic dispatch of microgrids. The authors in [25] proposed a landslide displacement prediction model, the local mean decomposition-bidirectional long short-term memory (LMD-BiLSTM), which depends on the time-frequency analysis method. Heng Sun [26] utilized a three-step method that exhibits great potential for application in the remaining-useful-life (RUL) prediction of rotating machines. The authors in [27] integrated LSTM, wavelet threshold denoising (WTD), and LMD into a novel combined model called LMD-WTD-LSTM to estimate short-term gas consumption. In the same way, the authors in [28] introduced a new model that enhanced the accuracy of the predictions.
A novel technique called variational mode decomposition (VMD) has been used to predict a major-factor time series utilizing its secondary factors, and a new multiscale forecasting model was introduced that produced an optimal forecast [29]; this model outperformed the compared models in forecasting complex time-series data. In the same way, the authors in [30] decomposed the data into many features via VMD, and the multiple features were then trained with machine learning classifiers. The authors in [31] forecasted daily PM2.5 and PM10 data employing Robust LMD (RLMD) and a moving-window ensemble technique using linear and nonlinear modelling frameworks. The research mentioned above shows that a hybrid model mixes single models so that the benefits of each model balance out the drawbacks of the others. As a result, the hybrid model is superior to the single model and offers us research directions. From the above discussion, the following research questions have been generated: How can the end-point effect caused by the complicated dynamic change of the crude oil price time series be eliminated, and how can additional information be obtained from the different frequencies of the crude oil price data itself? How can the calculations of the hybrid model be streamlined? How can CO price predictions be made more accurate? In the current study, we use Local Mean Decomposition (LMD) and an artificial intelligence model to forecast the CO price: (1) Utilizing LMD to decompose the CO price time series in an adaptive manner, removing the end-point effect, and further exploring the data’s various frequencies.
(2) This work uses average mutual information (AMI) to reduce the amount of calculation, taking into account the growth of the hybrid model’s computational load; (3) by separating the time series into stochastic and deterministic components, the econometric model is able to represent the volatility features of the crude oil price time series; (4) the prediction outcomes are combined utilizing LSTM; (5) the experimental results demonstrate that the LMD-SD-ARIMA-LSTM model suggested in this study outperforms a single model in terms of crude oil price prediction accuracy, and that traditional econometric models can increase prediction accuracy through decomposition and aggregation. Researchers are still working on these problems. In comparison to previous studies, the novel hybrid model LMD-SD-ARIMA-LSTM reduces the volatility and addresses the overfitting problem of neural networks. The proposed hybrid technique is validated using publicly accessible West Texas Intermediate (WTI) data, and forecast accuracy is compared using accuracy measures.
The organization of the study is as follows. Section I consists of the introduction and a literature review. Section II provides a brief description of the methods used in this study, with Section II-D covering the evaluation criteria. Section III presents the analysis and discussion, followed by the conclusion.
Methodology
A. Local Mean Decomposition
Using adaptive time-frequency analysis, LMD is a technique for handling non-stationary signals [32]. The foundation of the LMD approach is separating envelope signals and purely frequency-modulated signals from the original signal. A physically meaningful product function (PF) component with an instantaneous frequency can be derived by multiplying an envelope signal with the corresponding purely frequency-modulated signal. The decomposition procedure for the original signal x(t) can be broken down into five steps:
Step 1: Select all local extremum points n_{i} of the original signal x(t). The mean of adjacent extremum points n_{i} and n_{i+1} is m_{i} = (n_{i} + n_{i+1})/2, and the envelope estimate \alpha_{i} is
\begin{equation*} {\alpha }_{i}= \frac {\left |{ n_{i}-n_{i+1} }\right |}{2} \tag{1}\end{equation*}

Step 2: The local means m_{i} and envelope estimates \alpha_{i} are then smoothed using a moving average to obtain the local mean function m_{11}(t) and the envelope estimate function \alpha_{11}(t).

Step 3: Remove the local mean function m_{11}(t) from the original signal x(t), that is:
\begin{equation*} {h}_{11}(t)=x(t)-m_{11}(t) \tag{2}\end{equation*}

Step 4: h_{11}(t) is amplitude-demodulated by dividing it by \alpha_{11}(t), giving the purely frequency-modulated signal
\begin{equation*} {s}_{11}(t)=h_{11}(t)/\alpha _{11}(t) \tag{3}\end{equation*}
This is repeated iteratively n times until s_{1n}(t) is purely frequency modulated; the iteration stops when \lim _{n\rightarrow \infty }\alpha _{1n}(t)=1.

Step 5: The corresponding envelope \alpha_{1}(t) and the first component {PF}_{1}(t) are obtained:
\begin{align*} {\alpha }_{1}(t)&=\alpha _{11}(t)\alpha _{12}(t)\ldots \alpha _{1n}(t)=\prod \nolimits _{q=1}^{n} {\alpha _{1q}(t)} \tag{4}\\ {PF}_{1}(t)&=\alpha _{1}(t)s_{1n}(t) \tag{5}\end{align*}
The component {PF}_{1}(t) is removed from x(t), the new signal u_{1}(t) is obtained, and the process is repeated k times until the signal u_{k}(t) is a constant and the oscillations have stopped. Finally, the original signal x(t) can be written as
\begin{equation*} x(t)=\sum \nolimits _{p=1}^{k} {PF}_{p} \left ({t }\right)+u_{k}\left ({t }\right),\quad t=1,\ldots,n \tag{6}\end{equation*}
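As an illustration, a first sifting pass of the procedure above can be sketched in Python. This is a simplified sketch, not the full LMD algorithm: only a single sifting iteration is used, the moving-average window and extrema handling are assumptions, and the function names are illustrative.

```python
import numpy as np

def moving_average(y, w):
    """Centered moving average used for the smoothing in Step 2."""
    kernel = np.ones(w) / w
    return np.convolve(y, kernel, mode="same")

def lmd_first_pf(x, w=5):
    """One simplified LMD sifting pass on x: returns (PF1, residual u1).

    With a single iteration, PF1(t) = a11(t) * s11(t) reduces to
    x(t) - m11(t), and the residual is the smoothed local mean m11(t).
    """
    t = np.arange(len(x))
    # Step 1: local extremum points n_i (endpoints kept for interpolation)
    idx = [0] + [i for i in range(1, len(x) - 1)
                 if (x[i] - x[i - 1]) * (x[i + 1] - x[i]) <= 0] + [len(x) - 1]
    idx = np.array(idx)
    n = x[idx]
    m = np.concatenate([[n[0]], (n[:-1] + n[1:]) / 2.0])       # local means m_i
    a = np.concatenate([[0.0], np.abs(n[:-1] - n[1:]) / 2.0])  # envelopes, Eq. (1)
    # Step 2: interpolate to every sample and smooth with a moving average
    m11 = moving_average(np.interp(t, idx, m), w)
    a11 = moving_average(np.interp(t, idx, a), w)
    # Steps 3-4: remove the local mean (Eq. 2) and demodulate (Eq. 3)
    h11 = x - m11
    s11 = h11 / np.maximum(a11, 1e-12)
    # Step 5 (single iteration): PF1 = a11 * s11, Eq. (5); residual u1 = x - PF1
    pf1 = a11 * s11
    return pf1, x - pf1
```

By construction, PF1 plus the residual reconstructs the original signal exactly, mirroring Eq. (6) for a single component.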
B. ARIMA
Box and Jenkins first proposed the ARIMA model in the early 1970s. The autoregressive integrated moving average model, written ARIMA(p,d,q), has the following structure: the dependent variable must be made stationary (through the I component), and the regressors are lags of the dependent variable (the AR component) and/or lags of the errors (the MA component). In general, therefore, an ARIMA model can be considered a specific kind of regression model.
In general, an ARMA model looks like this:\begin{equation*} {Y}_{t}= \alpha _{0}+ \sum \nolimits _{i=1}^{p} \alpha _{i} Y_{t-i}+ \sum \nolimits _{j=1}^{q} \beta _{j} \varepsilon _{t-j}+ \varepsilon _{t} \tag{7}\end{equation*}
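To make the AR part of Eq. (7) concrete, a minimal least-squares sketch is shown below. In practice the full ARIMA(p,d,q) model would be estimated with a statistical package (this study uses R); this pure-NumPy version handles only the AR component, and the function names are illustrative.

```python
import numpy as np

def fit_ar(y, p):
    """Least-squares fit of the AR part of Eq. (7):
    Y_t = a0 + sum_{i=1..p} a_i * Y_{t-i} + e_t.  Returns [a0, a1, ..., ap]."""
    Y = y[p:]
    # Design matrix: intercept column plus the p lagged series
    X = np.column_stack([np.ones(len(Y))] +
                        [y[p - i:-i] for i in range(1, p + 1)])
    coef, *_ = np.linalg.lstsq(X, Y, rcond=None)
    return coef

def forecast_ar(y, coef, steps):
    """Iterated one-step-ahead forecasts from the fitted AR model."""
    p = len(coef) - 1
    hist = list(y)
    out = []
    for _ in range(steps):
        yhat = coef[0] + sum(coef[i] * hist[-i] for i in range(1, p + 1))
        hist.append(yhat)
        out.append(yhat)
    return np.array(out)
```

For a stationary AR(1) process Y_t = a0 + a1 Y_{t-1} + e_t, the iterated forecast converges to the unconditional mean a0 / (1 - a1), which gives a quick sanity check on the fit.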
C. LSTM
LSTM is a type of RNN with the capacity to take long-term dependencies into account. The authors in [34] developed the LSTM in 1997. LSTMs differ from other RNN techniques because they can retain information over a longer period of time and do not suffer from the long-term dependency problem. Internally, LSTMs operate similarly to other RNN methods, employing neural network gates and layers, and they have a chain structure. The LSTM’s construction is designed to have a cell state that runs the length of the chain. Gates are used to control whether or not data may be transferred into the cell state. Additionally, there are parts known as gated cells that enable the storage of data from earlier LSTM outputs; this is where the memory-related aspects of LSTM come into play.
An advanced soft computing technique, LSTM was developed from the Recurrent Neural Network (RNN). The RNN, one of the numerous Artificial Neural Network (ANN) techniques, was developed to address the ANN’s weakness in handling time correlation in a data sequence; it enhances the neurons in the network with recurrent connections, making it possible for an RNN to create a sequence-to-sequence mapping between input and output data [24]. Unfortunately, long-range dependencies are still a challenge for traditional RNNs, which have difficulty learning long-term temporal correlations due to exploding or, conversely, vanishing gradients [25]. The authors in [34] used LSTM memory cells to get around this restriction. These cells use a three-gate mechanism made up of an input gate, an output gate, and a forget gate to store the temporal state of the networks [35]. Figure 1 shows an LSTM cell with all three of those gates as well as the cell state [36].
LSTM gates are simply used to limit the amount of information that can be transferred. They typically consist of a sigmoid neural network layer and a pointwise multiplication operation. While the forget gate is used to selectively forget information in the cell state, the input gate decides what new information will be stored in the current cell state. The output gate is then utilized to determine the value that we wish to output [22]. The forget gate is the initial component of the LSTM cell. It is used to regulate how much of the previous cell’s hidden state to forget and can be expressed as follows:\begin{equation*} f_{t} = \sigma (W_{f}h_{t-1} + U_{f}x_{t} + b_{f}), \tag{8}\end{equation*}
The input gate and the candidate cell state are given by\begin{align*} i_{t}& = \sigma (W_{i}h_{t-1}+ U_{i}x_{t}+b_{i}), \tag{9}\\ \tilde {C}_{t}&= \text {tanh}(W_{c}h_{t-1}+U_{c}{x}_{t}+b_{c}). \tag{10}\end{align*}
The current cell state C_{t} is then obtained by combining the previous cell state, scaled by the forget gate, with the candidate state, scaled by the input gate:\begin{equation*} C_{t}= f_{t}\odot C_{t-1}+i_{t}\odot \tilde {C}_{t}. \tag{11}\end{equation*}
The output gate is used to determine the hidden state in the last step (i.e., the value of the current hidden state h_{t}). First, the sigmoid function in Eq. (12) is applied to the previous hidden state h_{t-1} and the current input x_{t}; the result is then multiplied by the tanh of the current cell state:\begin{align*} {O}_{t}&=\sigma (W_{\mathrm {o}} h_{t-1}+U_{\mathrm {o}}x_{t}+b_{o}) \tag{12}\\ h_{t}&= O_{t} \odot \text {tanh}(C_{t}) \tag{13}\end{align*}
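A single forward step of the LSTM cell in Eqs. (8)–(13) can be sketched in NumPy as follows. This is an illustrative sketch with randomly initialized weights, not a trainable implementation; the parameter layout and function names are assumptions.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_cell(x_t, h_prev, c_prev, params):
    """One LSTM cell step implementing Eqs. (8)-(13).
    params holds the weight matrices W_*, U_* and biases b_* per gate."""
    W, U, b = params["W"], params["U"], params["b"]
    f_t = sigmoid(W["f"] @ h_prev + U["f"] @ x_t + b["f"])      # forget gate, Eq. (8)
    i_t = sigmoid(W["i"] @ h_prev + U["i"] @ x_t + b["i"])      # input gate, Eq. (9)
    c_tilde = np.tanh(W["c"] @ h_prev + U["c"] @ x_t + b["c"])  # candidate, Eq. (10)
    c_t = f_t * c_prev + i_t * c_tilde                          # cell state, Eq. (11)
    o_t = sigmoid(W["o"] @ h_prev + U["o"] @ x_t + b["o"])      # output gate, Eq. (12)
    h_t = o_t * np.tanh(c_t)                                    # hidden state, Eq. (13)
    return h_t, c_t

def init_params(n_hidden, n_input, rng):
    """Random small-weight initialization for the four gates f, i, c, o."""
    gates = "fico"
    return {"W": {g: rng.normal(0, 0.1, (n_hidden, n_hidden)) for g in gates},
            "U": {g: rng.normal(0, 0.1, (n_hidden, n_input)) for g in gates},
            "b": {g: np.zeros(n_hidden) for g in gates}}
```

Note that h_t is always bounded in (-1, 1), since the output gate lies in (0, 1) and tanh of the cell state lies in (-1, 1).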
D. Evaluation
In this study, forecasts are assessed using three prediction error criteria: Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and Mean Absolute Percentage Error (MAPE). The last one expresses the level of inaccuracy as a percentage, whereas the first two express the error level in the units of the data. As shown by Shahid et al. in [37] and Hansun et al. in [36] and [38], all three criteria can be stated as:\begin{align*} MAE&=\frac {1}{n}\sum \nolimits _{t=1}^{n} {\vert Y_{t}- F_{t} \vert } \tag{14}\\ RMSE&=\sqrt {\frac {1}{n}\sum \nolimits _{t=1}^{n} {(Y_{t} - F_{t})^{2}}}, \tag{15}\\ MAPE&=\frac {1}{n}\sum \nolimits _{t=1}^{n} {\frac {\vert Y_{t} - F_{t} \vert }{Y_{t}}}. \tag{16}\end{align*}
In addition, the Diebold-Mariano (DM) test is used to check whether the difference in accuracy between two competing forecasts is statistically significant, where \bar{k} denotes the mean loss differential between the two forecasts:\begin{equation*} DM=\frac {\bar {k}}{\sqrt {Var(\bar {k})}}, \tag{17}\end{equation*}
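Equations (14)–(17) can be sketched directly in NumPy. The DM statistic below assumes a squared-error loss differential, which is one common choice; the function names are illustrative.

```python
import numpy as np

def mae(y, f):
    return np.mean(np.abs(y - f))                 # Eq. (14)

def rmse(y, f):
    return np.sqrt(np.mean((y - f) ** 2))         # Eq. (15)

def mape(y, f):
    return np.mean(np.abs(y - f) / y)             # Eq. (16)

def dm_stat(y, f1, f2):
    """Diebold-Mariano statistic, Eq. (17), with squared-error loss.
    k_t = e1_t^2 - e2_t^2 is the loss differential; a positive DM value
    indicates that forecast f2 is more accurate than f1."""
    k = (y - f1) ** 2 - (y - f2) ** 2
    kbar = k.mean()
    var_kbar = k.var(ddof=1) / len(k)   # variance of the mean differential
    return kbar / np.sqrt(var_kbar)
```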
Empirical Analysis
A. Statistical Description of Data
West Texas Intermediate (WTI) data from the U.S. Energy Information Administration (EIA, https://www.eia.doe.gov) are used in this study, as depicted in Figure 1. The WTI crude oil dataset covers March 12, 2018, to February 14, 2022, with a total of 999 samples. For forecasting accuracy, the data are divided into 75/25 training and testing parts, respectively. The training part, a total of 749 samples, runs from March 12, 2018, to February 23, 2021, and the testing part, of size 250, runs from February 24, 2021, to February 14, 2022. The graphical analysis in Figure 1 shows a dramatic downward trend in the WTI price that continues until May 2020, followed by a most pronounced upward trend from May 12, 2020, to February 14, 2022. In contrast to other months, the seasonal plot in Figure 2 demonstrates that the annual variance is still minimal from January to May for the year 2020 and from September to December for the year 2018. The PP and ADF statistics shown in Table 1 demonstrate that the sequence is not stationary, whereas the Jarque-Bera (J-B) statistic indicates that the data are normal.
B. Data Decomposition and Reconstruction
This study proposes the reconstruction of the PFs obtained from LMD. The PFs from LMD are separated into two components, deterministic and stochastic. The stochastic and deterministic PFs are modeled separately: different models are selected for the stochastic and deterministic components, ARIMA and LSTM models are fitted for every stochastic and deterministic component, and all individual deterministic PFs are then combined for the final forecast. Figure 3 illustrates the suggested method’s entire framework. The WTI price data are decomposed by LMD. For the LMD method, the maximum number of generated function components is 20 and the maximum number of iterations is set to 30. The decomposition results demonstrate that the WTI CO price sequence is composed of three PFs and one residual.
The CO price sequence is divided into three PFs and a residual series, with frequency changing from high to low and a decreasing trend for the residual series, as shown in Figure 3. Then, considering the different influences of the decomposed PFs on the original series, the PFs are reconstructed according to the mutual information of each PF. AMI is used to regroup the PFs via a visual assessment of the plots produced for each dataset’s PFs. From the second PF to the fourth PF, the AMI plots are seen to follow the same pattern; the first four PFs are shown in Figure 4. As a result, the first PF is regarded as stochastic, while the rest are all deterministic. To create the two components, the second to fourth PFs are added together as the deterministic part, and the first PF forms the stochastic part. By adding all deterministic series, a new series called the virtual product function (VPF) is formed and displayed in Figure 5. The stochastic and deterministic graphs are displayed in Figure 6.
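The study judges the AMI plots visually. As an illustrative proxy, the sketch below labels each PF by a simple mutual-information threshold against the original series; the histogram estimator, the threshold value, and the function names are all assumptions, not the paper's procedure.

```python
import numpy as np

def mutual_information(x, y, bins=16):
    """Histogram estimate of the mutual information I(X;Y) in nats."""
    pxy, _, _ = np.histogram2d(x, y, bins=bins)
    pxy = pxy / pxy.sum()
    px = pxy.sum(axis=1, keepdims=True)   # marginal of X
    py = pxy.sum(axis=0, keepdims=True)   # marginal of Y
    mask = pxy > 0
    return float(np.sum(pxy[mask] * np.log(pxy[mask] / (px @ py)[mask])))

def split_stochastic_deterministic(pfs, original, threshold):
    """Label each PF stochastic or deterministic by its mutual information
    with the original series; deterministic PFs are summed into the VPF."""
    stochastic, deterministic = [], []
    for pf in pfs:
        (deterministic if mutual_information(pf, original) > threshold
         else stochastic).append(pf)
    vpf = np.sum(deterministic, axis=0) if deterministic else None
    return stochastic, vpf
```

A PF that carries most of the structure of the original series yields a high mutual information with it, while a noise-like PF yields a low value, which is what motivates grouping the low-information PF as the stochastic component.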
Here, PF1 is considered to be a stochastic component, while PF2, PF3, and PF4 are treated as deterministic. Stochastic and deterministic (SD) components are treated separately and then combined to make SD components.
1) ARIMA Model
The basic assumption for applying the ARMA model is the stationarity of the time series. Successive differences are taken to make the time series stationary [41], [42]. The ADF test is used to determine whether the time series is stationary [43]. After obtaining a stationary time series, an appropriate model is identified by choosing the AR and MA terms in the right order; for this selection, ACF and PACF plots are employed. A 75:25 ratio of data is designated for training and testing, accordingly. The Ljung-Box (LB) test is used for checking the model adequacy. Forecasting is done by fitting the ARIMA model to the training and testing periods; the ARIMA model is fitted using R software. Figure 7 displays the residuals of the WTI ARIMA fitted model with the ACF lag and residual plots.
The first and second plots of the above figure show that the residuals are uncorrelated and that the data became stationary after taking the first difference. The final LB statistics figure demonstrates that the p-values are greater than 0.05 for all datasets [44]; thus, the null hypothesis of no serial autocorrelation among the fitted residuals cannot be rejected. Consequently, these techniques can offer an accurate future projection. The forecasting accuracy of the models is shown below.
For PF1 and PF2, the tests indicate that the residuals are normally distributed and the series are stationary. However, PF3, PF4, and the VPF need to be differenced: PF3 and PF4 require differencing of order 2, and the difference order of the VPF is 1. The Box test results in Table 3 indicate that the p-values are greater than 0.05, the chosen significance level, and hence the hypothesis that the residuals are white noise cannot be rejected; the assumption of no residual autocorrelation is accepted, and the ARIMA models are successfully established. All four PFs are fitted for the LMD-ARIMA model, whereas PF1 (the stochastic component) and the VPF (the deterministic component) are added and fitted for the LMD-SD-ARIMA model.
2) Stack-LSTM
The residuals of LMD-ARIMA and LMD-SD-ARIMA are processed in a hybrid stacked LSTM model for forecasting. In this technique, the output of one LSTM layer becomes the sequence of vectors used as input to the subsequent LSTM layer. As seen in Figure 8, the input layers are used again in the second layer, and multiple hidden layers are stacked on top of one another. A three-dimensional input is required for the LSTM layer; setting `return_sequences=True` enables using a hidden LSTM layer’s 3D output as the next layer’s input. This makes the model deeper and, as a deep learning technique, more accurate. Stacked LSTM networks are made up of many LSTM hidden layers, and several hidden layers form a Deep Recurrent Neural Network (DRNN). Iterative weight updating utilizing training data is crucial for training the LSTM network; the stochastic gradient descent approach employs the Adaptive Moment Estimation (Adam) algorithm for weight updating [45]. For constructing and training the LSTM model, the Keras neural network API [46] is used on top of TensorFlow [47], a machine learning library written in Python and employed for numerical calculations. Keras, in contrast, offers a gentle learning curve. Together, they deepen and improve the model’s accuracy.
In this paper, in our dataset, we represent each trading day of every month as a
Hyperparameters are chosen to increase accuracy and reduce the chance of overfitting the data [49]. The dropout technique, which randomly selects cells within a layer based on a probability such that their output is set to 0, is used to reduce overfitting, as developed by Srivastava et al. [50]. As a result, the ideal dropout rate is chosen as 30%, which gives the lowest MSE. The number of epochs was set to 100 for this test, one epoch equaling one iteration over all the training data processed by the network [51]. Each layer’s LSTM cells were set to the following values: 41, 41, 64, 1, with the decay set to 0.2 and the window length to 22. We divided the training data into batches of size 32 to facilitate the propagation of the training data across the network. This means the network is trained using the first 32 samples (0–31) from the training data, followed by the subsequent 32 samples (32–63). One epoch is complete once all samples have been propagated through the network [49].
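The window length of 22 and the chronological 75:25 split described above can be sketched as follows. The 3-D array shape (samples, timesteps, features) is what a Keras LSTM input layer expects; the function names here are illustrative.

```python
import numpy as np

def make_windows(series, window=22):
    """Turn a 1-D price series into (samples, window, 1) inputs and
    next-day targets, the 3-D shape required by an LSTM layer."""
    X, y = [], []
    for i in range(len(series) - window):
        X.append(series[i:i + window])
        y.append(series[i + window])
    X = np.array(X)[..., np.newaxis]   # (samples, timesteps, features)
    return X, np.array(y)

def chronological_split(X, y, train_frac=0.75):
    """75/25 split that preserves time order, as used for the WTI data."""
    n_train = int(len(X) * train_frac)
    return (X[:n_train], y[:n_train]), (X[n_train:], y[n_train:])
```

Keeping the split chronological (rather than shuffling) matters for time series: the test set must lie strictly after the training set to mimic genuine out-of-sample forecasting.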
To speed up learning, the loss function is used to calculate the difference between the desired output and the LSTM model output during training, evaluated on user-specified validation data, which we have defined as 10% of the training data. We have taken the MSE as the loss function because it is frequently utilized for time series forecasting [52]. We utilized the Adam optimizer to train the LSTM model due to its superior results and quick convergence compared with other optimizers; the authors in [49] advised using it as the default. We set the decay to 0.3 while using the Adam optimizer. With the dropout set to 30% and Adam’s decay set to 0.3, Figure 10 depicts the training and validation loss of our LSTM model using the best hyperparameter setting. Figure 10 shows that the model fits the data most closely and that the MSE loss decreases with increasing epoch values.
Results and Discussions
The proposed new hybrid forecasting model, LMD-SD-ARIMA-LSTM, is compared with the ARIMA, LSTM, LMD-ARIMA, LMD-SD-ARIMA, and LMD-ARIMA-LSTM methods for predicting the WTI CO price. Root mean square error (RMSE), mean absolute error (MAE), and mean absolute percentage error (MAPE) are all calculated on the WTI data to evaluate the efficiency of the models. Table 5 provides an overview of each model’s performance accuracy.
In comparison to the other models, the hybrid LMD-SD-ARIMA-LSTM achieved the lowest RMSE of 0.150; as a result, this technique is useful for predicting the price of crude oil. Second, for the conventional econometric model, the accuracy measures of the individual model (i.e., ARIMA) and the decomposition-ensemble models (i.e., LMD-ARIMA and LMD-ARIMA-LSTM) are compared, and the error measures of the latter are lower than those of the former. Likewise, for the machine learning model, the three accuracy measures of the individual model (i.e., LSTM) and the decomposition-ensemble models (i.e., LMD-ARIMA-LSTM and LMD-SD-ARIMA-LSTM) are compared, and the decomposition-ensemble models’ error measures are lower than those of the individual model. This shows that, after decomposition and reconstruction, the performance of the conventional econometric model for predicting oil prices can be improved. The LSTM model fitting of the LMD-ARIMA and LMD-SD-ARIMA residuals is shown in Figure 11. The predicting results of the seven separate models for the WTI crude oil price data are shown in Figure 12 as relative error histograms.
Twenty days of forecasting results for all seven models are shown in Figure 13 which are compared with 20 days of original WTI oil prices from
Moreover, Table 6, shows that all the models are significantly different from each other.
Figures 14 and 15 compare the proposed model’s errors to those of the other models and provide a graphic comparison of the oil price forecasts of all seven models. The forecasting performance of LMD-SD-ARIMA-LSTM is the best, as shown by these figures. Additionally, compared to the short- and medium-term forecasts, the LMD-SD-ARIMA-LSTM technique performs better in the long-term forecast.
For improved presentation, the actual data are now compared with the forecasts partitioned into five-day steps. For the testing data set, the original values, the predicted values five days ahead, and their residuals are compared and presented in Figure 16.
The average error for each step is analyzed separately. Usually, the RMSE changes over the extrapolated periods due to uncertainties; as predicted, the RMSE tends to decrease at each step.
The RMSE for all five steps is presented in Figure 17.
Conclusion
Crude oil is a non-renewable power source and the lifeblood of contemporary industry. The stability of the global economic market benefits from accurate crude oil price forecasting. The LMD-SD-ARIMA-LSTM hybrid prediction approach discussed in this study is based on the LMD, ARIMA, and LSTM methodologies. The proposed hybrid technique is validated using the WTI CO prices. This study reduces the required effort by collecting the stationary and non-stationary PFs into stochastic and deterministic components, with improved accuracy. The investigation demonstrates that, in comparison to the other five approaches, the novel hybrid method significantly increases the prediction accuracy of the CO price. Additionally, the results demonstrate that the conventional econometric model can enhance oil price prediction accuracy following decomposition and reconstruction. Moreover, the new hybrid forecasting system performs better when predicting the CO price over the medium and long term. Meanwhile, accurate predictions can provide reasonable advice to relevant departments in order to make correct decisions.
A. Limitations of the Study
In this study, we only used the univariate time series data.
B. Future Recommendations
The current study can be extended using LSTM based on EEMD and other decomposition methods. Moreover, the current study can be extended to bivariate and multivariate data.
In the future, some other traditional econometric forecasting models and other machine learning methods will be explored and studied. The factors influencing crude oil prices will also be taken into account, and it will be further investigated to see if the novel hybrid forecasting approach is appropriate for multi-variate forecasting of crude oil prices.