Journals & Magazines >IEEE Access >Volume: 8

Accurate Prediction Scheme of Water Quality in Smart Mariculture With Deep Bi-S-SRU Learning Network

The steps of the scheme are as follows: (1) A series of improved interpolation, smoothing and wavelet transform filtering techniques are used to repair, correct and denoi...

Abstract:

In the smart mariculture, the timely and accurate predictions of water quality can help farmers take countermeasures before the ecological environment deteriorates seriou...Show More

Metadata

Abstract:

In the smart mariculture, the timely and accurate predictions of water quality can help farmers take countermeasures before the ecological environment deteriorates seriously. However, the openness of the mariculture environment makes the variation of water quality nonlinear, dynamic and complex. Traditional methods face challenges in prediction accuracy and generalization performance. To address these problems, an accurate water quality prediction scheme is proposed for pH, water temperature and dissolved oxygen. First, we construct a new huge raw data set collected in time series consisting of 23,204 groups of data. Then, the water quality parameters are preprocessed for data cleaning successively through threshold processing, mean proximity method, wavelet filter, and improved smoothing method. Next, the correlation between the water quality to be predicted and other dynamics parameters is revealed by the Pearson correlation coefficient method. Meanwhile, the data for training is weighted by the discovered correlation coefficients. Finally, by adding a backward SRU node to the training sequence, which can be integrated into the future context information, the deep Bi-S-SRU (Bi-directional Stacked Simple Recurrent Unit) learning network is proposed. After training, the prediction model can be obtained. The experimental results demonstrate that our proposed prediction method achieve higher prediction accuracy than the method based on RNN (Recurrent Neural Network) or LSTM (Long Short-Term Memory) with similar or less time computing complexity. In our experiments, the proposed method takes 12.5ms to predict data on average, and the prediction accuracy can reach 94.42% in the next 3~8 days.

The steps of the scheme are as follows: (1) A series of improved interpolation, smoothing and wavelet transform filtering techniques are used to repair, correct and denoi...

Published in: IEEE Access ( Volume: 8)

Page(s): 24784 - 24798

Date of Publication: 03 February 2020

Electronic ISSN: 2169-3536

DOI: 10.1109/ACCESS.2020.2971253

Funding Agency:

Contents

CCBY - IEEE is not the copyright holder of this material. Please follow the instructions via https://creativecommons.org/licenses/by/4.0/ to obtain full-text articles and stipulations in the API documentation.

SECTION I.

Introduction

In the mariculture, water quality is one of the important factors that affect fish production. However, water quality is subject to change, because it is affected by many factors, such as fish density, feed, climate, and more. The drastic change of water quality can disrupt the balances of algae and bacteria phases. The unbalance of ecological environment can lead to serious consequences, such as the physiological stress, the disease, and even the massive death of fish. An accurate and real-time prediction of water quality parameters can help farmers take measures to adjust water quality in advance if necessary in order to ensure a suitable breeding environment. These measures can improve the efficiency of fish production. Furthermore, through the accurate prediction and the timely adjustment of water quality, the use of drugs can be reduced, which is of great significance for green and precision agriculture.

A. Related Work and Motivation

The collected water quality data usually needs to be preprocessed for data cleaning. Gao et al. proposed to repair the data by using linear interpolation and mean value smoothing [1]. The system clustering method and principal component analysis method have been used for feature selection to achieve dimensionality reduction of the input data in the prediction model. Finally, they employed wavelet denoising technology to deal with the key influencing factors. Zhang et al. proposed a missing data filling method based on convolutional neural network, which has been used to fill data with temporal correlation between time-series and spatiotemporal correlation between sensor nodes [2]. Yang et al. proposed a data preprocessing method based on feature extraction and clustering. The Lasso algorithm and K-Means algorithm were used to extract and cluster the temperature data respectively, which have greatly improved the prediction accuracy of temperature [3]. Xia et al. proposed the optimal mixed imputation (OMI) algorithm for missing data filling [4]. Maria et al. proposed a preprocessing method for decomposing meteorological data using wavelet decomposition and principal component analysis [5]. Though the aforementioned methods can improve the precision of data preprocessing, their structures are complex and difficult to be implemented. Meanwhile, traditional linear interpolation methods have breakpoint phenomena in actual interpolation, while the mean smoothing method can only be used for the dataset with less deviation.

Next, Pearson correlation coefficient has been applied to analyze the correlation between the predicted water quality parameters and other water quality parameters. Advanced integration method as spatial cross-correlations [6] can remedy some shortcomings of Pearson correlation coefficient method such as inaccuracy and fluctuation of correlation analysis results when objects to be processed are insufficient. Considering the abundant experimental objects in our paper, the relatively simple Pearson correlation coefficient method is going to be imported in our experiments.

For water quality prediction, the major approaches include time series method [7]–[9], Markov method [10], grey system theory method [11] and support vector regression machine method [12], [13]. However, these methods have some drawbacks, such as weak generalization ability, low computational efficiency and unstable prediction accuracy. Hence, they cannot meet the ever-increasing requirements in precision agriculture. In recent years, the prediction methods based on ANN (Artificial Neural Network) and deep learning have been proposed [14], [15]. They have the advantages of good robustness, high fault tolerance and sufficient fitting of complex nonlinear relations. Liu et al. used BP neural network to predict multi-scale water temperature based on empirical mode [16]. Han et al. established a water quality prediction model in wastewater treatment based on an improved radial basis function neural network with flexible structure [17]. Miao et al. used Levenberg Marquardt (LM) neural network and genetic algorithm to build a dissolved oxygen prediction model [18]. What’s more, prominent water quality prediction models based on LSTM have also been constructed [11], [19], [20].

B. Main Contributions of the Paper

In this paper, we design a procedure to fullfill the prediction of the key water quality parameters. To improve the data cleaning in the preprocessing stage, the fixed threshold method is used to discard the abnormal individual data, and the mean proximity mehthod is used to complete the collected data. Then, the wavelet analysis and improved smoothing method are used for noise reduction and error correction respectively. Next, the Pearson correlation coefficient method is employed to discover the correlation between the key water quality parameters. In the prediction phase, combined with the results after preprocessing and the obtained correlation prior, the prediction model based on our proposed Bi-S-SRU deep learning network is used to predict the key water quality parameters.

The Bi-S-SRU model is proposed to improve the RNN [21], LSTM [22] and SRU [23], [24] network structures. It has the advantages of simple structure, fast convergence, and good stability. Our proposed Bi-S-SRU model is mainly composed of two stages. The first stage is the preprocessing of collected water quality data. The second stage is the construction of the Bi-S-SRU-based water quality prediction model. In addition, we discuss the prediction results of different water quality parameters in same environment setting, and compare the Bi-S-SRU-based method with three other aforementioned methods.

Our main contributions can be summarized as follows:

In the data preprocessing, the proposed mean proximity method and improved smoothing method can accurately complete and correct the water quality data to be repaired, which solves the breakpoint phenomenon and increases the accuracy of data cleaning.
The Bi-S-SRU deep learning network is proposed, which can integrate the future context information into the prediction of the current time point data. Meanwhile, according to the existing dynamic model, the degree of correlation between important parameters is analyzed. According to the results of correlation analysis, the training data of the learning model are multiplied by the corresponding weight coefficient.
An overall scheme for accurately predicting water quality parameters is proposed. This scheme uses the pre-processed data and correlation priors to train a Bi-S-SRU model to obtain a prediction model. The prediction model is then used to predict key water quality parameters in aquaculture.
We build and expose a large raw data set collected in time series, which contains water quality and climate environment data at 23,204 time nodes.

C. Paper Organization

The rest of this paper is arranged as follows. Section II gives the acquisition method of data and the outline of the proposed scheme. Section III introduces the preprocessing process of water quality. Section IV presents the network structure of Bi-S-SRU and the construction of prediction model. In Section V, we analyze and discuss the experimental results. Section VI summarizes our work and illustrates future works.

SECTION II.

Materials and Overview of Methodology

A. Acquisition of Data

In our investigation, we conduct our study by using real data collected in the marine aquaculture base in Xincun Town, LingShui County, Hainan Province, China. Fig. 1 illustrates the water quality data acquisition module, transmission module, cloud server module, and terminal display module in the IoT system. The IoT hardware system mainly includes one multi-sensor node, one wind power generation device, one set of solar power panels, one 4G industrial routing module, one wind-solar complementary controller, one local storage module and one wireless transmission module. From Fig. 1, the IoT system realizes the data acquisition, transmission, cloud storage of the data, business logic development, intelligent prediction analysis and calculation, and related application services.

FIGURE 1.

The topology structure diagram of the smart mariculture IoT system.

MIT Libraries

MIT Libraries

Accurate Prediction Scheme of Water Quality in Smart Mariculture With Deep Bi-S-SRU Learning Network

Alerts

Abstract:

Metadata

Abstract:

Funding Agency:

Introduction

A. Related Work and Motivation

B. Main Contributions of the Paper

C. Paper Organization

Materials and Overview of Methodology

A. Acquisition of Data

B. Overview of Scheme

Preprocessing of Water Quality Data

A. Threshold Processing and Data Completion Method Selection

B. Data Filtering Method Selection

1) Moving Average Filter Method

2) Median Filter Method

3) Wavelet Transform Method

4) Comparison of Data Filtering Methods

C. Error Correction Method Selection

D. Correlation Analysis

Proposed Bi-S-SRU Based Prediction Model

A. Principle of SRU Deep Learning Model

B. Principle of Bi-S-SRU Deep Learning Model

Definition 1 [MAE (Mean Absolute Error)]:

Definition 2 [RMSE (Root Mean Squared Error)]:

Definition 3 [MAPE (Mean Absolute Percent Error)]:

C. Construction of Bi-S-SRU Prediction Model and Metrics Analysis

Experimental Results and Discussions

A. Comparison of Prediction Effects for Different Parameters

B. Comparison of Training Time for RNN, LSTM, SRU and Bi-S-SRU

Conclusion and Future Work

Author Contributions

Appendix

Appendix

References

IEEE Account

Purchase Details

Profile Information

Need Help?