Introduction
With the application of technologies such as massive multi-input multi-output transmission, millimeter-wave (mm-mave) communication and non-orthogonal multiple access scheme, 5G mobile communication has significantly increased the system capacity and supported massive connections [1]. However, the 5G network is still ground based [2]. Vast communication demand at sea is greatly challenged due to the limited 5G network coverage. Thus, the envisaged 6G network is expected to provide global wireless connectivity from space to underwater. As a complementary technology for terrestrial communication, optical wireless communication, like laser communication or visible light communication, can be used to support underwater communication and expand the wireless coverage [3].
Underwater optical wireless communication (UOWC) employs the visible light spectrum (mainly the blue-green wavelength) for data transmission. It can provide ultra-high data rate, low time latency and high security [4]. Besides UOWC, underwater acoustic communication and radio frequency (RF) communication can also be used for underwater wireless communication. The underwater acoustic technology is the widely used method for underwater communication in the past decades. It can support long-distance transmission up to several tens of kilometers [5]. However, using sound for the underwater signal transmission is not simple since the acoustic channel is time-varying and space-varing. The extremely low data rate (only Kbps) also limits its application. Compared to the acoustic communication, RF system can provide a higher data rate up to tens of Mbps. As the radio wave can easily pass through the air-water interface, the underwater communication and terrestrial communication can be switched over smoothly [6]. The defect of this technology is that the distance is confined in the range of several meters due to the conductivity of the water. In comparison with its two counterparts, UOWC is an extremely competitive approach for underwater wireless communication, which provides a tradeoff between the transmission distance and the data rate. With UOWC technologies, moderate distance in the range of tens of meters can be supported while the data rate can reach up to Gbps [7], [8].
Nevertheless, using optical wireless communication in underwater also faces several challenges. In underwater environment, optical signal suffers from severe water absorption and scattering. The interaction between photons and water molecules or other particulate matters causes that the propagation characteristics of the light beam becomes extremely complicated and varies with different water body. As a result, it is elusive to mathematically describe the underwater optical channel accurately and thereafter so is the signal detection. In spite that Radiative Transfer Equation (RTE) provides a theoretical model to describe the absorption and scattering effect in water, deriving an analytical solution is bristling with difficulties. Besides that, improving the transmission distance from tens of meters to hundreds of meters in UOWC is an another challenging work. Using single photon avalanche photondiode (SPAD) as the detector provides a potential solution. Compared to the conventional avalanche photondiode (APD), SPAD is more sensitive so that a single photon can be detected [9]. However, the dead time contributes to the nonlinearity between the average input photon and the average output voltage. As a result, the photon detection exhibits non-Poisson despite the photons arrival is a Poisson process [10].
To deal with the complexity of the mathematical modeling and overcome the nonlinear distortion of SPAD for signal detection in UOWC, deep learning can be used for this physical-layer communication [11]. As a subbranch of machine learning, deep learning can provide a data-driven method to solve this problem by extracting the feature of large amounts of training data and optimizing the system performance [12]. In such a system, the mathematical model for the processing block is optional and can be replaced by the deep learning network. Although whether the deep learning based wireless communication outperforms the traditional one in terms of the performance improvement and complexity is an open issue, it is believed that the deep learning based algorithm can be executed faster with lower power consumption compared to its counterpart [11].
Recently, the deep learning method has been applied in optical wireless communication to address the physical-layer issues. In [13], a Gaussian kernel-aide deep neural network is applied for 8-pulse amplitude modulation based signal detection to compensate for the nonlinear distortion in an underwater optical system. To mitigate the high peak-to-average power ratio (PAPR) of the optical orthogonal frequency division multiplexing (O-OFDM) based visible light communication system, an autoencoder network is proposed with efficient learning and end-to-end performance optimization [14]. Similar autoencoder network framework is proposed for binary signal designing in LED-based visible light communications [15]. The cost function is deliberatedly designed to satisfy the target dimming constraints. In [16], three demodulators based on convolution neural network, deep belief network and adaptive boosting are compared in a hardware prototype of an end-to-end VLC system with practical datasets.
Against this background, in this paper, a novel deep learning aided signal detection scheme is proposed for SPAD based UOWC system. A two-connected multilayer perceptron (MLP) network as the deep learning architecture is integrated into the signal processing blocks. The first block of the connected MLP network can be regarded as the channel compensation block while the second one works as a demodulator. Compared to existing channel compensation or channel equalization methods, the channel side information is no longer needed for the deep learning aided signal detection. With sophisticated numerical optical channel model and SPAD non-Poisson model, large amounts of training data can be utilized to train the proposed network. Our simulation results show that the deep learning method can address the optical channel distortion as well as SPAD non-Poisson distortion, and thereafter improve the system performance.
The remainder of this paper is organized as follows: In Section II, we describe the characteristics of the UOWC systems, including the inherent optical property, the underwater optical channel and the SPAD nonlinear detection. The deep learning aided signal detection scheme is introduced in Section III. Section IV presents the simulation results to validate our proposed scheme. Finally, the conclusions are drawn in Section V.
System Characteristics
A. Inherent Optical Property
The light propagation characteristics in water is quite different from that in the free space since the photons interact with the particles in water more severely and frequently. As is shown in Fig. 1, when the light moves in water, some parts of the incident beam are absorbed in the water and some parts are scattered by the particles. The absorption effect as well as the scattering effect is noticeable in water and would cause the light attenuation.
To mathematically describe these effect, two inherent optical properties of the light are introduced, namely the absorption coefficient \begin{equation*} c(\lambda)=a(\lambda)+b(\lambda),\tag{1}\end{equation*}
Volume scattering phase function is an another inherent optical property related to the scattering effect. It is defined as the proportion of the light scattered out from the incident beam with a solid angle \begin{equation*} \beta (\Omega)=\lim _{d l\to 0}\lim _{d \Omega \to 0}\frac {L_{s}(\Omega)}{L_{i} dl d \Omega },\tag{2}\end{equation*}
The volume scattering phase function indicates that the scattering in water is non-isotropic and highly forward. Some field experiments have already been done to measure this phase function for the selected realistic water types [17]–[19]. However, these collected data cannot be used directly for theoretical analysis. Thus, several types of analytical models have been proposed. The Henyey-Greenstein phase function is widely used due to its simple expression [20]. Nevertheless, this function is inadequate to represent the realistic light forward scattering at small angle and backscattering at large angle in water. To better describe the realistic light scattering, the two-term Henyey-Greenstein phase function is proposed by summing of two Henyey-Greenstein phase function with different weights [21]. Another more complicated analytical model is the Fournier-Forand marine phase function, which fits well with the Petzold experiment data for San Diego harbor [22].
Based on the volume scattering phase function, the scattering coefficient \begin{equation*} b=\int _{0}^{4\pi }\beta (\Omega)d\Omega =2\pi \int _{0}^{\pi }\beta (\varphi)\sin \varphi d\varphi.\tag{3}\end{equation*}
\begin{equation*} \hat {\beta }(\varphi)=\frac {\beta (\varphi)}{b}.\tag{4}\end{equation*}
B. Underwater Optical Channel
Based on the attenuation coefficient, the Beer Law is used to model the underwater optical loss, which is formulated as [4] \begin{equation*} I(L)=I(0)\exp (-cL),\tag{5}\end{equation*}
The expression of the Beer Law is simple to be used for optical link budget. However, it is not accurate since the photons scattered out of the incident beam are not taken into account when calculating the radiance of the received light. Instead, in consideration of the energy conservation, a more complex but accurate equation, called RTE, is derived to theoretically describe all photons moving through the water along a path toward a given direction. For brevity, the time-independent and source-free RTE can be given by [23] \begin{align*}&\hspace {-0.5pc}\cos \varphi _{2}\frac {dI(\varphi _{2},\theta _{2})}{dl}=-cI(\varphi _{2},\theta _{2}) \\&+\!\!\int _{0}^{\pi }\!\!\int _{0}^{2\pi }\beta \big (\langle \varphi _{1},\theta _{1}\rangle,\langle \varphi _{2},\theta _{2}\rangle \big)I(\varphi _{1},\theta _{1})\sin \varphi _{1}d\theta _{1}d\varphi _{1}.\tag{6}\end{align*}
Since the equation involves the integrals and derivatives, deriving an exact analytical solution is extremely difficult. Therefore, the Monte Carlo approach has been proposed to numerically evaluate the light radiance when the light passes through water [24]. In this numerical approach, the trajectories of all photons from the light source are simulated. As shown in Fig. 2, each photon would be scattered by the materials in water for several times. During each scattering time, the photon would change its moving direction randomly. The elevation angle \begin{equation*} \left [{ \begin{matrix} \hat {e}_{x} \\ \hat {e}_{y} \\ \hat {e}_{z} \end{matrix} }\right]\!=\!\! \left [{ \begin{matrix} \sin \varphi _{i} \cos \theta _{i} \\ \sin \varphi _{i} \sin \theta _{i} \\ \cos \varphi _{i} \end{matrix} }\right]\!\!\left [{ \begin{matrix} \dfrac {e_{x}e_{z}}{\sqrt {1-e_{z}^{2}}} & \dfrac {-e_{y}}{\sqrt {1-e_{z}^{2}}} & e_{x} \\ \dfrac {e_{y}e_{z}}{\sqrt {1-e_{z}^{2}}} & \dfrac {-e_{x}}{\sqrt {1-e_{z}^{2}}} & e_{y}\\ -\sqrt {1-e_{z}^{2}} & 0 & e_{z} \end{matrix} }\right],\tag{7}\end{equation*}
\begin{equation*} r_{i}=-\frac {1}{c}\ln u_{i},\tag{8}\end{equation*}
\begin{equation*} w_{i}=\alpha ^{i}w_{0},\tag{9}\end{equation*}
The Monte Carlo approach gives us a numerical underwater optical channel model. In this model, the channel impulse response varies greatly with different water types, link distance and receiver characteristics. Nevertheless, in most cases, the channel time dispersion can be neglected. Even when the light travelling distance is very long and the water is turbid, the maximum delay is less than 12 ns [25]. Thus, the signal suffers from low inter-symbol interference.
C. SPAD Nonlinear Detection
Compared to APD, SPAD has higher sensitivity and can detect single photon as a photon counting receiver. It is useful for signal detection in long distance transmission or low optical power scenarios. SPAD is working in Geiger-mode that an APD is biased with an excess bias voltage above the breakdown voltage. When a photon arrives at the SPAD, an avalanche can be triggered. This avalanche event reduces the SPAD voltage below the breakdown voltage. Before detecting a second photon, SPAD has to be recharged till its voltage is raised to the original bias level with the quenching circuits. This period between the starting of the avalanche event and the recovery of the bias voltage is called as the dead time. Typically, there are two types of quenching circuits configuration that have an impact on the dead time, namely the active quenching (AQ) circuits and the passive quenching (PQ) circuits. For an AQ SPAD, with the use of the AQ circuit, SPAD would be recharged rapidly to the original bias level even when SPAD is fired by another photon in the recharge process. As a result, the dead time would keep constant. While for a PQ SPAD, it can still be triggered by another photon as if the SPAD voltage is above the breakdown voltage. In other words, anther photon arriving during the recharge process can paralyze the device and extend the dead time.
Note that the photon arriving can be modeled as a Poisson process. The probability of detecting \begin{equation*} P_{\text {ideal}}(k)=\frac {(\lambda _{p} T)^{k}e^{-\lambda _{p} T}}{k!},\tag{10}\end{equation*}
When the dead time is considered, the photon detection no longer follows the Poisson distribution. Typically, the dead time
For an AQ SPAD, its probability of the detected photon is given by [10] \begin{align*} P_{\text {AQ}}(k)=\begin{cases} \sum \limits _{i=0}^{k}\dfrac {\lambda _{p}^{i}(T-(k+1)\tau)^{i}}{i!}e^{-\lambda _{p}(T-(k+1)\tau)}\\ \quad - \sum \limits _{i=0}^{k-1}\dfrac {\lambda _{p}^{i}(T-k\tau)^{i}}{i!}e^{-\lambda _{p}(T-k\tau)}, &\!\!\!\!\!\!\!\!\!\mathrm {if}\;{k\!< \! k_{\max }},\\ 0,&\!\!\!\!\!\!\!\!\!\mathrm {if}\;{k\geq k_{\max }}. \end{cases} \\{}\tag{11}\end{align*}
\begin{align*}&\hspace {-1pc}E_{\text {AQ}}(k)=(k_{\max }-1) \\&\qquad ~\,\,-\sum \limits _{k=0}^{k_{\max }\!-\!2}\sum \limits _{i=0}^{k}\frac {\lambda _{p}^{i}(T\!-\!(k\!+\!1)\tau)^{i}}{i!}e^{-\lambda _{p}(T-(k+1)\tau)}.\tag{12}\end{align*}
While for a PQ SPAD, the probability of the detected photon is given by [10] \begin{align*} P_{\text {PQ}}(k)=\begin{cases} \sum \limits _{i=k}^{k_{\max }}(-1)^{i-k}\dfrac {\lambda _{p}^{i}(T-i\tau)^{i}}{k!(i-k)!}e^{-i\lambda _{p}\tau }, &\mathrm{if}\;{k\!< \! k_{\max }},\\ 0,&\mathrm{if}\;{k\!\geq \! k_{\max }}. \end{cases} \\{}\tag{13}\end{align*}
\begin{equation*} E_{\text {PQ}}(k)=\lambda _{p} e^{-\lambda _{p}\tau }(T-\tau).\tag{14}\end{equation*}
From (12) and (14), it can be observed that the dead time could cause the nonlinearity between the arrived photons and the detected photons. To mitigate this nonlinear impact of the dead time, a SPAD array with multiple elements can be used. When photons arrive at the SPAD array, some elements would be triggered while others are still active for another photon detection. However, due to the limited number of the elements in the array, the nonlinearity cannot be eliminated completely [9].
System Design
In traditional wireless communications, processing blocks such as modulation, channel estimation and channel equalization rely on the mathematically expressed models. However, in some practical scenarios, the interaction between the signal and the environment becomes extremely complex and establishing such mathematical models are intractable [26], [27]. Combining the deep learning with the mature existed mathematically expressed model or associated algorithm for processing block can boost the intelligent communication and achieve comparable performance gains [28]. In this section, a novel signal detection scheme is proposed where the MLP network is integrated into the SPAD based UOWC system with direct current biased optical orthogonal frequency division multiplexing (DCO-OFDM) employed for signal modulation. Among various deep learning network architectures, such as MLP network, convolutional neural network (CNN), recurrent neural network (RNN) and others built upon them, the MLP network is adopted in our work. This is because applying MLP network into signal processing can achieve comparable system performance compared with the CNN network and RNN network but with lower computational time [29]. Besides, the architecture of MLP network is simple and easy to be trained compared to the RNN network where the gradient vanishing problem is hard to deal with [30].
A. SPAD Based UOWC System With DCO-OFDM Modulation
The block diagram of the SPAD based UOWC system is illustrated in Fig. 3. Here, the DCO-OFDM modulation scheme is considered due to its high spectrum efficiency. Specifically, the transmitted bits \begin{equation*} X_{k} = X^{*}_{N-k}, \quad k = 0, 1,\ldots,N/2 - 1,\tag{15}\end{equation*}
\begin{align*} x_{n}=&\frac {1}{\sqrt {N}}\sum _{k=0}^{N-1} X_{k}e^{j\frac {2\pi }{N}nk}\qquad \\=&\frac {1}{\sqrt {N}}\sum _{k=1}^{\frac {N}{2}-1}\left[{X_{k}e^{j\frac {2\pi }{N}nk}+X^{*}_{k} e^{-j\frac {2\pi }{N}nk}}\right] \qquad \\=&\frac {1}{\sqrt {N}}\sum _{k=1}^{\frac {N}{2}-1}\text {Re}\left({X_{k} e^{j\frac {2\pi }{N}nk}}\right), \quad n = 0,1,\ldots, N - 1,\tag{16}\end{align*}
At the receiver, to increase the link distance, SPAD is used as the photon detector. The received signal can be expressed as \begin{equation*} y_{n}=g_{2}(g_{1}(x^{\text {(opt)}}_{n}))\tag{17}\end{equation*}
Conventionally, after removing CP, fast Fourier transformation (FFT) is conducted to obtain the frequency symbols from the sampled data. The frequency symbols are sequently equalized and demodulated to recover the original information. Unlike this traditional method, in this paper, a two-connected MLP network is applied to the signal detection by employing the features of the received signals suffering from the distortion of the underwater optical channel and the detector.
It should be mentioned that in machine learning, the training data have a great impact on the performance of the deep learning network and finding such high-quality training data is tricky. Fortunately, obtaining a dedicated training set for network training is much easier in the field of wireless communications since there are many sophisticated models to generate simulated data that match the practical data well, especially in the physical layer. In our scheme, the numeric underwater optical channel model and non-Poisson SPAD model detailed in the previous section are adopted for data generation.
B. Deep Learning Aided Scheme
The architecture of the two-connected MLP network is illustrated in Fig. 4. Instead of using one MLP network, two MLP subnetworks are connected intuitively motivated by the signal processing model to improve the training accuracy and reduce the complexity. The first subnetwork of the connected MLP network can be regarded as the channel compensation block while the second one can be regarded as a demodulator. Compared to existing channel compensation or channel equalization methods, the channel side information is no longer needed for the deep learning aided scheme. The loss functions used here to train the network are detailed in the next subsection.
The inner structure of each subnetwork is further shown in Fig. 5. Each subnetwork consists of one input layer, multiple hidden layers and one output layer. The input layer is the first layer of the MLP subnetwork. Its input is the training data. The last layer of the subnetwork is the output layer which output the data that approaches the expected value. The hidden layers are some special layers where the network does not indicate the exact form or value should be output. They are just used to extract the features of the signals and find the inner connection among them. The output of the previous layer is also the next input of the next hidden layer. Accordingly, the output of each subnetwork can be expressed as \begin{equation*} \hat {s}_{n}=f_{N_{l}}(f_{N_{l}-1}(\cdots f_{1}(u_{n}))),\tag{18}\end{equation*}
\begin{equation*} \hat {h}_{l,q}=\sum _{p=1}^{P_{l}}w_{l,p,q}h_{l,p}+b_{l,q},\tag{19}\end{equation*}
\begin{equation*} \bar {h}_{l,q}=\gamma _{l,q}\frac {\hat {h}_{l,q}-\mu _{l,q}}{\sqrt {\sigma ^{2}_{l,q}+\eta }}+\beta _{l,q},\tag{20}\end{equation*}
Next, the activation function is introduced to obtain the final output of this layer. There are two kinds of activation functions to be used in our proposed network. The first activation function is the relu function, which is used in all hidden layers and given by \begin{equation*} \sigma _{\text {relu}}(\bar {h}_{l,q})=\max (0,\bar {h}_{l,q}), \quad l=1,2,\cdots,N_{l}-1.\tag{21}\end{equation*}
\begin{equation*} \sigma _{\text {sigmoid}}(\bar {h}_{N_{l},q})=\frac {1}{1+e^{-\bar {h}_{N_{l},q}}}.\tag{22}\end{equation*}
C. Offline Training
Generally, the purpose of offline training of neural network is to reduce the loss function by optimizing the parameters of the MLP network. Here, the mean square error is adopted as the loss function, which is defined as \begin{equation*} l=\frac {1}{n}\sum _{i=1}^{n}(\hat {s}_{n}-s_{n})^{2},\tag{23}\end{equation*}
The training procedure of the deep learning aided signal detection is summarized in Algorithm 1. The first MLP subnetwork is trained with the back-propagation method, which involves the gradient descent algorithm [32]. There are different kinds of gradient algorithms applied into deep learning network. Since the learning rate determines the converging rate of the deep learning network, a self-adaptive learning rate algorithm, called Adam algorithm, is used in our offline training [33]. Adam algorithm exploits the exponential moving averages of the gradient and the squared gradient to scale the learning rate. The weights and bias are updated by \begin{align*} w_{l,p,q}^{[k]}\leftarrow&w_{l,p,q}^{[k-1]}-\epsilon \frac {m_{l,p,q}^{[k]}}{\sqrt {n_{l,p,q}^{[k]}}+\delta },\\ b_{l,q}^{[k]}\leftarrow&b_{l,q}^{[k-1]}-\epsilon \frac {\hat {m}_{l,p,q}^{[k]}}{\sqrt {\hat {n}_{l,p,q}^{[k]}}+\delta },\tag{24}\end{align*}
\begin{align*} m_{l,p,q}^{[k]}\leftarrow&\frac {\alpha _{1}m_{l,p,q}^{[k-1]}+(1-\alpha _{1})g^{[k]}_{w}}{1-\alpha _{1}^{k}}, \\[-2pt] n_{l,p,q}^{[k]}\leftarrow&\frac {\alpha _{2}n_{l,p,q}^{[k-1]}+(1-\alpha _{2})(g^{[k]}_{w})^{2}}{1-\alpha _{2}^{k}},\\[-2pt] \hat {m}_{l,p,q}^{[k]}\leftarrow&\frac {\alpha _{1}\hat {m}_{l,p,q}^{[k-1]}+(1-\alpha _{1})g^{[k]}_{b}}{1-\alpha _{1}^{k}},\\[-2pt] \hat {n}_{l,p,q}^{[k]}\leftarrow&\frac {\alpha _{2}\hat {n}_{l,p,q}^{[k-1]}+(1-\alpha _{2})(g^{[k]}_{b})^{2}}{1-\alpha _{2}^{k}}.\tag{25}\end{align*}
After the first MLP subnetwork is trained, the paraments and its structure is retained. The second MLP subnetwork is then trained with the same method. Once the training of the two subnetworks is completed, the whole network structure with its trained parameters are embedding into the receiver. In the online processing procedure, the received signal is fed to the MLP network. With simple mathematical calculation, the original information can be recovered in a forward propagation manner.
Performance Evaluation
In this section, some simulation results are carried out to test the performance of the proposed scheme. The clear ocean and coastal water are chosen as two typical homogeneous water types. Their absorption, scattering and attenuation coefficients are given in Table 1. The dominant background noise is the dark count noise while the noise from the ambient light is not considered in the simulation. In the optical communication system configuration, a commercial laser diode with the maximum optical power of 85 mW is used at the transmitter while a SPAD array with 64 elements is adopted as the photon detector. For each SPAD element, the dead time is 5 ns. Besides, a 4-QAM based DCO-OFDM with 16 subcarriers is utilized to modulate the transmitted information. The sampling rate for the DCO-OFDM symbol at the receiver is set as 10 MHz. Correspondingly, the photon counting interval is 100 ns and the maximum number of counted photons in a single SPAD element is 21 during a photon counting interval. The specifical photon number in each SPAD element is then generated with the non-Poisson SPAD model (i.e., (11) or (13)). Therefore, the detected photons for each signal sample can be obtained by summing up the detected photons from each SPAD element. In addition, the photon energy for blue light is
In the MLP network configuration, as the subnetworks work as different signal processing blocks, the number of neurons in their hidden layers are different. In the first subnetwork, there are two hidden layers. The neurons in each layer are 512 and 256, respectively. As the first subnetwork is used for channel compensation, the number of its input data and output data are the same, which equal the number of subcarriers in a DCO-OFDM symbol (i.e., 16). On the other hand, for the second subnetwork, there are two hidden layers with 64 neurons in each layer. Since its input is the output of the first subnetwork, the number of the input data is also 16. The number of its output is set as 14 since the transmitted bits are modulated only on the half of subcarriers of a DCO-OFDM symbol except for the subcarriers
To validate the effectiveness of our proposed deep learning aided signal detection scheme, two other schemes are utilized as two baselines. In the first scheme, after FFT operation with the received signal, the frequency-domain signal is obtained and be demodulated with the logarithmic likelihood ration (LLR) based soft-decision directly. In the scheme, this demodulation block is called as the LLR demodulator. While in the second scheme, a MLP network is combined with the LLR demodulator. This MLP network is only used for channel compensation to mitigate the distortion caused by the optical channel and SPAD. Its architecture is the same as that of the first subnetwork in the proposed scheme. Note that the light moving distance is used in the simulation instead of the signal-to-noise ratio. This is because the photons’ interaction with the particles in the water lead to a complex relationship between the distance and the attenuation in UOWC. The impact of the distance is considered afterwards.
A. Performance Comparison
Fig. 6(a)-(f) demonstrate the performance comparison of the proposed scheme with the two baseline on the received signals in different cases. Here, two channel models are considered to generate the received signals, namely the numerical optical channel model and the Beer Law model. Besides, the performance on AQ SPAD and PQ SPAD are also investigated. From these figures, it can be observed that the overall BER performance of the proposed scheme is better than the two baselines. The difference between the two baselines is that a deep learning network structure (also the first subnetwork of our proposed scheme) is utilized for channel compensation. With such network, the BER can be improved greatly. In our proposed scheme, the second subnetwork is adopted by replacing the LLR demodulator, which can be regarded as a fine-tuning process. Hence, the BER can be further improved with the prior knowledge of the original transmitted bits.
1) Comparison of Water Types
In Fig. 6, the BER performance in the cases of different water types are compared. When using the proposed two-connected MLP network, for the clear ocean water type, the BER performance can be improved more in the relatively short distance than the long distance. This is because the data in the relative short distance suffers less distortion imposed by the channel and SPAD as well as the noise. The network can better extract the features of the received signals and recover the original information. On the contrary, in the relatively long distance, the data has extremely low power and is severely polluted by the non-Poisson-related noise and the dark count noise. The deep learning network can no longer discern the received signal. The recovery of the information bits from such data would contribute to an extremely low BER. The performance is even worse than that of directly demodulating the signal. More importantly, it can be seen that the degree of the distance improvement in clear ocean water with the proposed scheme is higher than that in coastal water. Specifically, in consideration of the same BER target (below the forward error correction threshold), the improved distance can be extended up to 3 m in clear ocean water with the proposed scheme whereas the improvement is only 1.8 m for coastal water. This effect is caused by the light characteristics when travelling in different water. In the water with higher turbility, the optical channel would deteriorate rapidly even in a short distance. Accordingly, its BER would drop faster than in the clearer water and the proposed scheme could not further recover the transmitted information.
2) Comparison of SPAD Types
The performance of the proposed scheme on different SPAD types are also investigated in Fig. 6. The two subfigures in the left column (Fig. 6(a)&(c)) are the simulation results for AQ SPAD while the two subfigures in the right column (Fig. 6(b)&(d)) are the results for PQ SAPD. It can be observed that there is a slightly difference between the impact on AQ SPAD and PQ SPAD, especially in the relatively short distance range. The achieved BER with the proposed scheme for AQ SPAD is better than PQ SPAD at the same distance. This is reasonable because the dead time would be extended by another photon arrival during the previous photon detection in PQ SPAD while the dead time of AQ SPAD retains constant. Therefore, the nonlinearity caused by PQ SPAD as well as the received signal is mildly worse than the AQ SPAD.
3) Comparison of Optical Channel Model
As is illustrated in Fig. 6, the performance of the proposed scheme on the received signal generated with different optical channel models are studied. For fair comparison, their MLP network parameters are obtained with their own model-generating training data. An interesting result can be noticed from Fig. 6(a) & (e), which is the achieved BER is better for data generated with the numerical optical channel than the widely used Beer Law model for coastal water at the same distance. This occurs because a higher received signal power with the numerical channel model can be obtained compared to the Beer Law model where the scattering photons are assumed to be lost. On the other hand, from Fig. 6(c) & Fig. 6 (f), it can be seen that the distance where the proposed scheme is working for the data generated with the numerical optical channel model (i.e., smaller than 75 m) is smaller than the Beer Law model (i.e., larger than 88 m) in the clear ocean water case. This is supported by the fact that the beam spreading is considered in the numerical optical channel model, which causes the additional of optical power loss compared to the Beer Law model when the light propagates through a long distance. This would further cause the received power of Beer Law model is higher than the numerical optical channel model and lead to better received signals.
B. Impact of Training Epoch
Finally, in Fig. 7, the impact of the training epoch on the loss function of the validation data and the BER performance of the two-connected network is evaluated. In this case, the water type is the coastal water and the training data is generated with the numerical optical channel model and AP SPAD model. Clearly, for the first subnetwork, as the epoch increases, the value of the loss function continuously decreases until a stead stable is obtained. However, the loss function of the second subnetwork firstly decreases and then slowly rises after the epoch increases to be 45, which is common (i.e., overfitting) in deep learning network. As a result, the overall BER trend of the output data of the second subnetwork goes up. To address this issue, the early stopping method (i.e., stopping the network training when a good result is obtained) is adopted in our simulation [32]. In addition, the loss as well as the BER goes up and down after the epoch reaches 65 in spite that the general trends of their values are climbing up. This is because in each epoch, the training data is randomly chosen from the dataset and the change of the data’s quality determines the variations of the loss and BER.
Performance comparison of each training epoch for the proposed network (Training data: coastal water, numerical optical channel model and AP SPAD model).
Conclusion
In this paper, we propose a novel deep learning aided signal detection scheme for SPAD based UOWC system to deal with the distortion imposed by the underwater optical channel and SPAD. Compared to existing channel compensation or channel equalization methods, the channel side information is no longer needed for the proposed scheme. With the prior expert knowledge of the signal processing method, a two-connected MLP based deep learning network is adopted and embedded into the receiver. The first subnetwork is built for channel compensation while the second one works as a demodulator. Consequently, the two subnetworks are trained sequently with different loss functions. Besides, to train the proposed network properly, vast training data for different water types are simulated with the numerical optical channel model and SPAD non-Poisson model. Simulation results show that the overall BER performance of the proposed scheme is better than two baselines. In addition, the degree of distance improvement in clear ocean water with the proposed scheme is better than that in coastal water as the optical channel would deteriorate rapidly for a short distance variation in coastal water.