Journals & Magazines >IEEE Access >Volume: 8

Wihi: WiFi Based Human Identity Identification Using Deep Learning

The basic idea of our proposed approach is to extract representative features for RNN training, and then perform identity identification. Moreover, the proposed Wihi appr...

Abstract:

Human identity identification based on channel state information (CSI) using commercial WiFi devices has drawn increasingly attention, and it can be used in many applicat...Show More

Metadata

Abstract:

Human identity identification based on channel state information (CSI) using commercial WiFi devices has drawn increasingly attention, and it can be used in many applications such as smart home, intrusion detection, building monitoring, activity recognition, etc. However, most of the existing identity identification approaches are sensitive to the influence of random noise derived from indoor environments, and thus their identification accuracies are far from satisfactory. In the present paper, a device-free CSI based human identity identification approach using deep learning (Wihi) is proposed. Wihi mainly utilizes three key techniques to identify different people. Firstly, to eliminate the influence of the random noise, discrete wavelet transform (DWT) strategy is introduced to denoise raw CSI data by leveraging signal decomposition. Secondly, in order to characterize human’s gaits profoundly, several representative features are exploited from different statistical profiles, including channel power distribution in time domain (CPD), time-frequency analysis (TFA), and energy distribution in different frequency bands (ED). Thirdly, a recurrent neural network (RNN) model with long short-term memory (LSTM) blocks is employed to learn the representative gait features extracted above and encode temporal information for realizing human identity identification. The proof-of-concept prototype of the proposed Wihi approach is implemented on a set of commercial WiFi devices, and multiple comprehensive experiments have been carried out to evaluate the performance of identity identification. The experimental results confirm that the proposed Wihi can achieve a satisfactory performance compared with some state-of-the-art approaches.

The basic idea of our proposed approach is to extract representative features for RNN training, and then perform identity identification. Moreover, the proposed Wihi appr...

Published in: IEEE Access ( Volume: 8)

Page(s): 129246 - 129262

Date of Publication: 14 July 2020

Electronic ISSN: 2169-3536

DOI: 10.1109/ACCESS.2020.3009123

Funding Agency:

Citations are not available for this document.

Contents

CCBY - IEEE is not the copyright holder of this material. Please follow the instructions via https://creativecommons.org/licenses/by/4.0/ to obtain full-text articles and stipulations in the API documentation.

SECTION I.

Introduction

Human identity identification has been researched for many years and is of great importance for many applications, such as smart home, indoor intrusion detection, building monitoring, etc. In order to identify different people, many identity identification approaches have been proposed with different techniques such as gait-based [1]–[3], fingerprint-based [4]–[6], face recognition-based [7], [8] and iris-based [9]–[11] approaches. Generally, these biological characteristics are representative and unique for everyone and can provide a high accuracy of identity identification, and therefore they can be widely applied to security systems. For instance, the characteristics are able to be used in the security systems to conduct identity identification when someone has access to a certain office or laboratory. Although these identification approaches have shown great promise in their applications, they suffer from a number of limitations such as needing light, personal privacy problem, high energy consumption, high installation overhead, requiring the dedicated sensor or device, etc. Consequently, these disadvantages somehow restrict their large-scale deployment in door environments (e.g., smart home and office). With the rapid development and ubiquity of commercial WiFi devices in typical indoor environments, there are increasingly applications utilizing channel state information (CSI) [12]–[27]. Since an entity walking between a pair of transmitter and receiver could generate significant impacts on the characteristics of WiFi signal, identity identification can be feasible utilizing WiFi CSI. In addition, owing to low power consumption, easy installation, no invasion, and large-scale deployment of commercial WiFi devices in indoor environments, identity identification is able to become reality. Furthermore, the passive device-free human identity identification approach using WiFi signal does not require users to take any sensor or device. Naturally, it is an ideal one compared with the traditional approaches.

It is well known that everyone’s natural walking patterns (i.e. gaits) are particular, which can be characterized by the differences in human’s height, body mass, and moving speed. When an entity walks in a target area, his/her gaits could affect the indoor electromagnetic environments in a unique manner which would be changed into the impacts on the characteristics of WiFi signal, and then the significant impacts are in turn manifested as distinct perturbation in WiFi CSI. Since human’s gaits are highly distinguishing for different people, it is possible to identify one person from a group of people examining his/her representative statistics features exhibited in WiFi CSI data. However, there are still many challenges we face in WiFi CSI based human identity identification. The first one is how to obtain effective CSI data. The CSI data collected is obtained from commercial WiFi Network Interface Cards (NICs), and thus it contains the random noise from various sources, such as nearby electronic devices. This random noise is able to add false edges, and then further influence the accuracy of identity identification and robustness. Therefore, it is a challenging problem that how to preserve signal details while filtering out the noise components in the WiFi signal efficiently. The second one is how to obtain multiple representative features for characterizing human’s gaits profoundly. The previous works [28], [29] have shown that human’s walking patterns can be described by a number of statistics features in time and frequency domain. However, the common features such as median value, mean value, maximum value, minimum value, variance, and entropy are not effective because they can be influenced easily by the random noise. In view of this, the identification accuracy is not satisfactory. Now, a natural question to ask: which gait features are better for identity identification? The third one is how to utilize the representative gait features extracted to identify different people effectively. It should be noted that feature extraction and identity identification are not jointly optimized. That is to say, it is not enough to rely on the representative gait features alone, and we should seek for suitable method to conduct identity identification.

To deal with these challenges, in this paper, a passive device-free WiFi CSI based human identity identification approach using recurrent neural network (Wihi) is proposed. To address the first challenge, discrete wavelet transform (DWT) [30] strategy is employed to eliminate the influence of the random noise through signal decomposition. It helps to reduce the interference of both complicated background environment and the random noise effectively. After that, the aim of improving identification quality can be achieved. To address the second one, we propose to extract three representative gait features in time and frequency domain for characterizing human’s walking patterns profoundly, including channel power distribution in time domain (CPD), time-frequency analysis (TFA), and energy distribution in different frequency bands (ED). To address the third one, a recurrent neural network (RNN) [31]–[33] model with long short-term memory (LSTM) blocks is introduced to learn the representative gait features above and encode temporal information for identifying different people. As the term recurrent implies, the proposed RNN model takes not only the current input data but also several previous input data. In other words, it has a memory that obtains the variation in input data. Therefore, the proposed RNN model can capture the complicated non-linear relationship between input and output data in the training phase efficiently. As a result, an entity can be identified from a group of people accurately.

Real experiments have been conducted to verify the high performance of the proposed Wihi approach. Besides, the identification performance is also compared with the existing approaches. In summary, the main contributions of the paper are as follows:

We propose Wihi, a novel passive device-free CSI based human identity identification using deep learning, which is capable of identifying different people and achieves excellent identification performance compared with the existing state-of-the-art approaches.
DWT is introduced to eliminate the influence of the random noise presented in the raw WiFi CSI data while preserving data details through signal decomposition.
Unlike the existing CSI based approaches, we extract several representative gait features from both time and frequency domain, including CPD, TFA, and ED, which can better characterize human’s walking patterns. Thus, this helps improving identity identification accuracy.
To identify different people accurately, the RNN model with LSTM blocks is used to identify different people by learning the gait features extracted, instead of raw CSI data. This can reduce the influence of the random noise derived from indoor environments significantly.

The remaining paper is organized as follows. We first review the related work in Section II, and Section III shows the basic background knowledge of WiFi CSI. Section IV illustrates the architecture and design of the proposed Wihi approach. Section V describes the raw CSI data collection for experiments and presents the experimental setups. Then, the experimental results are presented in this section. We discuss the limitations related to our approach in Section VI followed by a conclusion in Section VII.

SECTION II.

Related Work

Owing to the importance of human identity identification, a broad range of identity identification approaches have been proposed these years that can applied to different indoor environments. Most identity identification approaches utilize biometric characteristics such as face, iris, fingerprint, and gaits. Especially, these biometric characteristics are widely used in user authentication because they are distinguishing among different people and very stable across different time. The researchers in [1] proposed a novel human identification approach from long range gaits profiles in surveillance videos. Concretely, they investigated the role of multi view gaits images acquired from multiple cameras, importance of infrared and visible range images in ascertaining identity, and role of soft/secondary biometric in enhancing the accuracy and robustness of the identification systems. Hossain and Chetty [2] proposed a novel multi-view feature fusion of gait biometric information in surveillance videos for large-scale human identification. A fingerprint classification algorithm can be found in [4], and fingerprints were classified into five categories: arch, tented arch, left loop, right loop and whorl. Then the algorithm extracted singular points in a fingerprint image, and further conducted classification based on the number and locations of the detected singular points. A new structural approach to the fingerprint classification problem was presented in [5]. The fingerprint directional image was segmented into multiple regions by minimizing the variance of the element directions within the regions. Jain et al. [6] presented a fingerprint classification algorithm which can achieve a better performance than previously reported. The proposed algorithm used a novel representation and was based on a two-stage classifier to conduct a classification. Yang et al. [8] presented a new robust face-matching method with multi-feature fusion, combining the rotation-invariant texture feature vector, the scale-invariant feature transform vector, and the convolution neural network. Park and Park [9] proposed a novel iris recognition method based on score level fusion which used two Gabor wavelet filters and SVM. Galdi et al. [10] got around the sensor interoperability problem utilizing on the picture differences due to acquisition by different sensors, and then presented a novel system that combined the recognition of user’s iris and user’s device. Lee et al. [11] proposed a novel recognition approach for noisy iris and ocular images by leveraging one iris and two periocular regions, based on three convolutional neural networks. Although the identification accuracies of these approaches above were relatively high, they all required dedicated device or sensor, which could lead to high cost and limit their wide deployment.

WiFi CSI has been verified to be a reliable indicator for passive device-free identity identification and it has the special advantages of low cost, no invasion, and wide deployment in indoor environments, and thus WiFi CSI based human identity identification approaches have been widely studied [15]–[27]. The authors in [15] proposed a novel approach for device-free passive detection of moving humans with dynamic Speed. Concretely, both amplitude and phase information of CSI were extracted and shaped into sensitive metrics for target detection, and then CSI across multi-antennas in multiple input multiple output (MIMO) systems were further exploited to improve the detection accuracy and robustness. Xi et al. [16] presented a device-free based on CSI crowd counting approach, and this design was motivated by the observation that CSI was highly sensitive to the indoor environment variation. Wu et al. [17] proposed a unified approach for non-invasive detection of stationary and moving human using commercial WiFi devices. This approach took full use of both amplitude and phase information of CSI to detect stationary or moving targets. Lv et al. [18] proposed an accurate approach for speed independent device-free entity detection which was suitable for intrusion detection even when the entity’s moving speed was relatively slow. Domenico et al. [19] presented a WiFi CSI based device-free crowd counting and occupancy estimation approach that can be leveraged in several typical indoor environments different from the ones in which the training process has been performed. Xin et al. [20] proposed a novel approach for human identification, which took advantage of WiFi signals to perform non-intrusive human identification in domestic environments. It is based on the observation that each person has distinguishing influence patterns to the surrounding WiFi signal while moving indoors, regarding their body shape characteristics and motion patterns. The researchers in [21] presented a passive WiFi CSI based identity identification approach utilizing human’s gaits based on CSI of WiFi signals. Zou et al. [22] presented a human identification system that leveraged the measurements from existing WiFi-enabled Internet of Things (IoT) devices and produced the identity estimation via a novel sparse representation learning technique, and utilized the unique fine-grained gait patterns of each person revealed from the WiFi CSI measurements as the ”fingerprint” for human identification. Wang et al. [23] designed a deep learning method to analyze the gait features using CSI of COTS WiFi devices. Specially, the convolution layers were combined with LSTM layers to extract gait features automatically from CSI data and to identify persons, which effectively reduced the need for a large amount of data preprocessing by manual feature extraction. Motivated by the observation that PHY layer CSI is capable of capturing the frequency diversity of wideband channel, Hong et al. [25] proposed a novel feature of subcarrier-amplitude frequency (SAF). Based on this feature, the proposed approach realized human identification through a linear-kernel SVM. Liu et al. [26] presented a fine-grained device-free framework that can distinguish different actions and identify persons within a short duration using WiFi signal. To extract intrinsic features from the noisy CSI so as to realize high-performance device-free identification (DFI), Wang et al. [27] proposed a novel empirical-mode-decomposition-based identity identification framework, which decomposed raw noisy CSI measurements into intrinsic mode functions (IMF) and extracted intrinsic features from the IMF components accordingly. Zhang et al. [28] proposed a novel approach that analyzed the CSI data to extract unique features that were representative of the walking patterns of that individual, and thus allowed the system to uniquely identify that person uniquely. Zeng et al. [29] presented a framework that can identify a person from a group of people in a device-free manner using WiFi, and showed that CSI used in recent WiFi identified a person’s steps and gaits. Although these human identity identification approaches can guarantee certain accuracies as presented, they were interfered severely by the influence of the random noise derived from indoor environments, which could lead to a bad identification performance.

Different from these approaches above, we leverage DWT strategy to suppress the random noise presented in the raw CSI data. Based on this, several representative gait features are extracted from time and frequency domain to characterize human’s gaits. Furthermore, the proposed RNN model with LSTM blocks is used to learn the representative gait features extracted for identifying different persons from a group of people effectively. Thus, compared with most of the existing identification approaches, our proposed approach is able to access superior performance with regard to the robustness to the random noise and the accuracy of identity identification.

SECTION III.

Preliminary

In this section, a short overview of CSI is presented. The most of commercial WiFi devices operate on both the 2.4 GHz and 5 GHz frequency bands and also support MIMO techniques. In addition, the modern off-the-shelf devices also leverage orthogonal frequency division modulation (OFDM) to obtain fine-grained channel measurements at the physical layer. Specially, the OFDM channel is divided into multiple subcarriers where each subcarrier has a different signal amplitude and phase with regard to each transmitted signal. Generally, the mainstream WiFi systems are based on OFDM such as 802.11 a/g/n where a relatively wideband 20 MHz channel is partitioned into 52 subcarriers. Owing to the frequency diversity of these subcarriers, both the shadow fading and multipath effect caused by minute movements at different narrowband subcarriers could lead to different amplitude and phase totally. Any time we move, we create waves in this sea of WiFi signal, so it can be known that a small body movement in indoor environments could result in the drastic change of CSI at all the subcarriers. Our proposed approach thus takes advantage of the fine-grained CSI to capture the minute movement for identity identification.

Considering that surrounding objects (e.g., furniture and wall) in indoor environments can reflect WiFi signal with different intensities, the transmitted signal arrives at the receiver through multiple different paths where each of them can introduce a different time delay, amplitude attenuation, and phase shift. Thus, the channel impulse response (CIR) can be described as follows: $\begin{equation*} h\left ({\tau }\right)=\sum \limits _{i=1}^{N} {a_{i}} \text {e}^{-j\theta _{i}}\delta \left ({{\tau -\tau _{i}} }\right)\!,\tag{1}\end{equation*}$ View Source where $N$ denotes the total number of paths, $a_{i}$ , $\theta _{i}$ , and $\tau _{i}$ are the amplitude attenuation, phase shift, the propagation time delay of the $i$ -th multipath component, and $\delta (\tau)$ is the Dirac delta function, respectively. Alternatively, in frequency domain, the transmitting channel can be modeled by channel frequency response (CFR), which consists of two parts with regard to amplitude-frequency response and phase-frequency response. For that, CFR can be derived by using the Fast Fourier Transform (FFT) of CIR: $\begin{equation*} \mathbf {H}=FFT\left ({{h\left ({\tau }\right)} }\right)\!.\tag{2}\end{equation*}$ View Source

With commercial WiFi Network Interface Cards such as Intel 5300 and slight firmware modification, a group of subcarriers channel measurements can be obtained in the format of CSI: $\begin{align*}&\hspace {-0.5pc}\mathbf {H}=\left [{ {H_{1} e^{j\angle H_{1}},H_{2} e^{j\angle H_{2} },\ldots H_{l} e^{j\angle H_{l}},\ldots H_{30} e^{j\angle H_{30}}} }\right]^{T} \\& \qquad\qquad\qquad\qquad\qquad\quad \qquad \qquad \qquad\displaystyle {l\in \left [{ {1,30} }\right]\!,} \tag{3}\end{align*}$ View Source where $\left [{ \cdot }\right]^{T}$ represents the transpose operation, $H_{l}$ and $\angle H_{l}$ are the amplitude and phase of the $l$ -th subcarrier, respectively. Generally, the continuous raw CSI data of the $l$ -th subcarrier is collected, and the length of sliding time window is set as $m$ . It can be given by $\begin{align*}&\hspace {-0.5pc}\mathbf {H}_{l} =\left [{ {H_{1,l} e^{j\angle H_{1,l}},\ldots H_{\kappa,l} e^{j\angle H_{\kappa,l}},\ldots H_{m,l} e^{j\angle H_{m,l}}} }\right] \\& \qquad \qquad\qquad \qquad\qquad \qquad \qquad \qquad\displaystyle {\kappa \in \left [{ {1,m} }\right]\!,} \tag{4}\end{align*}$ View Source where $\mathbf {H}_{l}$ has a dimension of $1\times m$ , $H_{\kappa,l}$ and $\angle H_{\kappa,l}$ are the amplitude and phase of the $\kappa$ -th data point of the $l$ -th subcarrier.

SECTION IV.

Scheme Design

A. Overview

Our proposed Wihi approach only uses a pair of transmitter and receiver devices to collect the raw CSI data for human identification. Fig. 1 illustrates the overall architecture and block diagram of the proposed Wihi approach. It is assumed that a person without taking specified sensor or device walks in the target area. At the same time, the collected raw CSI data at the receiver would be constantly analyzed to conduct identity identification. Therefore, the proposed Wihi can be divided into three main blocks, including data preprocessing, feature extraction, and human identification.

FIGURE 1.

The architecture of the proposed Wihi approach.

Wihi: WiFi Based Human Identity Identification Using Deep Learning

Alerts

Abstract:

Metadata

Abstract:

Funding Agency:

Introduction

Related Work

Preliminary

Scheme Design

A. Overview

B. Data Preprocessing

C. Feature Extraction

1) Channel Power Distribution

2) Time-Frequency Analysis

3) Energy Distribution

D. RNN Model

Performance and Evaluation

A. Data Collection

B. Identification Results

C. Results With Different Approaches

D. Accuracy of Different People

E. Impact of Different Numbers of Features

F. Impact of Different Walking Paths

G. Impact of the Number of Hidden Nodes

H. Impact of Presence of Other Humans

I. Impact of Window Size

J. Impact of Different Walking Speed

K. Identification Accuracy of the Data Mixed from Two Indoor Environments

L. Identification Accuracy of the Data Mixed from Different Walking Paths

M. Impact of the Random Noise

Limitations

A. Multi-Target Identification

B. Walking Path

C. The Testing Range

Conclusion

Cites in Papers - IEEE (21) | Other Publishers (7)

Cites in Papers - IEEE (21)

Cites in Papers - Other Publishers (7)

References

Cites in Papers - |