CNN and KPCA-Based Automated Feature Extraction for Real Time Driving Pattern Recognition

Driving pattern recognition process by CNN+KPCA.

Abstract:

Driving conditions greatly affect the energy control and the fuel economy of a hybrid electric vehicle (HEV). In this paper, an automated feature extraction scheme based on convolutional neural networks (CNNs) and kernel PCA (KPCA) is proposed for real time driving pattern recognition (RTDPR), in order to achieve consistent energy management performance. First, a dimension expanding strategy transforms one-dimensional speed sequences into a two-dimensional dataset. Then, the transformed data is sent to the CNN and KPCA based feature extractor. Finally, the feature extractor automatically selects the most representative features for classification. To improve the generalization of the CNN on a small sample dataset, the structure of the typical CNN is adjusted by adding a KPCA layer in order to reduce the number of model parameters. The model is trained and evaluated in simulation, and it is tested for RTDPR in the real world. Simulation and experimental results show that the proposed automated feature extraction strategy outperforms conventional driving pattern recognition algorithms based on manual feature extraction and achieves state-of-the-art recognition accuracy.

Published in: IEEE Access (Volume: 7)

Page(s): 123765 - 123775

Date of Publication: 02 September 2019

Electronic ISSN: 2169-3536

DOI: 10.1109/ACCESS.2019.2938768

CCBY - IEEE is not the copyright holder of this material. Please follow the instructions via https://creativecommons.org/licenses/by/4.0/ to obtain full-text articles and stipulations in the API documentation.

SECTION I.

Introduction

A driving pattern is typically defined as the driving cycle of a vehicle in a particular environment [1], [2]. Since the current driving pattern has a great impact on the energy management strategy of a hybrid electric vehicle (HEV) [3], [4], it is efficient to use prior knowledge of the driving cycle to achieve real time driving pattern recognition (RTDPR) and enhance the control performance of the HEV [5], [6]. RTDPR has been studied extensively [2], [7]–[10]. The conventional approach is to manually extract features from the historical speed data to characterize the driving patterns [2]. Classical machine learning models such as k-means [7], hidden Markov models [8], fuzzy c-means [9], and their variants [10] are then used to classify the extracted features into different categories. The quality of the feature extraction algorithm therefore has a great impact on the classification accuracy. However, the manually extracted features usually consist of average speeds, average accelerations and other features that are calculated directly from physical models [11], while more complex, high level features are hard to represent. In practice, such low level features are unable to characterize complex driving patterns effectively. Additionally, to reduce the time cost of RTDPR, only a limited number of features (e.g., 16) are selected to characterize the driving patterns [12]. It can therefore be concluded that the recognition accuracy of the conventional methods is significantly affected by the selected features. Recently, with the development of deep learning and its strong classification ability [13]–[15], the convolutional neural network (CNN) has been widely used in pattern recognition [16]–[18] and has achieved good performance. The CNN can achieve end-to-end recognition without manual feature extraction, but it has not yet been widely applied in RTDPR, partially due to the lack of large training sets.

Motivated by the CNN, we do not manually generate feature vectors from the historical speed data to build the model. Instead, the model learns to extract the features itself from the datasets [19]. During training, the model learns to select the most representative features, and how many of them to use, automatically. The simulation results indicate that the features selected automatically by the model are more representative than manually designed ones. The standard CNN is a nonlinear model with typically thousands of parameters, which can easily overfit when the training samples are insufficient [20]. Most of the parameters are concentrated in the fully-connected layers, which hold much redundancy. To solve this problem, we design an automated feature extractor that retains the front part of the CNN and removes the fully-connected layer. A kernel PCA (KPCA) layer is then added to process the extracted features further, so that the redundancy is removed and classification is simplified. Additionally, we apply a linear shift to the speed data to expand the dataset, which also proves very effective in avoiding overfitting.
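To make the role of a KPCA step more concrete, the following is a minimal pure-Python sketch of kernel PCA applied to a handful of feature vectors, such as flattened convolution outputs. It is an illustration under stated assumptions, not the configuration used in this paper: the RBF kernel, the value of `gamma`, the use of power iteration to obtain only the first component, the toy data and all function names are hypothetical choices made for this example.

```python
import math

def rbf_kernel_matrix(X, gamma):
    """Pairwise RBF kernel values exp(-gamma * ||xi - xj||^2)."""
    return [[math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(xi, xj)))
             for xj in X] for xi in X]

def center_kernel(K):
    """Center the kernel matrix in feature space."""
    n = len(K)
    row_mean = [sum(row) / n for row in K]
    total_mean = sum(row_mean) / n
    # K is symmetric, so column means equal row means.
    return [[K[i][j] - row_mean[i] - row_mean[j] + total_mean
             for j in range(n)] for i in range(n)]

def top_component(Kc, iters=200):
    """Leading eigenpair of the centered kernel matrix via power iteration."""
    n = len(Kc)
    v = [float(i + 1) for i in range(n)]  # deterministic start vector (assumption)
    eigval = 0.0
    for _ in range(iters):
        w = [sum(Kc[i][j] * v[j] for j in range(n)) for i in range(n)]
        eigval = math.sqrt(sum(x * x for x in w))
        if eigval == 0.0:
            break
        v = [x / eigval for x in w]
    return eigval, v

def kpca_first_feature(X, gamma=1.0):
    """Project each training sample onto the first kernel principal component."""
    Kc = center_kernel(rbf_kernel_matrix(X, gamma))
    eigval, v = top_component(Kc)
    # With a unit-norm eigenvector v, the projections equal sqrt(eigval) * v.
    return [math.sqrt(eigval) * x for x in v]

# Toy feature vectors forming two well-separated groups (placeholder data).
X = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
     [3.0, 3.0], [3.1, 3.0], [3.0, 3.1]]
z = kpca_first_feature(X)
```

In this toy case the single KPCA feature `z` takes one sign for the first group and the opposite sign for the second, which shows how a low-dimensional kernel projection can stand in for a much larger set of raw features before classification.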

In this work, we first collect the training samples from the historical speed data using a sliding window. The size and step of the window are adjusted during training. Second, we transform the training samples into a two-dimensional dataset so that the CNN based model can deal with the speed information effectively. Third, the two-dimensional dataset is divided into batches to fit the feature extractor. Finally, the extracted features are used for RTDPR. The specific contributions of this paper are as follows: (1) We improve the generalization of the standard CNN on small datasets by adding the KPCA layer. (2) We achieve an end-to-end strategy for RTDPR instead of manually designing features. (3) The historical speed sequence is transformed into two dimensions so that spatial features can be extracted. (4) We achieve state-of-the-art accuracy for RTDPR.
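The data preparation steps above (sliding window, two-dimensional transform, batching) can be sketched as follows. This is only an illustrative sketch: the window length of 16, the step of 2, the 4 x 4 row-major reshape, the batch size and the function names are assumptions made for the example, since the actual dimension expanding strategy and window settings are not given in this section.

```python
def sliding_windows(speeds, window, step):
    """Cut a 1-D speed sequence into fixed-length, possibly overlapping windows."""
    return [speeds[i:i + window]
            for i in range(0, len(speeds) - window + 1, step)]

def to_2d(sample, rows, cols):
    """Reshape a flat window into a rows x cols grid in row-major order (assumed layout)."""
    if len(sample) != rows * cols:
        raise ValueError("window length must equal rows * cols")
    return [sample[r * cols:(r + 1) * cols] for r in range(rows)]

def make_batches(items, batch_size):
    """Split the list of 2-D samples into consecutive batches."""
    return [items[i:i + batch_size] for i in range(0, len(items), batch_size)]

# Placeholder speed trace; real data would be recorded vehicle speeds.
speeds = list(range(20))
windows = sliding_windows(speeds, window=16, step=2)
grids = [to_2d(w, rows=4, cols=4) for w in windows]
batches = make_batches(grids, batch_size=2)
```

With 20 samples, a window of 16 and a step of 2, this yields three overlapping windows, each turned into a 4 x 4 grid. A smaller step produces more (and more strongly overlapping) training samples from the same speed history.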

The structure of this paper is as follows. The details of the CNN + KPCA architecture are described in Section II. Our model based on CNN + KPCA is then reported in Section III. Section IV presents applications to four typical patterns (congested urban, flowing urban, subway and highway) and to a real environment, and compares the results with those of other typical classifiers. Finally, Section V gives the conclusions of this paper.

SECTION II.

The CNN + KPCA Architecture

A. The Standard CNN Classifier

The CNN model is a complex nonlinear function that maps the input samples to the corresponding driving patterns. The overall structure of the CNN is shown in Fig. 1; it includes one input layer, the middle layers and one output layer. The input layer of the CNN takes the two-dimensional samples. The middle layers include the convolution layers and a fully-connected layer. Within each convolution layer, the convolution operation is performed and is immediately followed by the max-pooling operation. The outputs of the last convolution layer are then flattened to one dimension and used as the inputs of the fully-connected layer for further nonlinear transformation. The output layer contains four neurons that represent the different driving patterns. The details of the calculation process are described as follows.

FIGURE 1.

The architecture of a typical CNN classifier.
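As a rough illustration of the layer sequence in a typical CNN classifier (convolution, max-pooling, flattening, a fully-connected layer and four outputs), a minimal pure-Python forward pass might look like the sketch below. It assumes a single input channel and a single filter, implements convolution as cross-correlation (as CNN libraries commonly do), and uses a softmax over the four outputs. The 6 x 6 input, the kernel values, the untrained placeholder weights and the function names are all hypothetical and are not taken from the paper.

```python
import math

def conv2d_valid(image, kernel):
    """Slide the kernel over the image without padding (cross-correlation)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[r + i][c + j] * kernel[i][j]
                 for i in range(kh) for j in range(kw))
             for c in range(out_w)] for r in range(out_h)]

def max_pool(fmap, size):
    """Non-overlapping max-pooling; incomplete border windows are dropped."""
    return [[max(fmap[r + i][c + j] for i in range(size) for j in range(size))
             for c in range(0, len(fmap[0]) - size + 1, size)]
            for r in range(0, len(fmap) - size + 1, size)]

def flatten(fmap):
    """Flatten a 2-D feature map into a 1-D list."""
    return [v for row in fmap for v in row]

def dense_softmax(x, weights, biases):
    """Fully-connected layer followed by a softmax over the output neurons."""
    logits = [sum(w * v for w, v in zip(row, x)) + b
              for row, b in zip(weights, biases)]
    m = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Placeholder 6x6 two-dimensional sample and a 2x2 filter.
image = [[float(r * 6 + c) for c in range(6)] for r in range(6)]
kernel = [[1.0, 0.0], [0.0, -1.0]]

fmap = conv2d_valid(image, kernel)   # 5x5 feature map
pooled = max_pool(fmap, 2)           # 2x2 after pooling
features = flatten(pooled)           # 4 values fed to the dense layer

# Untrained placeholder weights: one row per driving pattern (four outputs).
weights = [[0.1 * (k + 1)] * len(features) for k in range(4)]
biases = [0.0] * 4
probs = dense_softmax(features, weights, biases)
predicted_pattern = probs.index(max(probs))
```

Even in this tiny example, the fully-connected layer holds one weight per flattened feature per output neuron, which is why most of a CNN's parameters tend to sit in that part of the network.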
