Loading [MathJax]/extensions/MathMenu.js
Anti-Disturbance Self-Supervised Reinforcement Learning for Perturbed Car-Following System | IEEE Journals & Magazine | IEEE Xplore

Anti-Disturbance Self-Supervised Reinforcement Learning for Perturbed Car-Following System


Abstract:

This paper proposes an anti-disturbance car-following strategy for attenuating (i) exogenous disturbances from preceding traffic oscillations and (ii) endogenous disturba...Show More

Abstract:

This paper proposes an anti-disturbance car-following strategy for attenuating (i) exogenous disturbances from preceding traffic oscillations and (ii) endogenous disturbances in vehicular control systems (e.g., wind gust, ground friction, and rolling resistance). Firstly, it employs a modified robust controller to generate an expert car-following control experience. Subsequently, it imitates the expert behaviors via the behavioral cloning (BC) technique, thereby developing the anti-disturbance ability. Lastly, the obtained policy is optimized using the self-supervised reinforcement learning (RL) approach. The simulation experiments, comprising both training and evaluation phases, are performed via Python. To simulate car-following scenarios, we utilize the ground-truth data from the Next Generation Simulation (NGSIM) datasets. Through recursive interactions with the perturbed car-following environment, self-supervised RL drives stable policy improvement. The proposed anti-disturbance self-supervised RL (ADSSRL) policy presents a smooth and almost monotonously increasing reward curve. Further evaluation of disturbance dampening performance suggests that at least a 44.5% reduction in control efficiency cost and a 10.1% reduction in driving comfort cost are achieved compared with baselines.
Published in: IEEE Transactions on Vehicular Technology ( Volume: 72, Issue: 9, September 2023)
Page(s): 11318 - 11331
Date of Publication: 26 April 2023

ISSN Information:

Funding Agency:


I. Introduction

Connected autonomous vehicles (CAV), equipped with advanced communication and automation capability, have the potential to improve traffic capacity, reduce crash risks, and improve driving comfort [1], [2], [3]. As one of the most elementary driving tasks, CAV car-following control usually suffers from two kinds of disturbances. One is exogenous disturbances from preceding traffic oscillations. Speed disturbances might be amplified in a vehicle platoon, resulting in low fuel efficiency, increased crash likelihood, and severe traffic congestion [4], [5], [6]. The other is endogenous disturbances originating from wind gust, ground friction, rolling resistance, and uncertainty of vehicular parameters [7]. These factors make CAV control sluggish, unstable, or even error-prone because precise vehicular movement control (e.g., achieving expected acceleration) is affected.

Contact IEEE to Subscribe

References

References is not available for this document.