Deep Reinforcement Learning for Autonomous Driving: A Survey


Abstract:

With the development of deep representation learning, the domain of reinforcement learning (RL) has become a powerful learning framework, now capable of learning complex policies in high-dimensional environments. This review summarises deep reinforcement learning (DRL) algorithms and provides a taxonomy of automated driving tasks where (D)RL methods have been employed, while addressing key computational challenges in the real-world deployment of autonomous driving agents. It also delineates adjacent domains such as behavior cloning, imitation learning, and inverse reinforcement learning, which are related but are not classical RL algorithms. The role of simulators in training agents and methods to validate, test and robustify existing RL solutions are also discussed.
Published in: IEEE Transactions on Intelligent Transportation Systems ( Volume: 23, Issue: 6, June 2022)
Page(s): 4909 - 4926
Date of Publication: 09 February 2021


I. Introduction

Autonomous driving (AD) systems consist of multiple perception-level tasks that have now achieved high precision on account of deep learning architectures. Besides perception, autonomous driving systems comprise multiple tasks where classical supervised learning methods are no longer applicable. First, the prediction of the agent’s action may change the future sensor observations received from the environment in which the autonomous driving agent operates, for example in the task of choosing the optimal driving speed in an urban area. Second, supervisory signals such as time to collision (TTC) or the lateral error w.r.t. the optimal trajectory of the agent represent the dynamics of the agent as well as uncertainty in the environment. Such problems require defining a stochastic cost function to be optimized. Third, the agent is required to learn new configurations of the environment, as well as to predict an optimal decision at each instant while driving in its environment. This represents a high-dimensional space, since the number of unique configurations under which the agent and environment are observed is combinatorially large. In all such scenarios we aim to solve a sequential decision process, which is formalized under the classical settings of Reinforcement Learning (RL), where the agent is required to learn and represent its environment as well as act optimally at each instant [1]. The rule by which the agent selects its action at each instant is referred to as the policy.
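The sequential decision process described above can be illustrated with a minimal sketch in Python. It is not taken from the survey: the environment `ToySpeedEnv`, the hand-written `policy`, the reward (negative deviation from a target speed) and the discount factor `gamma` are all hypothetical choices made for illustration. The point is only that each action changes the next observation, which is what separates this setting from supervised learning.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ToySpeedEnv:
    """Hypothetical 1-D environment: the state is the vehicle speed in m/s."""
    target: float = 10.0
    speed: float = 0.0

    def reset(self) -> float:
        self.speed = 0.0
        return self.speed

    def step(self, accel: float) -> Tuple[float, float]:
        # The chosen action determines the next observation.
        self.speed = max(0.0, self.speed + accel)
        # Assumed reward: penalise the deviation from the target speed.
        reward = -abs(self.target - self.speed)
        return self.speed, reward


def policy(speed: float, target: float = 10.0) -> float:
    """Hand-written policy: maps the observed state to an action (acceleration)."""
    if speed < target:
        return 1.0
    if speed > target:
        return -1.0
    return 0.0


def rollout(env: ToySpeedEnv, steps: int = 15, gamma: float = 0.9) -> Tuple[List[float], float]:
    """Run one episode; return the visited speeds and the discounted return."""
    obs = env.reset()
    speeds: List[float] = []
    discounted_return = 0.0
    for t in range(steps):
        action = policy(obs, env.target)
        obs, reward = env.step(action)
        speeds.append(obs)
        discounted_return += (gamma ** t) * reward
    return speeds, discounted_return


if __name__ == "__main__":
    speeds, ret = rollout(ToySpeedEnv())
    print(speeds, ret)
```

An RL algorithm would replace the hand-written `policy` with one learned from the observed rewards; the interaction loop in `rollout` would stay the same.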

For easy reference, the main acronyms used in this article are listed in the Appendix (Table IV).

