Journals & Magazines >IEEE Transactions on Intellig... >Volume: 21 Issue: 3

Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Reinforcement learning (RL) is a promising data-driven approach for adaptive traffic signal control (ATSC) in complex urban traffic networks, and deep neural networks fur...Show More

Metadata

Abstract:

Reinforcement learning (RL) is a promising data-driven approach for adaptive traffic signal control (ATSC) in complex urban traffic networks, and deep neural networks further enhance its learning power. However, the centralized RL is infeasible for large-scale ATSC due to the extremely high dimension of the joint action space. The multi-agent RL (MARL) overcomes the scalability issue by distributing the global control to each local RL agent, but it introduces new challenges: now, the environment becomes partially observable from the viewpoint of each local agent due to limited communication among agents. Most existing studies in MARL focus on designing efficient communication and coordination among traditional Q-learning agents. This paper presents, for the first time, a fully scalable and decentralized MARL algorithm for the state-of-the-art deep RL agent, advantage actor critic (A2C), within the context of ATSC. In particular, two methods are proposed to stabilize the learning procedure, by improving the observability and reducing the learning difficulty of each local agent. The proposed multi-agent A2C is compared against independent A2C and independent Q-learning algorithms, in both a large synthetic traffic grid and a large real-world traffic network of Monaco city, under simulated peak-hour traffic dynamics. The results demonstrate its optimality, robustness, and sample efficiency over the other state-of-the-art decentralized MARL algorithms.

Published in: IEEE Transactions on Intelligent Transportation Systems ( Volume: 21, Issue: 3, March 2020)

Page(s): 1086 - 1095

Date of Publication: 15 March 2019

ISSN Information:

DOI: 10.1109/TITS.2019.2901791

Funding Agency:

Tianshu Chu

Department of Civil and Environmental Engineering, Stanford University, USA

Jie Wang

Department of Civil and Environmental Engineering, Stanford University, USA

Lara Codecà

Communication Systems Department, EURECOM, Sophia-Antipolis, France

Zhaojian Li

Department of Mechanical Engineering, Michigan State University, East Lansing, USA

Contents

I. Introduction

As a consequence of population growth and urbanization, the transportation demand is steadily rising in the metropolises worldwide. The extensive routine traffic volumes bring pressures to existing urban traffic infrastructure, resulting in everyday traffic congestions. Adaptive traffic signal control (ATSC) aims for reducing potential congestions in saturated road networks, by adjusting the signal timing according to real-time traffic dynamics. Early-stage ATSC methods solve optimization problems to find efficient coordination and control policies. Successful products, such as SCOOT [1] and SCATS [2], have been installed in hundreds of cities across the world. OPAC [3] and PRODYN [4] are similar methods, but their relatively complex computation makes the implementation less popular. Since the 90s, various interdisciplinary techniques have been applied to ATSC, such as fuzzy logic [5], genetic algorithm [6], and immune network algorithm [7].

Tianshu Chu

Department of Civil and Environmental Engineering, Stanford University, USA

Tianshu Chu received the B.S. degree in physics from Weseda University, Tokyo, Japan, in 2010, and the M.S. and Ph.D. degrees from the Department of Civil and Environmental Engineering, Stanford University, in 2012 and 2016, respectively. He is currently a Data Scientist with Uhana Inc., and also an Adjunct Professor with the Stanford Center for Sustainable Development and Global Competitiveness. His research interests include reinforcement learning, deep learning, multi-agent learning, and their applications to traffic signal control, wireless network control, autonomous driving, and other engineering control systems.

Jie Wang

Department of Civil and Environmental Engineering, Stanford University, USA

Jie Wang received the B.S. degree from Shanghai Jiao Tong University, the M.S. degrees from Stanford University and the University of Miami, and the Ph.D. degree in civil and environmental engineering from Stanford University, in 2003, where he is currently an Adjunct Professor with the Department of Civil and Environmental Engineering, and also the Executive Director of the Stanford Center for Sustainable Development and Global Competitiveness. His research interests include information and knowledge management for sustainable development and innovation, enterprise IT infrastructure management, smart manufacturing, smart infrastructures and smart city, and environmental informatics.

Lara Codecà

Communication Systems Department, EURECOM, Sophia-Antipolis, France

Lara Codecà received the master’s degree in computer sciences from the University of Bologna, Italy, in 2011, and the Ph.D. degree from the University of Luxembourg, in 2016. In 2011, she was a Visiting Fellow with Prof. Dr. Mario Gerla’s Vehicular Lab, University of California at Los Angeles, Los Angeles, CA, USA. She is currently a Post-Doctoral Fellow at the CATS Group, EURECOM, France. Her research interests include (cooperative) intelligent transportation systems, vehicular traffic modelling, and big-data analysis. She is active in the SUMO community and collaborates with the SUMO developers at DLR (German Aerospace Centre).

Zhaojian Li

Department of Mechanical Engineering, Michigan State University, East Lansing, USA

Zhaojian Li received the bachelor’s degree from the Department of Civil Aviation, Nanjing University of Aeronautics and Astronautics, China, and the M.S. and Ph.D. degrees in aerospace engineering (flight dynamics and control) from the University of Michigan, Ann Arbor, MI, USA, in 2013 and 2015, respectively. From 2016 to 2017, he has worked as an Algorithm Engineer at General Motors. He is currently an Assistant Professor with the Department of Mechanical Engineering, Michigan State University. His research interests include learning-based control, nonlinear and complex systems, and robotics and automated vehicles. He was a recipient of the National Scholarship from China.

References is not available for this document.

MIT Libraries

MIT Libraries

Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MIT Libraries

MIT Libraries

Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?