Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control | IEEE Journals & Magazine | IEEE Xplore

Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control


Abstract:

Reinforcement learning (RL) is a promising data-driven approach for adaptive traffic signal control (ATSC) in complex urban traffic networks, and deep neural networks fur...Show More

Abstract:

Reinforcement learning (RL) is a promising data-driven approach for adaptive traffic signal control (ATSC) in complex urban traffic networks, and deep neural networks further enhance its learning power. However, the centralized RL is infeasible for large-scale ATSC due to the extremely high dimension of the joint action space. The multi-agent RL (MARL) overcomes the scalability issue by distributing the global control to each local RL agent, but it introduces new challenges: now, the environment becomes partially observable from the viewpoint of each local agent due to limited communication among agents. Most existing studies in MARL focus on designing efficient communication and coordination among traditional Q-learning agents. This paper presents, for the first time, a fully scalable and decentralized MARL algorithm for the state-of-the-art deep RL agent, advantage actor critic (A2C), within the context of ATSC. In particular, two methods are proposed to stabilize the learning procedure, by improving the observability and reducing the learning difficulty of each local agent. The proposed multi-agent A2C is compared against independent A2C and independent Q-learning algorithms, in both a large synthetic traffic grid and a large real-world traffic network of Monaco city, under simulated peak-hour traffic dynamics. The results demonstrate its optimality, robustness, and sample efficiency over the other state-of-the-art decentralized MARL algorithms.
Published in: IEEE Transactions on Intelligent Transportation Systems ( Volume: 21, Issue: 3, March 2020)
Page(s): 1086 - 1095
Date of Publication: 15 March 2019

ISSN Information:

Funding Agency:

Author image of Tianshu Chu
Department of Civil and Environmental Engineering, Stanford University, USA
Tianshu Chu received the B.S. degree in physics from Weseda University, Tokyo, Japan, in 2010, and the M.S. and Ph.D. degrees from the Department of Civil and Environmental Engineering, Stanford University, in 2012 and 2016, respectively. He is currently a Data Scientist with Uhana Inc., and also an Adjunct Professor with the Stanford Center for Sustainable Development and Global Competitiveness. His research interests in...Show More
Tianshu Chu received the B.S. degree in physics from Weseda University, Tokyo, Japan, in 2010, and the M.S. and Ph.D. degrees from the Department of Civil and Environmental Engineering, Stanford University, in 2012 and 2016, respectively. He is currently a Data Scientist with Uhana Inc., and also an Adjunct Professor with the Stanford Center for Sustainable Development and Global Competitiveness. His research interests in...View more
Author image of Jie Wang
Department of Civil and Environmental Engineering, Stanford University, USA
Jie Wang received the B.S. degree from Shanghai Jiao Tong University, the M.S. degrees from Stanford University and the University of Miami, and the Ph.D. degree in civil and environmental engineering from Stanford University, in 2003, where he is currently an Adjunct Professor with the Department of Civil and Environmental Engineering, and also the Executive Director of the Stanford Center for Sustainable Development and...Show More
Jie Wang received the B.S. degree from Shanghai Jiao Tong University, the M.S. degrees from Stanford University and the University of Miami, and the Ph.D. degree in civil and environmental engineering from Stanford University, in 2003, where he is currently an Adjunct Professor with the Department of Civil and Environmental Engineering, and also the Executive Director of the Stanford Center for Sustainable Development and...View more
Author image of Lara Codecà
Communication Systems Department, EURECOM, Sophia-Antipolis, France
Lara Codecà received the master’s degree in computer sciences from the University of Bologna, Italy, in 2011, and the Ph.D. degree from the University of Luxembourg, in 2016. In 2011, she was a Visiting Fellow with Prof. Dr. Mario Gerla’s Vehicular Lab, University of California at Los Angeles, Los Angeles, CA, USA. She is currently a Post-Doctoral Fellow at the CATS Group, EURECOM, France. Her research interests include (...Show More
Lara Codecà received the master’s degree in computer sciences from the University of Bologna, Italy, in 2011, and the Ph.D. degree from the University of Luxembourg, in 2016. In 2011, she was a Visiting Fellow with Prof. Dr. Mario Gerla’s Vehicular Lab, University of California at Los Angeles, Los Angeles, CA, USA. She is currently a Post-Doctoral Fellow at the CATS Group, EURECOM, France. Her research interests include (...View more
Author image of Zhaojian Li
Department of Mechanical Engineering, Michigan State University, East Lansing, USA
Zhaojian Li received the bachelor’s degree from the Department of Civil Aviation, Nanjing University of Aeronautics and Astronautics, China, and the M.S. and Ph.D. degrees in aerospace engineering (flight dynamics and control) from the University of Michigan, Ann Arbor, MI, USA, in 2013 and 2015, respectively. From 2016 to 2017, he has worked as an Algorithm Engineer at General Motors. He is currently an Assistant Profess...Show More
Zhaojian Li received the bachelor’s degree from the Department of Civil Aviation, Nanjing University of Aeronautics and Astronautics, China, and the M.S. and Ph.D. degrees in aerospace engineering (flight dynamics and control) from the University of Michigan, Ann Arbor, MI, USA, in 2013 and 2015, respectively. From 2016 to 2017, he has worked as an Algorithm Engineer at General Motors. He is currently an Assistant Profess...View more

I. Introduction

As a consequence of population growth and urbanization, the transportation demand is steadily rising in the metropolises worldwide. The extensive routine traffic volumes bring pressures to existing urban traffic infrastructure, resulting in everyday traffic congestions. Adaptive traffic signal control (ATSC) aims for reducing potential congestions in saturated road networks, by adjusting the signal timing according to real-time traffic dynamics. Early-stage ATSC methods solve optimization problems to find efficient coordination and control policies. Successful products, such as SCOOT [1] and SCATS [2], have been installed in hundreds of cities across the world. OPAC [3] and PRODYN [4] are similar methods, but their relatively complex computation makes the implementation less popular. Since the 90s, various interdisciplinary techniques have been applied to ATSC, such as fuzzy logic [5], genetic algorithm [6], and immune network algorithm [7].

Author image of Tianshu Chu
Department of Civil and Environmental Engineering, Stanford University, USA
Tianshu Chu received the B.S. degree in physics from Weseda University, Tokyo, Japan, in 2010, and the M.S. and Ph.D. degrees from the Department of Civil and Environmental Engineering, Stanford University, in 2012 and 2016, respectively. He is currently a Data Scientist with Uhana Inc., and also an Adjunct Professor with the Stanford Center for Sustainable Development and Global Competitiveness. His research interests include reinforcement learning, deep learning, multi-agent learning, and their applications to traffic signal control, wireless network control, autonomous driving, and other engineering control systems.
Tianshu Chu received the B.S. degree in physics from Weseda University, Tokyo, Japan, in 2010, and the M.S. and Ph.D. degrees from the Department of Civil and Environmental Engineering, Stanford University, in 2012 and 2016, respectively. He is currently a Data Scientist with Uhana Inc., and also an Adjunct Professor with the Stanford Center for Sustainable Development and Global Competitiveness. His research interests include reinforcement learning, deep learning, multi-agent learning, and their applications to traffic signal control, wireless network control, autonomous driving, and other engineering control systems.View more
Author image of Jie Wang
Department of Civil and Environmental Engineering, Stanford University, USA
Jie Wang received the B.S. degree from Shanghai Jiao Tong University, the M.S. degrees from Stanford University and the University of Miami, and the Ph.D. degree in civil and environmental engineering from Stanford University, in 2003, where he is currently an Adjunct Professor with the Department of Civil and Environmental Engineering, and also the Executive Director of the Stanford Center for Sustainable Development and Global Competitiveness. His research interests include information and knowledge management for sustainable development and innovation, enterprise IT infrastructure management, smart manufacturing, smart infrastructures and smart city, and environmental informatics.
Jie Wang received the B.S. degree from Shanghai Jiao Tong University, the M.S. degrees from Stanford University and the University of Miami, and the Ph.D. degree in civil and environmental engineering from Stanford University, in 2003, where he is currently an Adjunct Professor with the Department of Civil and Environmental Engineering, and also the Executive Director of the Stanford Center for Sustainable Development and Global Competitiveness. His research interests include information and knowledge management for sustainable development and innovation, enterprise IT infrastructure management, smart manufacturing, smart infrastructures and smart city, and environmental informatics.View more
Author image of Lara Codecà
Communication Systems Department, EURECOM, Sophia-Antipolis, France
Lara Codecà received the master’s degree in computer sciences from the University of Bologna, Italy, in 2011, and the Ph.D. degree from the University of Luxembourg, in 2016. In 2011, she was a Visiting Fellow with Prof. Dr. Mario Gerla’s Vehicular Lab, University of California at Los Angeles, Los Angeles, CA, USA. She is currently a Post-Doctoral Fellow at the CATS Group, EURECOM, France. Her research interests include (cooperative) intelligent transportation systems, vehicular traffic modelling, and big-data analysis. She is active in the SUMO community and collaborates with the SUMO developers at DLR (German Aerospace Centre).
Lara Codecà received the master’s degree in computer sciences from the University of Bologna, Italy, in 2011, and the Ph.D. degree from the University of Luxembourg, in 2016. In 2011, she was a Visiting Fellow with Prof. Dr. Mario Gerla’s Vehicular Lab, University of California at Los Angeles, Los Angeles, CA, USA. She is currently a Post-Doctoral Fellow at the CATS Group, EURECOM, France. Her research interests include (cooperative) intelligent transportation systems, vehicular traffic modelling, and big-data analysis. She is active in the SUMO community and collaborates with the SUMO developers at DLR (German Aerospace Centre).View more
Author image of Zhaojian Li
Department of Mechanical Engineering, Michigan State University, East Lansing, USA
Zhaojian Li received the bachelor’s degree from the Department of Civil Aviation, Nanjing University of Aeronautics and Astronautics, China, and the M.S. and Ph.D. degrees in aerospace engineering (flight dynamics and control) from the University of Michigan, Ann Arbor, MI, USA, in 2013 and 2015, respectively. From 2016 to 2017, he has worked as an Algorithm Engineer at General Motors. He is currently an Assistant Professor with the Department of Mechanical Engineering, Michigan State University. His research interests include learning-based control, nonlinear and complex systems, and robotics and automated vehicles. He was a recipient of the National Scholarship from China.
Zhaojian Li received the bachelor’s degree from the Department of Civil Aviation, Nanjing University of Aeronautics and Astronautics, China, and the M.S. and Ph.D. degrees in aerospace engineering (flight dynamics and control) from the University of Michigan, Ann Arbor, MI, USA, in 2013 and 2015, respectively. From 2016 to 2017, he has worked as an Algorithm Engineer at General Motors. He is currently an Assistant Professor with the Department of Mechanical Engineering, Michigan State University. His research interests include learning-based control, nonlinear and complex systems, and robotics and automated vehicles. He was a recipient of the National Scholarship from China.View more
Contact IEEE to Subscribe

References

References is not available for this document.