I. Introduction
The Intelligent Transportation System (ITS), an essential part of the smart city, has benefited greatly from the development of emerging technologies. The Internet of Vehicles (IoV) enables ITS to realize dynamic and intelligent traffic management [1] [2]. The pursuit-evasion game (PEG), a realistic problem for studying the self-learning and autonomous control of multiple agents, has been extensively studied in many fields, such as spacecraft control [3] and robot control [4]. Multi-vehicle pursuit (MVP), an embodiment of PEG in ITS, is subject to additional constraints, such as complex road structures, other traffic participants, and traffic rules. A patrol guide released by the New York City Police Department describes a representative MVP game, in which multiple police vehicles cooperate to capture one or more suspect vehicles [5].