I. Introduction
Recent years have witnessed significant efforts on improving vehicle safety and efficiency, with emerging driving assistance applications, such as avoiding collisions, augment reality navigation, lane following, etc. [1], [2], and [3] in the intelligent transportation systems. Generally, these applications rely on machine learning techniques to achieve smart and better performance. Specifically, these applications perform inference on real-time input data using pre-trained machine learning models. As reported by Gartner, autonomous vehicles will be among the top five fields of artificial intelligence software spending with a growth rate of 20.1% in 2022 [4]. However, these applications generally suffer from poor quality of service (QoS). This is because the massive resources required for computation, communication and storage in machine learning inference are limited in vehicles. Internet of vehicles (IoV) has attracted lots of attention to provide high quality of computing services [5], [6], [7]. Thus, vehicular edge computing (VEC) has emerged as an appealing paradigm to support delay-sensitive and computationally intensive services, by exploiting computation, storage and communication resources at the edge of vehicular network [8], [9]. Specifically, in VEC, edge servers are deployed on roadside units (RSUs) to enable on-board driving tasks meet real-time requirements [10], [11], [12].