I. Introduction
The proliferation of video sensors and the significant advancement in deep neural networks (DNNs) have fostered a plethora of video analytics applications, such as pedestrian detection [1], traffic surveillance [2], and object tracking [3]. Advanced DNNs usually require intensive computation, which can easily drain the computing, storage, and battery resources on end devices that capture the video. Conversely, conventional cloud offloading may introduce prolonged latency and privacy concerns due to the long-distance data transmission. As a result, edge video analytics emerges as an effective solution [4]–[6], that offloads the streaming data to nearby edge servers for real-time video analysis.