I. Introduction
The autonomous driving landscape has undergone a remarkable transformation due to the significant advancement in autonomous driving perception technology, the continuous progress in deep learning and enhanced computational capabilities, and the availability of cost-effective onboard sensors, such as cameras, radars, and LiDARs. To ensure the seamless operation of autonomous commercial vehicles in complex traffic scenarios, it is necessary to develop a robust perception system with the following essential characteristics [1], [2], [3].
Accuracy: The perception system must accurately identify the categories and sizes of nearby objects and predict their behavior in complex traffic scenarios. For high-speed commercial vehicles on highways, it is crucial to improve the perception of distant small objects. The detection capability of small objects is severely tested.
Real-Time Performance: The perception task should achieve high accuracy while completing it as quick as possible. Real-time performance is critical because high-latency perception systems can cause delays in vehicle decision-making and control [4], potentially resulting in fatal accidents. Lightweight models appropriate for onboard perception modules are crucial, necessitating low computational power and cost, as well as strong processing capabilities for small objects.
Robustness: The perception system must maintain normal operational performance in adverse environmental conditions, such as occlusions and low-light situations, ensuring the safety and stability of autonomous driving [5].