1. Introduction
Autonomous driving has rapidly advanced with promising progress in both industry and academia. A crucial component of this development is offboard 3D object detection, which can utilize entire sequence data from sensors (video or sequential point cloud) with few constraints on model capacity and inference speed. Therefore, some approaches [33], [50] are dedicated to developing high-quality “auto labels”, aiming to reduce manual labor in point cloud annotation.