I. Introduction
Vision-based microaerial vehicle (MAV) detection and confidence measurement are crucial in various applications, such as flight safety, collision avoidance, autonomous navigation in air transportation, multi-MAV visual formation, and privacy protection [1]. The ability to detect small targets such as MAVs is a critical performance metric for assessing detection methods [2]. Existing advanced detectors require significant computational resources and memory footprint, challenging their implementation on memory-constrained edge-computing devices, while low-power consumption detectors typically suffer from insufficient accuracy. A compact MAV detection framework that ensures efficient and accurate detection with reduced computational overhead and memory usage can facilitate its application in edge-oriented microdeep learning systems.