1. Introduction
Object detection in aerial images aims at locating objects of interest (e.g., vehicles, airplanes) on the ground and identifying their categories. With more and more aerial images being available, object detection in aerial images has been a specific but active topic in computer vision [3], [29], [36], [6]. However, unlike natural images that are often taken from horizontal perspectives, aerial images are typically taken from bird’s-eye view, which implies that objects in aerial images are always arbitrary oriented. Moreover, the highly complex backgrounds and variant appearances of objects further increase the difficulty of object detection in aerial images. These problems have been often approached by an oriented and densely packed object detection task [37], [31], [12], which is new while well-grounded and have attracted much attention in the past decade [27], [30], [26], [18], [1].