I. Introduction
Object detection with accurate locating and classifying capabilities plays an important role in remote sensing image understanding [1]. The performance of object detection algorithms has made great strides in recent years since the involvement of deep learning [2], [3]. Deep convolutional neural network (DCNN) model for object detection [4], [5], [6], [7], [8], [9] plays an important role in the interpretation of very high spatial resolution (VHR) remote sensing imageries, such as urban planning [10] and environmental monitoring [11].