I. Introduction
With the development of convolution networks, object detection has made great progress in recent years. Object detection has been widely used in many real-world applications, such as hazard detection, environmental monitoring, change detection, and urban planning [2]. The traditional object detection methods (see [9], [12]) suffer from a common issue: a large-scale, diverse dataset is required to train a deep neural network model. Obtaining the labeled data is expensive and time-consuming, which imposes serious limitations on the applicability of object detection in real-world applications. In addition, training the model with a few novel class samples leads to an overfitting problem. Therefore, it becomes important to investigate a special method to learn robust detection from a few novel class samples.