Loading [a11y]/accessibility-menu.js
Cropping Region Proposal Network Based Framework for Efficient Object Detection on Large Scale Remote Sensing Images | IEEE Conference Publication | IEEE Xplore

Cropping Region Proposal Network Based Framework for Efficient Object Detection on Large Scale Remote Sensing Images


Abstract:

It is very difficult to directly detect objects on the entire large scale remote sensing image, due to the limited GPU memory. Moreover, there are no objects of interest ...Show More

Abstract:

It is very difficult to directly detect objects on the entire large scale remote sensing image, due to the limited GPU memory. Moreover, there are no objects of interest in most areas of such a huge image, thus a lot of computational costs is wasted in dealing with these vain areas. Therefore, this paper proposes a Cropping Region Proposal Network (CRPN), which includes a weak semantic RPN for quickly locating interesting regions, and a dual-scale strategy for generating effective cropping regions. Cropping regions consist of small and large cropping scales for detecting various-scale objects including very small and very large objects, which is hard for existing methods. CRPN helps to detect effective regions of remote sensing image. Meanwhile, it is also modularized and can be easily connected with mainstream detectors to form an end-to-end detecting framework. Experiments on public DOTA dataset show that our CRPN is effective for filtering invalid regions to greatly reduce the computation burden, and helps to achieve more accurate object detection on large scale remote sensing images.
Date of Conference: 08-12 July 2019
Date Added to IEEE Xplore: 05 August 2019
ISBN Information:

ISSN Information:

Conference Location: Shanghai, China
References is not available for this document.

1. Introduction

The detection of geospatial objects is a fundamental and challenging problem for analyzing and understanding remote sensing images. Recently, deep learning has improved the performance significantly in various image processing techniques [1–6], and object detectors with convolutional neural network (CNN) have achieved the state-of-the-art accuracy. However, these successful detecting methods are difficult to exhibit their excellent performance on remote sensing images, since there are obvious differences between remote sensing images and natural scenes.

Select All
1.
K. Alex, S. Ilya and H. Geoffrey, "Imagenet classification with deep convolutional neural networks", NIPs, pp. 1097-1105, 2012.
2.
S. Ren, K. He, G. Ross and J. Sun, "Faster r-cnn: Towards real-time object detection with region proposal networks", NIPs, pp. 91-99, 2015.
3.
D. Zhang, D. Meng and J Han, "Co-saliency detection via a self-paced multiple-instance learning framework", TPAMI, vol. 39, no. 5, pp. 865-878, 2017.
4.
Y. Tian, C. Chen and S. Mubarak, "Cross-view image matching for geo-localization in urban environments", CVPR, pp. 1998-2006, 2017.
5.
C. Gong, P. Zhou and J. Han, "Duplex metric learning for image set classification", TIP, vol. 27, no. 1, pp. 281-292, 2018.
6.
D. Zhang, J. Han, L. Jiang, S. Ye and X. Chang, "Revealing event saliency in unconstrained video collection", TIP, vol. 26, no. 4, pp. 1746-1758, 2017.
7.
G. Xia, X. Bai, J. Ding, Z. Zhu, S. Belongie, J. Luo, et al., "Dota: A large-scale dataset for object detection in aerial images", CVPR, 2018.
8.
G. Cheng, P. Zhou and J. Han, "Learning rotation-invariant convolutional neural networks for object detection in vhr optical remote sensing images", IEEE Transactions on Geoscience and Remote Sensing, vol. 54, no. 12, pp. 7405-7415, 2016.
9.
H. Zhu, X. Chen, W. Dai, K. Fu, Q. Ye and J. Jiao, "Orientation robust object detection in aerial images using deep convolutional neural network", ICIP, pp. 3735-3739, 2015.
10.
J. Redmon, S. Divvala, R. Girshick and A. Farhadi, "You only look once: Unified real-time object detection", CVPR, pp. 779-788, 2016.
11.
J. Redmon and A. Farhadi, "Yolo9000: better faster stronger", CVPR, pp. 7263-7271, 2017.
12.
J. Redmon and A. Farhadi, "Yolov3: An incremental improvement", arXiv preprint arXiv:1804.02767, 2018.
13.
W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Fu, et al., "Ssd: Single shot multibox detector", ECCV, pp. 21-37, 2016.
14.
T. Lin, P. Dollar, R. Girshick, K. He, B. Hariharan and S. Belongie, "Feature pyramid networks for object detection", CVPR, pp. 2117-2125, 2017.
15.
T. Lin, P. Goyal, R. Girshick, K. He and P. Dollar, "Focal loss for dense object detection", CVPR, pp. 2980-2988, 2017.
16.
S. Zhang, X. Zhu, Z. Lei, H. Shi, X. Wang and S. Li, "S3fd: Single shot scale-invariant face detector", CVPR, pp. 192-201, 2017.
17.
B. Singh, M. Najibi and L. Davis, "Sniper: Efficient multi-scale training", NIPs, pp. 9333-9343, 2018.

Contact IEEE to Subscribe

References

References is not available for this document.