Loading [MathJax]/extensions/MathZoom.js
Mining Oriented Information for Semi-Supervised Object Detection in Remote Sensing Images | IEEE Conference Publication | IEEE Xplore

Mining Oriented Information for Semi-Supervised Object Detection in Remote Sensing Images


Abstract:

Traditional object detection requires extensive annotation, consuming considerable time and manpower. In recent years, semi-supervised object detection (SSOD) methods, wh...Show More

Abstract:

Traditional object detection requires extensive annotation, consuming considerable time and manpower. In recent years, semi-supervised object detection (SSOD) methods, which utilize a blend of unlabeled and labeled data to train object detectors, have been extensively researched. SSOD has made significant progress and can achieve similar levels of accuracy as the fully supervised methods. However, existing SSOD approaches primarily focus on horizontal objects in natural scenes, with scant research on other scenarios such as remote sensing images. This paper proposes a novel semi-supervised oriented object detection algorithm based on a two-stage object detector. For oriented objects in remote sensing, we particularly emphasize the utilization of oriented information of remote sensing objects to generate precise pseudo-label and improve the learning capability of the student network. Iterative labeled data filtering is performed by incorporating metrics with designed measurements. Valuable annotated samples can enhance the quality of pseudo-label generation. A strong augmentation method has been designed to utilize rotational information, enabling the student network to learn more diverse features. Additionally, we investigate the long-tail distribution problem in remote sensing images and mitigate the bias brought by category imbalance through phased training and post-processing in detection. Our experiments demonstrate that the designed semi-supervised oriented object detection method surpasses existing methods in the DOTAv1.5 benchmark, culminating in state-of-the-art performance.
Date of Conference: 30 June 2024 - 05 July 2024
Date Added to IEEE Xplore: 09 September 2024
ISBN Information:

ISSN Information:

Conference Location: Yokohama, Japan

Funding Agency:

References is not available for this document.

I. Introduction

In the realm of fully supervised object detection, the availability of adequately labeled data is essential. The process of labeled data is both time-consuming and costly. Recently, the SSOD approach has garnered significant interest in the field. This emerging technique enhances the training of object detectors by integrating labeled data with more readily accessible unlabeled data. Most SSOD methodologies focus on detecting objects using horizontal bounding boxes in traditional settings. However, in more complex scenarios, such as the analysis of remote sensing images, objects often require annotation with oriented bounding boxes. Given the high costs associated with labeling oriented boxes, the exploration and development of semi-supervised methods tailored for such contexts are increasingly important. These efforts could substantially reduce costs while improving the accuracy and efficiency of oriented object detection.

Select All
1.
B. Chen, P. Li, X. Chen, B. Wang, L. Zhang and X.-S. Hua, "Dense learning based semi-supervised object detection", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4815-4824, 2022.
2.
Y. Tang, W. Chen, Y. Luo and Y. Zhang, "Humble teachers teach better students for semi-supervised object detection", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3132-3141, 2021.
3.
A. Tarvainen and H. Valpola, "Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results", Advances in neural information processing systems, vol. 30, 2017.
4.
Y. Wang, L. Yao, G. Meng, X. Zhang, J. Song and H. Zhang, "Semi-supervised object detection in remote sensing images based on active learning", IGARSS 2023-2023 IEEE International Geoscience and Remote Sensing Symposium, pp. 5571-5574, 2023.
5.
K. Sohn, D. Berthelot, N. Carlini, Z. Zhang, H. Zhang, C. A. Raffel, et al., "Fixmatch: Simplifying semi-supervised learning with consistency and confidence", Advances in neural information processing systems, vol. 33, pp. 596-608, 2020.
6.
V. Verma, K. Kawaguchi, A. Lamb, J. Kannala, A. Solin, Y. Bengio, et al., "Interpolation consistency training for semi-supervised learning", Neural Networks, vol. 145, pp. 90-106, 2022.
7.
J. Kim, J. Jang, S. Seo, J. Jeong, J. Na and N. Kwak, "Mum: Mix image tiles and unmix feature tiles for semi-supervised object detection", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14512-14521, 2022.
8.
D. Berthelot, N. Carlini, I. Goodfellow, N. Papernot, A. Oliver and C. A. Raffel, "Mixmatch: A holistic approach to semi-supervised learning", Advances in neural information processing systems, vol. 32, 2019.
9.
Y. Grandvalet and Y. Bengio, "Semi-supervised learning by entropy minimization", Advances in neural information processing systems, vol. 17, 2004.
10.
D.-H. Lee et al., "Pseudo-label: The simple and efficient semi-supervised learning method for deep neural networks", Workshop on challenges in representation learning, vol. 3, no. 2, pp. 896, 2013.
11.
Q. Xie, Z. Dai, E. Hovy, T. Luong and Q. Le, "Unsupervised data augmentation for consistency training", Advances in neural information processing systems, vol. 33, pp. 6256-6268, 2020.
12.
M. Sajjadi, M. Javanmardi and T. Tasdizen, "Regularization with stochastic transformations and perturbations for deep semi-supervised learning", Advances in neural information processing systems, vol. 29, 2016.
13.
T. Chen, S. Kornblith, M. Norouzi and G. Hinton, "A simple framework for contrastive learning of visual representations", International conference on machine learning, pp. 1597-1607, 2020.
14.
T. Miyato, S.-i. Maeda, M. Koyama and S. Ishii, "Virtual adversarial training: a regularization method for supervised and semi-supervised learning", IEEE transactions on pattern analysis and machine intelligence, vol. 41, no. 8, pp. 1979-1993, 2018.
15.
J. Jeong, S. Lee, J. Kim and N. Kwak, "Consistency-based semi-supervised learning for object detection", Advances in neural information processing systems, vol. 32, 2019.
16.
K. Sohn, Z. Zhang, C.-L. Li, H. Zhang, C.-Y. Lee and T. Pfister, "A simple semi-supervised learning framework for object detection", 2020.
17.
Y.-C. Liu, C.-Y. Ma, Z. He, C.-W. Kuo, K. Chen, P. Zhang, et al., "Unbiased teacher for semi-supervised object detection", 2021.
18.
T.-Y. Lin, P. Goyal, R. Girshick, K. He and P. Dollár, "Focal loss for dense object detection", Proceedings of the IEEE international conference on computer vision, pp. 2980-2988, 2017.
19.
H. Zhou, Z. Ge, S. Liu, W. Mao, Z. Li, H. Yu, et al., "Dense teacher: Dense pseudo-labels for semi-supervised object detection", European Conference on Computer Vision, pp. 35-50, 2022.
20.
W. Hua, D. Liang, J. Li, X. Liu, Z. Zou, X. Ye, et al., "Sood: Towards semi-supervised oriented object detection", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 15558-15567, 2023.
21.
R. Girshick, "Fast r-cnn", Proceedings of the IEEE international conference on computer vision, pp. 1440-1448, 2015.
22.
W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, et al., "Ssd: Single shot multibox detector", European Conference on Computer Vision, pp. 21-37, 2016.
23.
J. Redmon, S. Divvala, R. Girshick and A. Farhadi, "You only look once: Unified real-time object detection", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, 2016.
24.
S. Ren, K. He, R. Girshick and J. Sun, "Faster r-cnn: Towards real-time object detection with region proposal networks", Advances in neural information processing systems, vol. 28, 2015.
25.
J. Han, J. Ding, N. Xue and G.-S. Xia, "Redet: A rotation-equivariant detector for aerial object detection", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2786-2795, 2021.
26.
M. Liao, Z. Zhu, B. Shi, G.-s. Xia and X. Bai, "Rotation-sensitive regression for oriented scene text detection", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5909-5918, 2018.
27.
X. Yang, X. Yang, J. Yang, Q. Ming, W. Wang, Q. Tian, et al., "Learning high-precision bounding box for rotated object detection via kullback-leibler divergence", Advances in neural information processing systems, vol. 34, pp. 18381-18394, 2021.
28.
X. Xie, G. Cheng, J. Wang, X. Yao and J. Han, "Oriented r-cnn for object detection", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3520-3529, 2021.
29.
C. Beckham, S. Honari, V. Verma, A. M. Lamb, F. Ghadiri, R. D. Hjelm, et al., "On adversarial mixup resynthesis", Advances in neural information processing systems, vol. 32, 2019.
30.
D. Berthelot, C. Raffel, A. Roy and I. Goodfellow, "Understanding and improving interpolation in autoencoders via an adversarial regularizer", 2018.
Contact IEEE to Subscribe

References

References is not available for this document.