Loading [MathJax]/extensions/MathMenu.js
Learning RoI Transformer for Oriented Object Detection in Aerial Images | IEEE Conference Publication | IEEE Xplore

Learning RoI Transformer for Oriented Object Detection in Aerial Images


Abstract:

Object detection in aerial images is an active yet challenging task in computer vision because of the bird’s-eye view perspective, the highly complex backgrounds, and the...Show More

Abstract:

Object detection in aerial images is an active yet challenging task in computer vision because of the bird’s-eye view perspective, the highly complex backgrounds, and the variant appearances of objects. Especially when detecting densely packed objects in aerial images, methods relying on horizontal proposals for common object detection often introduce mismatches between the Region of Interests (RoIs) and objects. This leads to the common misalignment between the final object classification confidence and localization accuracy. In this paper, we propose a RoI Transformer to address these problems. The core idea of RoI Transformer is to apply spatial transformations on RoIs and learn the transformation parameters under the supervision of oriented bounding box (OBB) annotations. RoI Transformer is with lightweight and can be easily embedded into detectors for oriented object detection. Simply apply the RoI Transformer to light head RCNN has achieved state-of-the-art performances on two common and challenging aerial datasets, i.e., DOTA and HRSC2016, with a neglectable reduction to detection speed. Our RoI Transformer exceeds the deformable Position Sensitive RoI pooling when oriented bounding-box annotations are available. Extensive experiments have also validated the flexibility and effectiveness of our RoI Transformer.
Date of Conference: 15-20 June 2019
Date Added to IEEE Xplore: 09 January 2020
ISBN Information:

ISSN Information:

Conference Location: Long Beach, CA, USA
Citations are not available for this document.

1. Introduction

Object detection in aerial images aims at locating objects of interest (e.g., vehicles, airplanes) on the ground and identifying their categories. With more and more aerial images being available, object detection in aerial images has been a specific but active topic in computer vision [3], [29], [36], [6]. However, unlike natural images that are often taken from horizontal perspectives, aerial images are typically taken from bird’s-eye view, which implies that objects in aerial images are always arbitrary oriented. Moreover, the highly complex backgrounds and variant appearances of objects further increase the difficulty of object detection in aerial images. These problems have been often approached by an oriented and densely packed object detection task [37], [31], [12], which is new while well-grounded and have attracted much attention in the past decade [27], [30], [26], [18], [1].

Cites in Papers - |

Cites in Papers - IEEE (549)

Select All
1.
Yan Li, Lingyi Liu, Yunpeng Bai, Ying Li, Qiang Shen, "Appearance- and Orientation-aware Fine-grained Rotated Ship Detection in High-Resolution Satellite Imagery", ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.1-5, 2025.
2.
Lin Jiao, Haiyun Liu, Zheng Liang, Peng Chen, Rujing Wang, Kang Liu, "An Anchor-Free Refining Feature Pyramid Network for Dense and Multioriented Wheat Spikes Detection Under UAV", IEEE Transactions on Instrumentation and Measurement, vol.74, pp.1-14, 2025.
3.
Shuoyi Chen, Mang Ye, Yan Huang, Bo Du, "Towards Effective Rotation Generalization in UAV Object Re-Identification", IEEE Transactions on Information Forensics and Security, vol.20, pp.2593-2606, 2025.
4.
Jie Yang, Li Zhou, Yongfeng Ju, "AFDR-Det: Adaptive Feature Dual-Refinement Oriented Detector for Remote Sensing Object Detection", IEEE Access, vol.13, pp.32901-32917, 2025.
5.
Dong Ren, Yang Liu, Hang Sun, Lefei Zhang, Jun Wan, "Hierarchical Heterogeneous Geometric Foreground Perception Network for Remote Sensing Object Detection", IEEE Transactions on Geoscience and Remote Sensing, vol.63, pp.1-17, 2025.
6.
Yun Xiao, Jinfa Wang, Zhicheng Zhao, Bo Jiang, Chenglong Li, Jin Tang, "UAV Video Vehicle Detection: Benchmark and Baseline", IEEE Transactions on Geoscience and Remote Sensing, vol.63, pp.1-14, 2025.
7.
Tian Lu, Zi Wang, Junfang Wang, Xiguan Li, Zhang Li, "A Postdetection Framework With Optimal Transport for Multiclass Object Change Detection", IEEE Geoscience and Remote Sensing Letters, vol.22, pp.1-5, 2025.
8.
Tong Zhang, Yin Zhuang, Guanqun Wang, He Chen, Lianlin Li, Jun Li, "A Unified Remote Sensing Object Detector Based on Fourier Contour Parametric Learning", IEEE Transactions on Geoscience and Remote Sensing, vol.63, pp.1-25, 2025.
9.
Hai Lin, Ji Wang, Jingguo Li, "PFRNet: A Small Object Detection Method Based on Parallel Feature Extraction and Attention Mechanism", IEEE Access, vol.13, pp.26727-26738, 2025.
10.
Qifeng Lin, Haibin Huang, Daoye Zhu, Nuo Chen, Gang Fu, Yuanlong Yu, "Multiple Region Proposal Experts Network for Wide-Scale Remote Sensing Object Detection", IEEE Transactions on Geoscience and Remote Sensing, vol.63, pp.1-16, 2025.
11.
Qifeng Lin, Nuo Chen, Haibin Huang, Daoye Zhu, Gang Fu, Chuanxi Chen, Yuanlong Yu, "Attention-Based Mean-Max Balance Assignment for Oriented Object Detection in Optical Remote Sensing Images", IEEE Transactions on Geoscience and Remote Sensing, vol.63, pp.1-15, 2025.
12.
Huiying Wang, Chunping Wang, Qiang Fu, Binqiang Si, Dongdong Zhang, Renke Kou, Ying Yu, Changfeng Feng, "MINIAOD: Lightweight Aerial Image Object Detection", IEEE Sensors Journal, vol.25, no.5, pp.9167-9184, 2025.
13.
Ziqian Guan, Xieyi Fu, Pengjun Huang, Hengyuan Zhang, Hubin Du, Yongtao Liu, Yinglin Wang, Qang Ma, "Gaussian Combined Distance: A Generic Metric for Object Detection", IEEE Geoscience and Remote Sensing Letters, vol.22, pp.1-5, 2025.
14.
Xiping Shang, Nannan Li, Dongjin Li, Jianwei Lv, Wei Zhao, Rufei Zhang, Jingyu Xu, "CCLDet: A Cross-Modality and Cross-Domain Low-Light Detector", IEEE Transactions on Intelligent Transportation Systems, vol.26, no.3, pp.3284-3294, 2025.
15.
Wei Bao, Meiyu Huang, Jingjing Hu, Xueshuang Xiang, "Dual-Dynamic Cross-Modal Interaction Network for Multimodal Remote Sensing Object Detection", IEEE Transactions on Geoscience and Remote Sensing, vol.63, pp.1-13, 2025.
16.
Jin Liu, Zhongyuan Lu, Yaorong Cen, Hui Hu, Zhenfeng Shao, Yong Hong, Ming Jiang, Miaozhong Xu, "Enhancing Object Detection With Fourier Series", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.47, no.4, pp.2581-2596, 2025.
17.
Xiao-Nan Jiang, Xiang-Qian Niu, Fan-Lu Wu, Yao Fu, He Bao, Yan-Chao Fan, Yu Zhang, Jun-Yan Pei, "A Fine-Grained Aircraft Target Recognition Algorithm for Remote Sensing Images Based on YOLOV8", IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol.18, pp.4060-4073, 2025.
18.
Mo Zhou, Yue Zhou, Dawei Yang, Kai Song, "Pretrained Detail Enhancement Framework for Remote Sensing Object Detection", IEEE Access, vol.13, pp.6362-6377, 2025.
19.
Zhiming Deng, Tianyu Zhang, Cheng Wei, Xibin Cao, "Fast Object Detection and Localization in Ultrawide Swath Rotating Scan Remote Sensing Images", IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol.18, pp.4767-4779, 2025.
20.
Yangfan Li, Liang Chen, Wei Li, "Fine-Grained Ship Recognition With Spatial-Aligned Feature Pyramid Network and Adaptive Prototypical Contrastive Learning", IEEE Transactions on Geoscience and Remote Sensing, vol.63, pp.1-13, 2025.
21.
Min Dang, Gang Liu, Hao Li, Di Wang, Rong Pan, Quan Wang, "PRA-Det: Anchor-Free Oriented Object Detection With Polar Radius Representation", IEEE Transactions on Multimedia, vol.27, pp.145-157, 2025.
22.
Gui Gao, Yajun Wang, Yuhao Chen, Gang Yang, Libo Yao, Xi Zhang, Hengchao Li, Gaosheng Li, "An Oriented Ship Detection Method of Remote Sensing Image With Contextual Global Attention Mechanism and Lightweight Task-Specific Context Decoupling", IEEE Transactions on Geoscience and Remote Sensing, vol.63, pp.1-18, 2025.
23.
Minding Fang, Yu Gu, Dongliang Peng, "FEVT-SAR: Multicategory Oriented SAR Ship Detection Based on Feature Enhancement Vision Transformer", IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol.18, pp.2704-2717, 2025.
24.
Youming Wu, Yuxi Suo, Qingbiao Meng, Wei Dai, Tian Miao, Wenchao Zhao, Zhiyuan Yan, Wenhui Diao, Guocun Xie, Qingyang Ke, Yiming Zhao, Kun Fu, Xian Sun, "FAIR-CSAR: A Benchmark Dataset for Fine-Grained Object Detection and Recognition Based on Single-Look Complex SAR Images", IEEE Transactions on Geoscience and Remote Sensing, vol.63, pp.1-22, 2025.
25.
Zhenyu Fang, Jinchang Ren, Jiangbin Zheng, Rongjun Chen, Huimin Zhao, "Dual Teacher: Improving the Reliability of Pseudo Labels for Semi-Supervised Oriented Object Detection", IEEE Transactions on Geoscience and Remote Sensing, vol.63, pp.1-15, 2025.
26.
Shu Tian, Li Wang, Lin Cao, Lihong Kang, Xian Sun, Jing Tian, Xiangwei Xing, Bo Shen, Chunzhuo Fan, Kangning Du, Chong Fu, Ye Zhang, "A Dynamic Cascade Cross-Modal Coassisted Network for AAV Image Object Detection", IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol.18, pp.2749-2765, 2025.
27.
Yan Feng, Yupeng Zhang, Xiangqing Zhang, Yuning Wang, Shaohui Mei, "Large Convolution Kernel Network With Edge Self-Attention for Oriented SAR Ship Detection", IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol.18, pp.2867-2879, 2025.
28.
Haodong He, Jian Ding, Bowen Xu, Gui-Song Xia, "On the Robustness of Object Detection Models on Aerial Images", IEEE Transactions on Geoscience and Remote Sensing, vol.63, pp.1-12, 2025.
29.
Hailiang Huang, Jingchao Guo, Huangxing Lin, Yue Huang, Xinghao Ding, "Domain Adaptive Oriented Object Detection From Optical to SAR Images", IEEE Transactions on Geoscience and Remote Sensing, vol.63, pp.1-14, 2025.
30.
Peijin Wang, Huiyang Hu, Boyuan Tong, Ziqi Zhang, Fanglong Yao, Yingchao Feng, Zining Zhu, Hao Chang, Wenhui Diao, Qixiang Ye, Xian Sun, "RingMoGPT: A Unified Remote Sensing Foundation Model for Vision, Language, and Grounded Tasks", IEEE Transactions on Geoscience and Remote Sensing, vol.63, pp.1-20, 2025.

Cites in Papers - Other Publishers (368)

1.
Jiangshan Wang, Yifan Pu, Yizeng Han, Jiayi Guo, Yiru Wang, Xiu Li, Gao Huang, "GRA: Detecting Oriented Objects Through Group-Wise Rotating and\\xa0Attention", Computer Vision – ECCV 2024, vol.15075, pp.298, 2025.
2.
Hongwei Zhang, Lei Jin, Xuechao Zou, Jian Zhao, Junliang Xing, "GOP: A Group Object Perception Framework for Optical Remote Sensing", Pattern Recognition and Computer Vision, vol.15042, pp.214, 2025.
3.
Zhongjie Hu, Qi Liu, Song-Lu Chen, Yan Liu, Feng Chen, Xu-Cheng Yin, "Integrated Recognition of Arbitrary-Oriented Multi-line Billet Number", Pattern Recognition and Computer Vision, vol.15037, pp.114, 2025.
4.
Kun Wang, Zi Wang, Zhang Li, Xichao Teng, Yang Li, "Multi-Scale Cross Distillation for\\xa0Object Detection in\\xa0Aerial Images", Computer Vision – ECCV 2024, vol.15107, pp.452, 2025.
5.
Yanhao Chu, Qiang Tong, Xuhong Liu, Xiulei Liu, "ODAdapter: An Effective Method of\\xa0Semi-supervised Object Detection for\\xa0Aerial Images", Pattern Recognition and Computer Vision, vol.15033, pp.158, 2025.
6.
Wang Cao, Zhifu Huang, Yu Liu, "Shape-Aware Soft Label Assignment and Context Enhancement for Oriented Object Detection", Pattern Recognition and Computer Vision, vol.15043, pp.327, 2025.
7.
Zeyang Zhao, Qilong Xue, Yuhang He, Yifan Bai, Xing Wei, Yihong Gong, "Projecting Points to\\xa0Axes: Oriented Object Detection via\\xa0Point-Axis Representation", Computer Vision – ECCV 2024, vol.15086, pp.161, 2025.
8.
Yan Li, Weiwei Guo, Xue Yang, Ning Liao, Dunyun He, Jiaqi Zhou, Wenxian Yu, "Toward Open Vocabulary Aerial Object Detection with\\xa0CLIP-Activated Student-Teacher Learning", Computer Vision – ECCV 2024, vol.15144, pp.431, 2025.
9.
Hongyuan Wang, Lizhi Wang, Jiang Xu, Chang Chen, Xue Hu, Fenglong Song, Youliang Yan, "Learning Exhaustive Correlation for\\xa0Spectral Super-Resolution: Where Spatial-Spectral Attention Meets Linear Dependence", Computer Vision – ECCV 2024, vol.15083, pp.375, 2025.
10.
Minyoung Back, Jaewoo Ok, Heesub Shin, "Benchmarking SAR Target Detection Networks and Analysis of\ Degradation Depending on the Phase Error", The Journal of Korean Institute of Electromagnetic Engineering and\ Science, vol.35, no.10, pp.770, 2024.
11.
Hongmei Wang, Chenkai Li, Qiaorong Wu, Jingyu Wang, "An Improved DETR Based on Angle Denoising and Oriented Boxes Refinement for Remote Sensing Object Detection", Remote Sensing, vol.16, no.23, pp.4420, 2024.
12.
Aonan Cheng, Jincheng Xiao, Yingcheng Li, Yiming Sun, Yafeng Ren, Jianli Liu, "Enhancing Remote Sensing Object Detection with K-CBST YOLO: Integrating CBAM and Swin-Transformer", Remote Sensing, vol.16, no.16, pp.2885, 2024.
13.
Jing Liu, Donglin Jing, Yanyan Cao, Ying Wang, Chaoping Guo, Peijun Shi, Haijing Zhang, "Lightweight Progressive Fusion Calibration Network for Rotated Object Detection in Remote Sensing Images", Electronics, vol.13, no.16, pp.3172, 2024.
14.
Boyu Wang, Donglin Jing, Xiaokai Xia, Yu Liu, Luo Xu, Jiangmai Cheng, "DDE-Net: Dynamic Density-Driven Estimation for Arbitrary-Oriented Object Detection", Electronics, vol.13, no.15, pp.3029, 2024.
15.
Yong Tang, Hongan Pan, Jun Guo, Fei Shen, Zhengzhou Zhu, Honghui Jia, "Fourier-FPN: Fourier Improves Multi-scale Feature Learning for Oriented Tiny Object Detection", Advanced Intelligent Computing Technology and Applications, vol.14871, pp.450, 2024.
16.
jiaxu leng, Yongming Ye, Mengjingcheng MO, Chenqiang Gao, Ji Gan, Bin Xiao, Xinbo Gao, "Recent Advances for Aerial Object Detection: A Survey", ACM Computing Surveys, 2024.
17.
Weishan Zhao, Lijia Huang, Haitian Liu, Chaobao Yan, "Scattering-Point-Guided Oriented RepPoints for Ship Detection", Remote Sensing, vol.16, no.6, pp.933, 2024.
18.
Chi-Yi Tsai, Wei-Chuan Lin, "Precise Orientation Estimation for Rotated Object Detection Based on a Unit Vector Coding Approach", Electronics, vol.13, no.22, pp.4402, 2024.
19.
Kun Wang, Zi Wang, Zhang Li, Ang Su, Xichao Teng, Erting Pan, Minhao Liu, Qifeng Yu, , 2024.
20.
Ridhima Rani, Neeraj Kumar, Meenu Khurana, "Redundancy elimination in IoT oriented big data: a survey, schemes, open challenges and future applications", Cluster Computing, 2024.
21.
Binhuan Yuan, Xiyang Zhi, Jianming Hu, Wei Zhang, "Boosting Point Set-Based Network with Optimal Transport Optimization for Oriented Object Detection", Remote Sensing, vol.16, no.22, pp.4133, 2024.
22.
Hongjian Guo, Xianlin Zhou, Peng Yang, "Feature Enhancement Based Oriented Object Detection in Remote Sensing Images", Neural Processing Letters, vol.56, no.6, 2024.
23.
Touati Adli, Dimitrije Bujaković, Boban Bondžulić, Mohammed Zouaoui Laidouni, Milenko Andrić, "A Modified YOLOv5 Architecture for Aircraft Detection in Remote Sensing Images", Journal of the Indian Society of Remote Sensing, 2024.
24.
Jinpeng Wang, Nan Su, Chunhui Zhao, Yiming Yan, Shou Feng, "Multi-Modal Object Detection Method Based on Dual-Branch Asymmetric Attention Backbone and Feature Fusion Pyramid Network", Remote Sensing, vol.16, no.20, pp.3904, 2024.
25.
Peitong He, Sijian Zhao, Pan Pan, Guomin Zhou, Jianhua Zhang, "PDC-YOLO: A Network for Pig Detection under Complex Conditions for Counting Purposes", Agriculture, vol.14, no.10, pp.1807, 2024.
26.
Lingfei Ren, Huan Lei, Zhongxu Li, Wenyuan Yang, "AF-DETR: efficient UAV small object detector via Assemble-and-Fusion mechanism", Pattern Analysis and Applications, vol.27, no.4, 2024.
27.
Xiaorun Hong, Dongjie Fu, Jiasheng Tang, Vincent Lyne, Ming Luo, Fenzhen Su, "Ship detection in reefs and deep-sea with medium-high resolution images", Geo-spatial Information Science, pp.1, 2024.
28.
Zhili Lin, Biao Leng, "SSN: Scale Selection Network for Multi-Scale Object Detection in Remote Sensing Images", Remote Sensing, vol.16, no.19, pp.3697, 2024.
29.
Yuxuan Li, Xiang Li, Yimain Dai, Qibin Hou, Li Liu, Yongxiang Liu, Ming-Ming Cheng, Jian Yang, "LSKNet: A Foundation Lightweight Backbone for Remote Sensing", International Journal of Computer Vision, 2024.
30.
Chengyang Qian, Long Zhao, Peilin Ni, Yu Zhang, Zini Cao, Cai Jia, , 2024.
Contact IEEE to Subscribe

References

References is not available for this document.