Conferences >2018 IEEE Third International...

Performance Comparison of Deep Learning Techniques for Recognizing Birds in Aerial Images

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

In computer vision, significant advances have been made in recent years on object recognition and detection with the rapid development of deep learning, especially deep c...Show More

Metadata

Abstract:

In computer vision, significant advances have been made in recent years on object recognition and detection with the rapid development of deep learning, especially deep convolutional neural networks (CNN). The majority of deep learning methods for object detection have been developed for large objects and their performances on small-object detection are not very good. This paper contributes to research in low-resolution small-object detection by evaluating the performances of leading deep learning methods for object detection using a common dataset, which is a new dataset for bird detection, called Little Birds in Aerial Imagery (LBAI), created from real-life aerial imagery data. LBAI contains birds with sizes ranging from 10px to 40px. In our experiments, some of the best deep learning architectures were implemented and applied to LBAI, which include object detection techniques such as YOLOv2, SSH, and Tiny Face, in addition to small instance segmentation techniques including U-Net and Mask R-CNN. Among the object detection methods, experimental results demonstrated that SSH performed the best for easy cases, whereas Tiny Face performed the best for hard cases, i.e. where a cluttered background makes detecting birds difficult. Among small instance segmentation methods, experimental results revealed U-Net achieved slightly better performance than Mask R-CNN.

Published in: 2018 IEEE Third International Conference on Data Science in Cyberspace (DSC)

Date of Conference: 18-21 June 2018

Date Added to IEEE Xplore: 19 July 2018

ISBN Information:

DOI: 10.1109/DSC.2018.00052

Conference Location: Guangzhou, China

Contents

I. Introduction

Object detection is one of the crucial tasks in computer vision. In the past few years, the performance of obj ect detection [1]–[14] has dramatically improved due to the success of deep convolutional neural networks (CNN). Typically, object detection and recognition involve two steps: first, deep neural networks are used to localize the potential location of each target object; then, objects are classified into appropriate classes. If the first step can effectively localize the potential object, the second step will be easier. Even though the two-step approach achieved state-of-the-art performance, the running times are usually slow [11]. Therefore, one-stage detectors have been developed to improve the speed,

References is not available for this document.

MIT Libraries

MIT Libraries

Performance Comparison of Deep Learning Techniques for Recognizing Birds in Aerial Images

Abstract:

Metadata

Abstract:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MIT Libraries

MIT Libraries

Performance Comparison of Deep Learning Techniques for Recognizing Birds in Aerial Images

Alerts

Abstract:

Metadata

Abstract:

I. Introduction

References