
USD: Uncertainty-Based One-Phase Learning to Enhance Pseudo-Label Reliability for Semi-Supervised Object Detection

Abstract:

With the ease of accessing large unlabeled datasets, studies on semi-supervised learning for object detection (SSOD) have become increasingly popular. Among these SSOD studies, the pseudo-labeling method depends heavily on the accuracy of the pseudo-labels; thus, inaccurate annotations must be filtered out to prevent performance degradation. This study classifies the annotation errors that occur in pseudo-labeling methods as false negatives (FN) and false positives (FP), and proposes a solution for each type of error using uncertainty information obtained through Gaussian modeling. Network performance is improved by preventing FN objects from being learned as background, based on the uncertainty of the network output. In addition, based on the uncertainty of the annotations, low-reliability annotations are filtered out, and the degree to which FP objects are reflected in learning is determined. Considering both the network performance improvement and the training complexity, the proposed method employs one-phase learning, including a single pseudo-label update, to achieve maximum performance with a minimal learning process. Moreover, an algorithm is proposed that searches for the optimal update point to increase the expected performance improvement. Experiments on the Pascal VOC, COCO, and Cityscapes datasets show that the accuracy of the SSD network improves by 3.3%, 4.7%, and 4.1%, respectively, with negligible computational complexity compared to the baseline.
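As an illustration of the filtering idea described in the abstract, the following minimal Python sketch drops low-reliability pseudo-labels and assigns a loss weight to the remaining ones. It is not the authors' implementation: the `PseudoLabel` structure, the use of the mean per-coordinate standard deviation as the uncertainty measure, the thresholds `max_uncertainty` and `min_score`, and the weighting rule `1 - uncertainty` are all assumptions made for this example, and the standard deviations are assumed to be normalized to [0, 1].

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class PseudoLabel:
    """Hypothetical pseudo-label with Gaussian-modeled box coordinates."""
    box: Tuple[float, float, float, float]    # predicted means (x1, y1, x2, y2)
    sigma: Tuple[float, float, float, float]  # predicted std. dev. per coordinate
    score: float                              # classification confidence
    label: int                                # predicted class index


def box_uncertainty(p: PseudoLabel) -> float:
    """Assumed uncertainty measure: mean of the per-coordinate std. devs."""
    return sum(p.sigma) / len(p.sigma)


def filter_and_weight(
    labels: List[PseudoLabel],
    max_uncertainty: float = 0.5,  # assumed threshold, not from the paper
    min_score: float = 0.5,        # assumed threshold, not from the paper
) -> List[Tuple[PseudoLabel, float]]:
    """Drop unreliable pseudo-labels and weight the rest by their certainty."""
    kept = []
    for p in labels:
        u = box_uncertainty(p)
        if p.score < min_score or u > max_uncertainty:
            continue  # treated as a likely false positive and filtered out
        weight = max(0.0, 1.0 - u)  # assumed rule: more certain, larger weight
        kept.append((p, weight))
    return kept


if __name__ == "__main__":
    candidates = [
        PseudoLabel((10, 10, 50, 50), (0.1, 0.1, 0.1, 0.1), 0.9, 3),
        PseudoLabel((20, 20, 80, 80), (0.8, 0.8, 0.8, 0.8), 0.9, 1),
        PseudoLabel((5, 5, 30, 30), (0.1, 0.1, 0.1, 0.1), 0.3, 2),
    ]
    for p, w in filter_and_weight(candidates):
        print(p.label, round(w, 2))
```

In this sketch, the weight returned for each retained pseudo-label could scale that label's contribution to the unsupervised loss, so that less certain annotations influence training less.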

Published in: IEEE Transactions on Multimedia (Volume: 26)

Page(s): 6336 - 6347

Date of Publication: 01 January 2024

DOI: 10.1109/TMM.2023.3348662

I. Introduction

Deep learning models are significantly affected by the quality [1] and quantity [2] of the dataset used for training. Supervised learning with labeled datasets is the common approach to training deep learning models. In particular, object detection tasks that require high accuracy need a high-quality labeled dataset, because accuracy decreases when low-quality labels containing localization errors, classification errors, or false annotations are used [3]. However, obtaining a large number of high-quality labeled datasets is challenging owing to the high cost of annotation [4], [5]. Moreover, a small training dataset also reduces accuracy because it is not representative of the actual data distribution [2]. Abundant unlabeled data are readily available in the real world, but they cannot be used directly for supervised learning. If the large volume of new data collected by numerous mobile devices in each user's environment could be utilized, a model optimized for each user could be obtained.

Recently, to use the unlabeled datasets generated on each user's device, the data have commonly been transmitted to a data center and processed on a cloud server [6], [7]. However, this approach faces many obstacles, including data privacy concerns [8], transmission issues [9], the cloud computing burden, data center maintenance [10], and annotation costs [4]. If optimized training were possible on personal devices (e.g., mobile/edge devices) using large unlabeled datasets, it would provide benefits such as lower annotation cost, lower cloud processing cost, and improved model accuracy for personal applications [11]. Therefore, semi-supervised learning for object detection (SSOD) [12], [13], [14], which trains networks using large and readily available unlabeled datasets, is becoming increasingly popular.
