Loading [MathJax]/extensions/MathMenu.js
Synthetic Training Data Generation and Domain Randomization for Object Detection in the Formula Student Driverless Framework | IEEE Conference Publication | IEEE Xplore

Synthetic Training Data Generation and Domain Randomization for Object Detection in the Formula Student Driverless Framework


Abstract:

Today industries strive toward using data-driven machine learning wherever applicable. Consequently, they re-quire manually or automatically labeled training data sets. C...Show More

Abstract:

Today industries strive toward using data-driven machine learning wherever applicable. Consequently, they re-quire manually or automatically labeled training data sets. Currently, synthetically generating labeled training data sets belongs to the open challenges in machine learning across multiple application fields. In this paper, we propose employing a procedural pipeline combining BlenderProc with domain randomization to create prelabeled training data sets synthetically. Randomizing the domain using uncorrelated random background images, we ensure that the neural network applied for object detection purely learns the object features and is background-independent. Our proposed pipeline yields a solution to create sizeable prelabeled training data sets. We assess the pipeline performance for the application of cone object detection for the formula student driverless competition using no real training and a small real-world training data set for fine-tuning: We show that using the synthetically generated training data fine-tuned with a limited real training data set performs best for object detection. This transfer learning-based, fine-tuned solution also outperforms the benchmark training data set in detecting knocked-ver cones that are neither present in the real nor the synthetic training data set. Consequently, by combining BlenderProc and domain randomization, we provide a solution for formula student teams to generate extensive training data for cone detection and other detection problems relevant to driverless.
Date of Conference: 16-18 November 2022
Date Added to IEEE Xplore: 30 December 2022
ISBN Information:
Conference Location: Maldives, Maldives

I. Introduction

The awareness that recording and labeling vast amounts of data are necessary to employ machine learning is pervasive. It raises the question of how to replace manual labeling of categorical data features for machine learning or reduce the required training data sizes. Therefore, methods for synthetic training data are gaining popularity and attention across mul-tiple application areas. Specifically, the field of autonomous driving requires object detection and hence desires labeled data sets. Student competitions like the formula student competition raise students' interest in the field of driverless and constitute ideal development grounds for future driverless systems.

Contact IEEE to Subscribe

References

References is not available for this document.