Conferences >2023 IEEE 26th International ...

Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention Prediction

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Pedestrian intention prediction is crucial for autonomous driving. In particular, knowing if pedestrians are going to cross in front of the ego-vehicle is core to perform...Show More

Metadata

Abstract:

Pedestrian intention prediction is crucial for autonomous driving. In particular, knowing if pedestrians are going to cross in front of the ego-vehicle is core to performing safe and comfortable maneuvers. Creating accurate and fast models that predict such intentions from sequential images is challenging. A factor contributing to this is the lack of datasets with diverse crossing and non-crossing (C/NC) scenarios. We address this scarceness by introducing a framework, named ARCANE, which allows programmatically generating synthetic datasets consisting of C/NC video clip samples. As an example, we use ARCANE to generate a large and diverse dataset named PedSynth. We will show how PedSynth complements widely used real-world datasets such as JAAD and PIE, so enabling more accurate models for C/NC prediction. Considering the onboard deployment of C/NC prediction models, we also propose a deep model named PedGNN, which is fast and has a very low memory footprint. PedGNN is based on a GNN-GRU architecture that takes a sequence of pedestrian skeletons as input to predict crossing intentions. ARCANE, PedSynth, and PedGNN is publicly released¹¹https://github.com/NomiMalik0207/PedSynth-and-PedGNN-for-Pedestrian-Intention-Prediction.

Published in: 2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC)

Date of Conference: 24-28 September 2023

Date Added to IEEE Xplore: 13 February 2024

ISBN Information:

ISSN Information:

DOI: 10.1109/ITSC57777.2023.10422401

Conference Location: Bilbao, Spain

Funding Agency:

Contents

I. Introduction

As evidenced in an early Google self-driving car report [24], the 10% of their self-driving malfunctions on streets were due to incorrect behavior predictions of other road users, including pedestrians. While there have been significant efforts to improve the accuracy of pedestrian intention prediction [15], [3], [16], [23], [18], [36], [5], there is still ample room for improvement. Currently, two datasets, JAAD [28] and PIE [27], are being used to benchmark such prediction models. In these datasets, the core ground truth (GT) consists of labeling if pedestrians are crossing or are going to cross in front of the ego vehicle. As for other onboard perception tasks (e.g., object detection and tracking [4], semantic segmentation [39], monocular depth estimation [17]), synthetic datasets have been proposed to train C/NC prediction models [1], [2]. We propose to go beyond these datasets by introducing a framework, named ARCANE²

ARCANE stands for adversarial cases for autonomous vehicles, the generic project supporting the development of the framework.

, where traffic scenarios of pedestrian behavior can be programmatically defined. This opens the possibility of introducing underrepresented vehicle-to-pedestrian traffic situations. For being aligned with the research community, ARCANE has been developed on top of the CARLA simulator [11]. As an example, we have used ARCANE to generate PedSynth which is a large and diverse synthetic dataset with pedestrian C/NC labels. Note that this type of labeling is not provided by the CARLA simulator, but it is generated by ARCANE. PedSynth consists of 947 video clips of pedestrian C/NC situations. Each video clip runs ~ 20s at 30fps, so resulting in approximately 5 H and 26 min of labeled videos. Figure 1 shows several frames of two video clips from PedSynth. On the other hand, users can generate their own datasets by working with ARCANE. Fig. 1:

Summary of two video clips from PedSynth. Top rows: a pedestrian crosses the road perpendicularly to the ego-vehicle moving direction. Bottom rows: a pedestrian change the intention of crossing the road at mid-lane. In both examples, the pedestrians enter the road at locations not enabled for crossing.

References is not available for this document.

MIT Libraries

MIT Libraries

Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention Prediction

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MIT Libraries

MIT Libraries

Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention Prediction

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References