Journals & Magazines >IEEE Transactions on Computer... >Volume: 41 Issue: 9

Toward the Predictability of Dynamic Real-Time DNN Inference

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Deep neural networks (DNNs) have been widely used in many cyber–physical systems (CPSs). However, it is still a challenging work to deploy DNNs in real-time systems. In p...Show More

Metadata

Abstract:

Deep neural networks (DNNs) have been widely used in many cyber–physical systems (CPSs). However, it is still a challenging work to deploy DNNs in real-time systems. In particular, the execution time of DNN inference must be predictable, s.t. it could be known whether the runtime inference can complete within a required timing constraint. Moreover, the timing constraints may change dynamically with the runtime environment in many embedded applications, such as autonomous cars. A possible way to meet such dynamic real-time requirements is to execute different subnetworks of a DNN at runtime. However, improper construction of subnetworks may not only introduce unpredictable inference time, s.t. the real-timing constraints could be violated unexpectedly, but also has poor compatibility with the well-optimized machine learning framework (e.g., TensorFlow). In this article, we study the predictability when executing different subnetworks of a DNN. In particular, we present a featurewise runtime adaptation framework for DNN inference, which is implemented and validated on NVIDIA Jetson TX2 and Nano with TensorFlow. The experimental results show that our method can achieve predictable inference time in comparison with the state-of-the-art methods.

Published in: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems ( Volume: 41, Issue: 9, September 2022)

Page(s): 2849 - 2862

Date of Publication: 14 October 2021

ISSN Information:

DOI: 10.1109/TCAD.2021.3120329

Funding Agency:

Contents

I. Introduction

Deep neural networks (DNNs) have been implemented in many cyber–physical systems (CPSs) for solving complex machine learning problems [1], [2]. However, deploying DNNs in a real-time environment is still subject to many limitations, due to their unpredictable execution times resulting from complex runtime behavior on appropriative hardware. Moreover, in many embedded applications, the timing constraints may change dynamically during runtime, e.g., smart autonomous cars [3] and drones [4], where the system must deal with changes in application requirements, such as the sampling period and resource availability.

References is not available for this document.

MIT Libraries

MIT Libraries

Toward the Predictability of Dynamic Real-Time DNN Inference

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MIT Libraries

MIT Libraries

Toward the Predictability of Dynamic Real-Time DNN Inference

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References