Loading web-font TeX/Main/Regular
Deep Learning on Mobile Devices Through Neural Processing Units and Edge Computing | IEEE Conference Publication | IEEE Xplore

Deep Learning on Mobile Devices Through Neural Processing Units and Edge Computing


Abstract:

Deep Neural Network (DNN) is becoming adopted for video analytics on mobile devices. To reduce the delay of running DNNs, many mobile devices are equipped with Neural Pro...Show More

Abstract:

Deep Neural Network (DNN) is becoming adopted for video analytics on mobile devices. To reduce the delay of running DNNs, many mobile devices are equipped with Neural Processing Units (NPU). However, due to the resource limitations of NPU, these DNNs have to be compressed to increase the processing speed at the cost of accuracy. To address the low accuracy problem, we propose a Confidence Based Offloading (CBO) framework for deep learning video analytics. The major challenge is to determine when to return the NPU classification result based on the confidence level of running the DNN, and when to offload the video frames to the server for further processing to increase the accuracy. We first identify the problem of using existing confidence scores to make offloading decisions, and propose confidence score calibration techniques to improve the performance. Then, we formulate the CBO problem where the goal is to maximize accuracy under some time constraint, and propose an adaptive solution that determines which frames to offload at what resolution based on the confidence score and the network condition. Through real implementations and extensive evaluations, we demonstrate that the proposed solution can significantly outperform other approaches.
Date of Conference: 02-05 May 2022
Date Added to IEEE Xplore: 20 June 2022
ISBN Information:

ISSN Information:

Conference Location: London, United Kingdom

Funding Agency:

Citations are not available for this document.

I. Introduction

Deep Neural Networks (DNN) have been successfully applied to various computer vision and natural language processing problems. Recently, many applications based on DNNs have been developed to provide more intelligent video analytics. For example, some drones such as DJI Mavic Pro can recognize and follow a target based on video analytics; law enforcement officers can use smart glasses to identify suspects [1]. In these applications, only lightweight DNNs can be run locally and their accuracy is much lower than advanced DNNs. Although advanced DNNs can provide us with better results, they also suffer from high computational overhead which means long delay and more energy consumption when running on mobile devices.

Cites in Papers - |

Cites in Papers - IEEE (15)

Select All
1.
Jianxin Shi, Miao Zhang, Linfeng Shen, Jiangchuan Liu, Lingjun Pu, Jingdong Xu, "Towards Neural Codec-Empowered 360^\circ Video Streaming: A Saliency-Aided Synergistic Approach", IEEE Transactions on Multimedia, vol.27, pp.1588-1600, 2025.
2.
Tanmoy Sen, Haiying Shen, "Elastically Scheduling Multiple DNN Inference Jobs on an Edge for High Efficiency and Accuracy", 2024 IEEE 21st International Conference on Mobile Ad-Hoc and Smart Systems (MASS), pp.185-193, 2024.
3.
Rong Cong, Zhiwei Zhao, Linyuanqi Zhang, Geyong Min, "Kite: Link-Adaptive and Real-Time Object Detection in Dynamic Edge Networks", IEEE Transactions on Mobile Computing, vol.23, no.12, pp.15224-15237, 2024.
4.
Ruiqi Wang, Guohong Cao, "Edge-Assisted Relevance-Aware Perception Dissemination in Vehicular Networks", 2024 IEEE 44th International Conference on Distributed Computing Systems (ICDCS), pp.715-725, 2024.
5.
Penglin Dai, Yangyang Chao, Xiao Wu, Kai Liu, Songtao Guo, "Context-Aware Offloading for Edge-Assisted On-Device Video Analytics Through Online Learning Approach", IEEE Transactions on Mobile Computing, vol.23, no.12, pp.12761-12777, 2024.
6.
Liang Zhang, Hongzi Zhu, Wen Fei, Yunzhe Li, Mingjin Zhang, Jiannong Cao, Minyi Guo, "Novas: Tackling Online Dynamic Video Analytics With Service Adaptation at Mobile Edge Servers", IEEE Transactions on Computers, vol.73, no.9, pp.2220-2232, 2024.
7.
Tao Wang, Tuo Shi, Xiulong Liu, Jianping Wang, Bin Liu, Yingshu Li, Yechao She, "Minimizing Latency for Multi-DNN Inference on Resource-Limited CPU-Only Edge Devices", IEEE INFOCOM 2024 - IEEE Conference on Computer Communications, pp.2239-2248, 2024.
8.
Hao Bao, Zhi Zhou, Fei Xu, Xu Chen, "COUPLE: Orchestrating Video Analytics on Heterogeneous Mobile Processors", 2024 IEEE 40th International Conference on Data Engineering (ICDE), pp.1561-1574, 2024.
9.
Tianxiang Tan, Guohong Cao, "Thermal-Aware Scheduling for Deep Learning on Mobile Devices With NPU", IEEE Transactions on Mobile Computing, vol.23, no.12, pp.10706-10719, 2024.
10.
Chenxuan Hou, Chenyang Wang, Kai Dong, Xiaofei Wang, Tarik Taleb, "RIS-Assisted Ad Hoc Edge for Optimal User Distribution in Service-Intensive Scenarios", GLOBECOM 2023 - 2023 IEEE Global Communications Conference, pp.1107-1112, 2023.
11.
Renjie Xu, Saiedeh Razavi, Rong Zheng, "Edge Video Analytics: A Survey on Applications, Systems and Enabling Techniques", IEEE Communications Surveys & Tutorials, vol.25, no.4, pp.2951-2982, 2023.
12.
Tianxiang Tan, Guohong Cao, "Deep Learning on Mobile Devices With Neural Processing Units", Computer, vol.56, no.8, pp.48-57, 2023.
13.
Zhichuang Sun, Ruimin Sun, Changming Liu, Amrita Roy Chowdhury, Long Lu, Somesh Jha, "ShadowNet: A Secure and Efficient On-device Model Inference System for Convolutional Neural Networks", 2023 IEEE Symposium on Security and Privacy (SP), pp.1596-1612, 2023.
14.
Pyeongjun Choi, Jeongho Kwak, "A Survey on Mobile Edge Computing for Deep Learning", 2023 International Conference on Information Networking (ICOIN), pp.652-655, 2023.
15.
Tanmoy Sen, Haiying Shen, Zakaria Mehrab, "Distributed Deep Learning in An Edge Computing System", 2022 IEEE 19th International Conference on Mobile Ad Hoc and Smart Systems (MASS), pp.645-653, 2022.

Cites in Papers - Other Publishers (1)

1.
Tymoteusz Miller, Irmina Durlik, Ewelina Kostecka, Paulina Mitan-Zalewska, Sylwia Sokołowska, Danuta Cembrowska-Lech, Adrianna Łobodzińska, "Advancements in Artificial Intelligence Circuits and Systems (AICAS)", Electronics, vol.13, no.1, pp.102, 2023.
Contact IEEE to Subscribe

References

References is not available for this document.