I. Introduction
With the rapid development of 5G, Mobile Edge Computing (MEC) has become a promising computing paradigm for delay-sensitive and compute-intensive AI tasks [1]. In MEC scenarios, AI models such as Convolutional Neural Networks (CNNs) are deployed on edge nodes equipped with computing resources, while User Equipment (UE) offloads multimedia data to those nodes for model inference and retrieves the inference results [2]. Compared with cloud computing, MEC achieves lower latency because tasks are computed entirely at the edge, avoiding the delay of transmitting data to the cloud. However, the resources of edge nodes are usually limited, so when deploying CNNs, MEC networks face the problem of insufficient computing power and memory on individual nodes. As shown in Figure 1, an effective solution is to split an AI task into multiple phases that are computed on different edge nodes, compensating for the limited computing power of a single node through the collaboration of edge nodes. Nevertheless, this approach introduces extra transmission delay because the intermediate results of the computation (i.e., tensors) must be transferred between edge nodes. Moreover, when the tasks assigned to the nodes are unevenly distributed, heavily loaded nodes incur large queuing delays, which directly degrades users' Quality of Experience (QoE).
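To make the partitioned-inference idea concrete, the following minimal sketch (assuming PyTorch and a toy two-stage CNN, both illustrative rather than the actual system studied here) splits a model at a layer boundary and reports the size of the intermediate tensor that would have to travel between two edge nodes.

```python
import torch
import torch.nn as nn

# Hypothetical two-phase split of a small CNN: phase A would run on one
# edge node, phase B on another; the intermediate tensor is what must be
# transmitted between them.
stage_a = nn.Sequential(                     # phase 1, on edge node A
    nn.Conv2d(3, 16, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.MaxPool2d(2),
)
stage_b = nn.Sequential(                     # phase 2, on edge node B
    nn.Conv2d(16, 32, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(32, 10),
)

x = torch.randn(1, 3, 224, 224)              # multimedia input offloaded by the UE
with torch.no_grad():
    t = stage_a(x)                           # phase 1 inference on node A
    # The intermediate tensor t would be serialized and sent to node B;
    # its size determines the extra inter-node transmission delay.
    print("intermediate tensor:", tuple(t.shape),
          f"{t.numel() * t.element_size() / 1024:.0f} KiB")
    y = stage_b(t)                           # phase 2 inference on node B
print("inference result shape:", tuple(y.shape))
```

The split point chosen here is arbitrary; in practice the partition layer trades off per-node computation against the volume of the intermediate tensor that must cross the network.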