I. Introduction
Edge computing brings computation and storage resources closer to the sources of data, facilitating the processing of the client data at the network periphery while meeting stringent response time requirements. Some great progresses have been achieved, especially in the mobile edge computing [1], [2]. However, practical applications often reveal challenges. As shown in Fig. 1, each edge hosts a diverse set of services, and it typically serves multiple clients. The distribution of clients among edges exhibits nonuniformity, with variations in both the number of requests submitted by each client and the specific service they require. This complexity can potentially degrade service quality, especially when an edge is inundated with an excessive number of client requests.
Illustration of unbalanced workloads and resource utilization of edge computing.