I. Introduction
The exponential growth in the number of mobile and IoT devices [1], i.e., devices used directly by end-users, has led to an unprecedented demand for computational resources [2]. These devices support a wide range of network services, such as data analysis, virtual reality, and video games. The explosive growth of network traffic further intensifies this demand and places greater pressure on the devices' processing capabilities. Although these devices are equipped with CPUs, their computational capabilities remain insufficient for computationally intensive tasks. Furthermore, they typically rely on battery power and must manage their energy consumption efficiently [3], [4]. It is therefore crucial to meet the increasing demand for computational resources while keeping energy consumption and latency low, thereby improving application performance and user satisfaction.