I. Introduction
With the rapid development of 5G and the Internet of Things (IoT) technology, massive smart devices need to access the network and the amount of task data in the network reaches an unprecedented size [1]. Moreover, diverse emerging sophisticated Internet applications are emerging, such as online gaming, virtual reality, augmented reality, etc., most of which have strict requirements for latency, security, and other metrics [2]. In order to provide high quality service to user equipments (UEs) and guarantee their quality of service (QoS) requirements, it is necessary to perform a large number of computing tasks on wireless devices in a short duration. However, UEs usually have a limited battery capacity and limited computation capability, making it challenging to handle these computation-intensive and latency-sensitive tasks locally. Although cloud computing can offload the computing tasks of UEs to cloud servers to relieve the pressure of mobile devices, in the case of IoT, offloading massive computing data to cloud servers can result in core network congestion and excessive delay issue.