I. Introduction
The Internet of Things (IoT) enables many widely distributed devices to sense their surrounding states and transmit sensed data back to the central server, which make it especially useful for environmental monitoring, coal mine inspection, etc., [1]. With these advantages, we have witnessed a rapid growth of IoT in recent years [2], [3]. On the other hand, thanks to the emerging deep neural network (DNN) technique in recent years, the application of artificial intelligence comes into reality and becomes popular in many areas nowadays [4]. With the empowering of DNN technique, an IoT network can not only complete its traditional tasks, e.g., environmental monitoring or coal mining inspection, in a better way, but also accomplish many unprecedented jobs, such as intrusion detection or facial detection through video streams, etc. It is worth mentioning that a DNN-based application generally involves heavy computation burden [5]. However, it is challenging for an IoT device with limited computation capability to complete such a heavy task in time. In order to address this issue, mobile-edge computing (MEC) can serve as a promising solution [6], [7]. In an MEC system, an edge server rich in computation capability is deployed in the vicinity of many IoT devices. Every IoT device can offload its computation task to the nearby edge server, which can promise the accomplishment of the computation task with not only low delay but also low energy consumption [8], [9].