I. Introduction
Recently, artificial intelligence (AI) has been introduced into the Internet of Things (IoT) frontier to seek efficient solutions, which paves the way for the burgeoning Artificial Intelligence of Things (AIoT) [1], [2], [3]. Especially, deep learning has been proven to be a promising enabler for efficient information retrieval from a tremendous amount of data, in the fields of computer vision and pattern recognition. The great explosion of IoT devices brings the exponential growth of data, where the number of IoT devices will reach 41.6 billion by 2025. To achieve deep insights from the data gathered by IoT devices, edge intelligence has emerged as an enabler to enhance AIoT system performance by providing resources at the edge of the network. However, the limitations of network resources and the concern of privacy leakage make the centralized edge training framework unsuitable for future communication networks [4]. To address the aforementioned issues, federated learning (FL) is becoming a potential approach dedicated to preserving privacy by distributing training tasks among distributed IoT devices [5], [6], [7]. Each IoT device serves as a client to train a shared deep neural network (DNN) model by local data and uploads the computed updates to a server for aggregation.