I. Introduction
With the rapid development of wireless communications [1], [2], [3], [4], [5], the era of Internet of Things is coming with the sixth generation of mobile networks (6G) [6], [7]. Artificial intelligence (AI) is a key technology for 6G [8], [9], enabling a wide variety of wireless intelligent applications spanning from speech recognition, computer vision to automation [10]. Big data is one of the key drivers that boost AI performance. With the rapid development of processor chips, mobile clients with integrated capabilities of sensing, communication, and computing can generate useful data, which is emerging as a major source to promote the performance of AI applications [11]. Traditionally, the raw data stored in a wide range of clients is collected by companies or institutions. However, this centralized framework for gathering and training has an Achilles’ heel, i.e., it violates data privacy by exposing raw data to the gatherers. Moreover, the available data for centralized training is limited as clients may be reluctant to upload data, constraining model performance. To solve this problem, Google proposed Federated Learning (FL), which is a distributed learning framework over plenty of clients connected to edge networks [12]. The key idea of FL is to train a Machine Learning (ML) model using local computing resources and data without transferring data elsewhere. FL is qualified to guarantee data privacy, while abundant data from extensive clients could be leveraged to improve the model performance. FL is promising to advance intelligent applications and has numerous successful explorations and implementations in IoT [13], [14], [15], intelligent core networks [16], self-driving vehicles [17], [18], and smart health [19], [20], [21], etc.