I. Introduction
With the emergence of distributed data sets and the rapid growth of computing capability, distributed learning has become a promising mode for deployment of large-scale machine learning [1]. Under such circumstances, due to data privacy and limited communication resources, it is very difficult to transmit all the raw data sets to a central server for learning, especially for applications with widely distributed clients. Thus, to implement machine learning with high efficiency, researchers started to focus on the distributed learning schemes. In [2], the authors proposed the federated learning (FL) scheme for distributed data sets. Such a technique allows applications to collectively reap the benefits of shared models trained from the rich data while avoiding the need of central data collection.