I. Introduction
The rise in popularity of smartphones, tablets, and wearable users has resulted in an overwhelming of rich data, such as images taken and text inputted by users. On the one hand, there is a potential scope to improve user experience and empower intelligent programs (e.g., analysis of user's activity, analysis of user's sentiment, analysis of image content, clinical care analysis of diabetes and heart disease, burglary analysis in a smart residential area, etc.). On the other hand, it poses privacy concerns, where a user's personal information has to be kept concealed. In this scenario, the users can be prepared with additional data computations rather than direct sharing of the private raw data as they are usually built with large processing and storage resources. As such, Federated Learning (FL) has been recently introduced to meet this purpose [1], [2], [3], [4].