1 Introduction
The tremendous success of machine learning stimulates a new wave of smart applications. Despite the great convenience, these applications consume massive personal data, at the expense of our privacy. The growing concerns of privacy become one of the major impetus to shift computation from the centralized cloud to users’ end devices such as mobile, edge and IoTs. The current solution supports running on-device inference from a pre-trained model in near real-time [23], [24], whereas their capability to adapt to the new data and learn from each other is still limited.