I. Introduction
With the advancement of the Internet of Things (IoT), intelligent mobile devices equipped with various sensors can collect and process data at unprecedented scales [1], [2], [3]. The generated massive volumes of data constitute a great source for training deep learning models in mobile computing systems [4], such as enhancing driving safety [5] and inferring emotional states [6]. However, users with mobile devices are usually not willing to upload their potentially privacy-sensitive data directly, which may cause severe private information leakages, such as personal position and gender. Furthermore, though deep learning has shown state-of-the-art performance in various application scenarios such as Computer Vision (CV) [7], [8], [9] and Natural Language Processing (NLP) [10], it is computationally expensive generally and requires enormous amounts of training data, especially with the trend of increasing model size, which is impractical for mobile devices.