I. Introduction
Internet of Things (IoT) generates a huge amount of data that can be exploited by Machine Learning (ML) techniques for provisioning a wide range of intelligent services such as financial services [1], smart transportation [2], [3], and smart home [4]. The end-edge-cloud architecture (Fig. 1) has become the backbone infrastructure for intelligent IoT applications [5]–[8]. In this architecture, end-user devices offload computing tasks to edge nodes and/or the cloud data center for processing. The edge nodes are typically implemented based on various network devices (e.g., access points and IoT gateways) in wireless networks, thus often having constrained computational and communication resources. Conventional ML techniques require the data generated on user/end devices to be transmitted to a central site (e.g., the cloud server), which not only compromises the privacy protection of user data but also consumes huge bandwidth in the IoT network. Therefore, the highly distributed and resource-constrained edge-cloud computing architecture calls for new efficient and privacy-preserving ML methods for smart service provisioning in IoT.