I. Introduction
Internet of Things (IoTs), equipped with diverse miniatur-ized and low-power sensors, have catalyzed a remarkable surge in continuous and cost -effective pervasive sensing applications in recent years, encompassing environmental monitoring [1], asset tracking [2], and on-body human sensing [3]. On the other hand, advancements in artificial intelligence, especially deep neural networks (DNNs), enable effective extraction of both the explicit and implicit information within the substantial volume of sensor data, leading to enhanced sensing performance and more precise comprehension of the context [4]. For example, recent transformer DNNs like SwinV2 [5] can boost the image recognition accuracy up to 90.17% compared to 52.9% achievable by conventional machining learning like support vector machine (SVM) on the ImageNet dataset [6].
Illustration of (a) top-down and (b) bottom-up methodology.