I. Introduction
The integration of artificial intelligence with Internet of Things (IoT) devices significantly enhances operational efficiency and decision-making capabilities, showcasing a transformative effect across various domains, including smart healthcare, autonomous vehicles, and advanced manufacturing systems. However, these substantial advances have been heavily based on high-performance hardware and sophisticated software, such as deep neural networks (DNNs). Although DNN inference can achieve high accuracy, it comes at the expense of increased execution time and energy overhead. With the expected number of IoT devices exceeding 41.6 billion by 2025, the overall energy consumption of Artificial Intelligence of Things (AIoT) systems could contribute significantly to carbon emissions. In fact, the successful design and deployment of AIoT systems requires addressing several research challenges. In particular, the latency constraint and the sustainability of energy have emerged as significant concerns.