I. Introduction
The outstanding success of convolutional neural networks (CNNs) for image classification has been supported by the increasing amount of computational power offered by GPUs. Indeed, in the recent years researchers have tried to reproduce such results on devices with limited computational complexity. Manually designed CNNs for mobile devices like Mobilenets [1], Shufflenets [2], and EfficientNets [3] have surged. Based on such efforts, projects like Mnasnet [4] and Fbnet [5] have tried to automatize the process of designing CNNs for custom tasks, given different hardware targets. These techniques gave birth to Hardware-aware neural architecture search (HW NAS).