I. Introduction
Convolutional neural networks (CNNs) have demonstrated exceptional success in various computer vision tasks, such as image classification [1], object detection [2], and semantic segmentation [3]. Generally, deeper and wider CNN architectures lead to better performance [4]. However, the significant computational costs associated with such architectures limit their deployment on resource-constrained devices, such as mobile and edge platforms. As a result, pruning deep neural networks to reduce their storage and computational demands has become an active area of research [5] –[9].