Hierarchical Threshold Pruning Based on Uniform Response Criterion | IEEE Journals & Magazine | IEEE Xplore

Hierarchical Threshold Pruning Based on Uniform Response Criterion


Abstract:

Convolutional neural networks (CNNs) have been successfully applied to various fields. However, CNNs’ overparameterization requires more memory and training time, making ...Show More

Abstract:

Convolutional neural networks (CNNs) have been successfully applied to various fields. However, CNNs’ overparameterization requires more memory and training time, making it unsuitable for some resource-constrained devices. To address this issue, filter pruning as one of the most efficient ways was proposed. In this article, we propose a feature-discrimination-based filter importance criterion, uniform response criterion (URC), as a key component of filter pruning. It converts the maximum activation responses into probabilities and then measures the importance of the filter through the distribution of these probabilities over classes. However, applying URC directly to global threshold pruning may cause some problems. The first problem is that some layers will be completely pruned under global pruning settings. The second problem is that global threshold pruning neglects that filters in different layers have different importance. To address these issues, we propose hierarchical threshold pruning (HTP) with URC. It performs a pruning step limited in a relatively redundant layer rather than comparing the filters’ importance across all layers, which can avoid some important filters being pruned. The effectiveness of our method benefits from three techniques: 1) measuring filter importance by URC; 2) normalizing filter scores; and 3) conducting prune in relatively redundant layers. Extensive experiments on CIFAR-10/100 and ImageNet show that our method achieves the state-of-the-art performance on multiple benchmarks.
Published in: IEEE Transactions on Neural Networks and Learning Systems ( Volume: 35, Issue: 8, August 2024)
Page(s): 10869 - 10881
Date of Publication: 18 April 2023

ISSN Information:

PubMed ID: 37071515

Funding Agency:


I. Introduction

In the past few years, convolutional neural networks (CNNs) have been successfully applied in various computer vision tasks such as image classification [1], object detection [2], image quality evaluation [3], and image retrieval [4]. However, in recent years, CNNs, such as ResNet [5] and DenseNet [6], have increased to hundreds or even thousands of layers. In such a situation, CNNs’ overparameterization becomes a serious challenge for their deployment in source-limited edge devices. To address this issue, network pruning [7], [8], [9] is proposed, which has attracted extensive attention in the research community.

Contact IEEE to Subscribe

References

References is not available for this document.