An Efficient and Fast Filter Pruning Method for Object Detection in Embedded Systems

Abstract:

Recently, CNN-based networks have exhibited high performance in computer vision. However, as these networks become deeper and wider, they are hard to deploy in real-time embedded environments. To overcome this drawback, filter pruning has been widely studied for neural network compression. Because filter pruning removes entire filters from a CNN, it accelerates inference without any special hardware or software. In this paper, we propose efficient and fast filter pruning (EFFP), which focuses on reducing training computation resources and searching for optimal pruned networks. Its success stems from two significant improvements over other pruning methods. (1) Short training time: in the pruning stage, we set redundant filters to zero so that the output feature map is the same as that of a lightweight model. (2) Adjusting to changes in redundancy through regrowing: it is difficult to obtain an optimal pruned model by pruning all redundant filters at once. Therefore, we use a pruning/regrowing method that gradually removes unimportant filters and avoids permanently pruning important ones, yielding an optimal lightweight model. Experimental results indicate that EFFP reduces FLOPs and parameters more efficiently and faster than other pruning methods on an object detection model. Inference time is measured on an NVIDIA Jetson Xavier NX. As a result, we improve mAP and inference time by up to 45% compared to other pruning methods.
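
As a rough illustration of the prune-then-regrow idea summarized in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes the L1 norm as the filter importance score and a linear pruning-ratio schedule, neither of which is stated in the abstract; the function names (`update_mask`, `apply_mask`, `ratio_schedule`) are hypothetical. The dense weights are kept separately from the 0/1 mask, so a filter that was zeroed in an earlier step can be restored later if its importance has grown.

```python
from typing import List

Filter = List[float]  # one flattened convolution filter (illustrative type)


def l1_norm(f: Filter) -> float:
    """Assumed importance score: sum of absolute weights."""
    return sum(abs(w) for w in f)


def update_mask(filters: List[Filter], ratio: float) -> List[int]:
    """Return a 0/1 mask that zeroes the `ratio` fraction of filters
    with the smallest L1 norm. The mask is recomputed from the dense
    weights each time, so earlier decisions are not permanent."""
    n_prune = int(len(filters) * ratio)
    order = sorted(range(len(filters)), key=lambda i: l1_norm(filters[i]))
    pruned = set(order[:n_prune])
    return [0 if i in pruned else 1 for i in range(len(filters))]


def apply_mask(filters: List[Filter], mask: List[int]) -> List[Filter]:
    """Zero out masked filters without discarding the dense weights."""
    return [[w * m for w in f] for f, m in zip(filters, mask)]


def ratio_schedule(step: int, total_steps: int, target: float) -> float:
    """Assumed linear schedule: raise the pruning ratio gradually to `target`."""
    return target * min(1.0, (step + 1) / total_steps)


# Toy layer with four filters of two weights each.
dense = [[0.9, -0.8], [0.1, 0.05], [0.4, -0.3], [0.02, 0.01]]

mask = update_mask(dense, 0.25)   # [1, 1, 1, 0]: the weakest filter is zeroed
dense[3] = [0.6, 0.5]             # pretend training made filter 3 important again
mask = update_mask(dense, 0.5)    # [1, 0, 0, 1]: filter 3 regrows, 1 and 2 are zeroed
effective = apply_mask(dense, mask)
```

In a real training loop the mask would be refreshed between optimization steps and the zeroed filters physically removed only at the end; how EFFP scores filters and decides when to regrow them is described in the full paper, not in this sketch.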

Published in: 2024 IEEE 6th International Conference on AI Circuits and Systems (AICAS)

Date of Conference: 22-25 April 2024

Date Added to IEEE Xplore: 19 July 2024

DOI: 10.1109/AICAS59952.2024.10595873

Conference Location: Abu Dhabi, United Arab Emirates

I. Introduction

Recently, as computational power has increased and training data has become more diverse, deep learning technology has been applied to various everyday applications, such as computer vision, speech recognition, and natural language processing. In particular, object detection networks based on convolutional neural networks (CNNs) have been actively studied in computer vision and have shown high performance. However, as these networks become deeper and wider, they use many convolutional layers and require large amounts of memory and computation. Such over-parameterized networks are unsuitable for practical embedded systems such as drones, phones, and cars, where memory, computing resources, and power are limited. Therefore, a significant challenge is to make CNNs lightweight, taking hardware specifications into account, so that they can be used more efficiently.
