Dynamic Multi-Scale Network for Dual-Pixel Images Defocus Deblurring with Transformer

Abstract:

Recent works achieve excellent results on the dual-pixel defocus deblurring task using convolutional neural networks (CNNs), but the scarcity of data has limited the exploration of vision transformers for this task. In this paper, we propose a dynamic multi-scale network, named DMTNet, for dual-pixel image defocus deblurring. In DMTNet, the feature extraction module is composed of several vision transformer blocks, whose powerful feature extraction capability yields robust features. The reconstruction module is composed of several Dynamic Multi-scale Sub-reconstruction Modules (DMSSRMs). Each DMSSRM restores images by adaptively assigning weights to features from different scales according to the blur distribution and content information of the input images. DMTNet combines the advantages of transformers and CNNs: the vision transformer raises the performance ceiling of the CNN, and the inductive bias of the CNN enables the transformer to extract more robust features without relying on a large amount of data. Experimental results on popular benchmarks demonstrate that DMTNet significantly outperforms state-of-the-art methods.
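
The abstract does not specify how a DMSSRM computes its per-scale weights, so the following is only a minimal sketch of the general idea of adaptive multi-scale weighting, not the authors' implementation. It assumes that features from each scale have already been brought to a common length and that a gating score per scale is available; the names `fuse_scales` and `gate_logits` are hypothetical, and the softmax gate is an assumption chosen for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_scales(scale_features, gate_logits):
    """Fuse per-scale features with input-dependent weights.

    scale_features: list of S equal-length feature vectors, one per scale.
    gate_logits: list of S scores. In a real network these would be
        predicted from the input image (assumption; supplied by hand here).
    Returns the normalized weights and the weighted sum of the features.
    """
    weights = softmax(gate_logits)
    length = len(scale_features[0])
    fused = [
        sum(w * feat[i] for w, feat in zip(weights, scale_features))
        for i in range(length)
    ]
    return weights, fused


# Toy input: three scales, two feature values each, equal gate scores.
features = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
weights, fused = fuse_scales(features, [0.0, 0.0, 0.0])
```

With equal gate scores every scale contributes equally, so the fused vector is the plain average of the three inputs. Raising one score shifts the result toward that scale, which is the behaviour the abstract describes as adapting to blur distribution and content.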

Published in: 2022 IEEE International Conference on Multimedia and Expo (ICME)

Date of Conference: 18-22 July 2022

Date Added to IEEE Xplore: 26 August 2022

DOI: 10.1109/ICME52920.2022.9859631

Conference Location: Taipei, Taiwan

1. Introduction

In modern cameras, dual-pixel sensors (photodiodes) are used for autofocus. When the signals from the left and right sensors are phase shifted, defocus blur appears in the photos. Defocus blur may degrade the performance of downstream computer vision tasks. For example, in semantic or instance segmentation, pixels in defocus-blurred regions cannot be segmented correctly. Defocus deblurring is therefore a fundamental and necessary research problem.
