Dynamic Multi-Scale Network for Dual-Pixel Images Defocus Deblurring with Transformer | IEEE Conference Publication | IEEE Xplore

Abstract:

Recent works achieve excellent results on the dual-pixel defocus deblurring task by using convolutional neural networks (CNNs), while the scarcity of data limits the exploration of vision transformers for this task. In this paper, we propose a dynamic multi-scale network, named DMTNet, for dual-pixel image defocus deblurring. In DMTNet, the feature extraction module is composed of several vision transformer blocks, whose powerful feature extraction capability yields robust features. The reconstruction module is composed of several Dynamic Multi-scale Sub-reconstruction Modules (DMSSRMs). Each DMSSRM restores images by adaptively assigning weights to features from different scales according to the blur distribution and content information of the input images. DMTNet combines the advantages of transformers and CNNs: the vision transformer raises the performance ceiling of the CNN, and the inductive bias of the CNN enables the transformer to extract more robust features without relying on a large amount of data. Experimental results on popular benchmarks demonstrate that DMTNet significantly outperforms state-of-the-art methods.
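The abstract does not spell out how a DMSSRM computes its per-scale weights, so the following is only a generic sketch of input-dependent multi-scale fusion, not the authors' design. It assumes each scale's features have already been resized to a common length, and the names `fuse_scales` and `gate_weights` are hypothetical. A score is derived from each scale's mean activation, the scores are normalised with a softmax, and the features are blended with the resulting weights.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_scales(features, gate_weights):
    """Blend per-scale feature vectors with input-dependent weights.

    features:     list of equal-length feature vectors, one per scale
                  (assumed already resized to a common resolution).
    gate_weights: one scalar per scale, standing in for learned gating
                  parameters (hypothetical, for illustration only).
    Returns (fused_vector, per_scale_weights).
    """
    if len(features) != len(gate_weights):
        raise ValueError("need exactly one gate weight per scale")
    # Global average pooling per scale, scaled by the gate parameter.
    scores = [g * (sum(f) / len(f)) for f, g in zip(features, gate_weights)]
    weights = softmax(scores)
    length = len(features[0])
    fused = [
        sum(w * f[i] for w, f in zip(weights, features))
        for i in range(length)
    ]
    return fused, weights
```

Because the weights depend on the input features, two images with different blur distributions would receive different blends of the same scales, which is the behaviour the abstract attributes to the reconstruction module.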
Date of Conference: 18-22 July 2022
Date Added to IEEE Xplore: 26 August 2022
Conference Location: Taipei, Taiwan

1. Introduction

In modern cameras, dual-pixel sensors (photodiodes) are used for autofocus. When the signals from the left and right sensors are phase shifted, defocus blur appears in the photos. Defocus blur may degrade the performance of subsequent computer vision tasks. For example, in semantic or instance segmentation, pixels in defocus-blurred regions cannot be segmented correctly. Therefore, defocus deblurring is a fundamental and necessary research topic for avoiding such problems.
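To make the phase-shift idea concrete, here is a minimal sketch that estimates the integer shift between a left and a right dual-pixel signal along one image row. It is an illustration under simplifying assumptions (1D signals, integer shifts, a mean-absolute-difference cost) and is not taken from the paper; the function name `estimate_shift` and the parameter `max_shift` are hypothetical. A shift of zero corresponds to an in-focus region, and a non-zero shift indicates defocus.

```python
def estimate_shift(left, right, max_shift=3):
    """Estimate the integer shift s such that left[i] matches right[i + s].

    Candidate shifts are tried in order of increasing magnitude, so a
    zero shift is preferred when several candidates fit equally well.
    The cost is the mean absolute difference over the overlapping samples.
    """
    n = len(left)
    best_shift, best_cost = 0, float("inf")
    for s in sorted(range(-max_shift, max_shift + 1), key=abs):
        pairs = [(left[i], right[i + s]) for i in range(n) if 0 <= i + s < n]
        if not pairs:
            continue  # no overlap at this shift
        cost = sum(abs(a - b) for a, b in pairs) / len(pairs)
        if cost < best_cost:
            best_shift, best_cost = s, cost
    return best_shift


# Toy example: the right view is the left view moved two samples to the right.
left_row = [0, 0, 1, 5, 1, 0, 0, 0]
right_row = [0, 0, 0, 0, 1, 5, 1, 0]
shift = estimate_shift(left_row, right_row)
```

In this toy example the two rows differ by two samples, so the estimated shift is non-zero, whereas comparing a row with itself gives a shift of zero.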
