Conferences >2019 IEEE/CVF Conference on C...

Single Image Depth Estimation Trained via Depth From Defocus Cues

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Estimating depth from a single RGB images is a fundamental task in computer vision, which is most directly solved using supervised deep learning. In the field of unsuperv...Show More

Metadata

Abstract:

Estimating depth from a single RGB images is a fundamental task in computer vision, which is most directly solved using supervised deep learning. In the field of unsupervised learning of depth from a single RGB image, depth is not given explicitly. Existing work in the field receives either a stereo pair, a monocular video, or multiple views, and, using losses that are based on structure-from-motion, trains a depth estimation network. In this work, we rely, instead of different views, on depth from focus cues. Learning is based on a novel Point Spread Function convolutional layer, which applies location specific kernels that arise from the Circle-Of-Confusion in each image location. We evaluate our method on data derived from five common datasets for depth estimation and lightfield images, and present results that are on par with supervised methods on KITTI and Make3D datasets and outperform unsupervised learning approaches. Since the phenomenon of depth from defocus is not dataset specific, we hypothesize that learning based on it would overfit less to the specific content in each dataset. Our experiments show that this is indeed the case, and an estimator learned on one dataset using our method provides better results on other datasets, than the directly supervised methods.

Published in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Date of Conference: 15-20 June 2019

Date Added to IEEE Xplore: 09 January 2020

ISBN Information:

ISSN Information:

DOI: 10.1109/CVPR.2019.00787

Conference Location: Long Beach, CA, USA

Contents

1. Introduction

In classical computer vision, many depth cues were used in order to recover depth from a given set of images. These shape from X methods include structure-from-motion, which is based on multi-view geometry, shape from structured light, in which the known light source plays the role of an additional view, shape from shadow, and most relevant to our work, shape from defocus. In machine learning based computer vision, the interest has mostly shifted into depth from a single image, treating the problem as a multivariant image-to-depth regression problem, with an additional emphasis on using deep learning.

References is not available for this document.

Single Image Depth Estimation Trained via Depth From Defocus Cues

Abstract:

Metadata

Abstract:

ISSN Information:

1. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Single Image Depth Estimation Trained via Depth From Defocus Cues

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

1. Introduction

References