I. Introduction
There is a wide range of target applications for depth estimation, from obstacle detection to object measurement and from 3D reconstruction to image enhancement. Underwater depth estimation (note that here depth refers to the object range, and not to the depth underwater) is important for Autonomous Underwater Vehicles (AUVs) [15] (Fig. 1), localization and mapping, motion planning, and image dehazing [6]. As such inferring depth from vision systems has been widely investigated in the last years. There is a range of sensors and imaging setups that can provide depth, such as stereo, multiple-view, and time-of-flight (ToF) [11], [12], [23]. Monocular depth estimation is different from other vision systems in the sense that it uses a single RGB image with no special setup or hardware, and as such has many advantages. Because of mechanical design considerations, in many AUVs, it is difficult to place a stereo setup with a baseline that is wide enough, so monocular depth there is particularly attractive and can be combined with other sensors (e.g., Sonars) to set the scale.