I. Introduction
Estimating dense depth from an online sequence of monocular images, such as those coming from a video stream, is a crucial component of many computer vision applications. For example, dense depth estimates are useful for enabling special 3D effects in augmented reality applications [33], and for computational photography applications such as geometrically consistent scene editing [43] or relighting [8]. Online estimation of monocular depth from video also has important uses in real-time applications, such as self-driving vehicles that rely on consumer camera systems instead of, or in addition to, LiDAR and RADAR [10].