1. Introduction
In this paper we consider the problem of Multi-View Scene Capture-using multiple cameras to simultaneously recover the shape, reflectance and non-rigid motion of an unknown scene that evolves through time in a completely unknown way. While many techniques exist for recovering one of these properties when the rest of them are known (e.g., capturing the 3D motion of articulated [1], or deformable scenes [2]; reconstructing static Lambertian scenes [3], [4]; and recovering the reflectance of static scenes with known shape [5], [6]), our focus here is on the general case. In particular, how can we capture 3D scenes whose appearance depends on time-varying interactions between shape, reflectance, illumination, and motion? Answering this question would go a long way toward reconstructing many common real-world scenes that are beyond the current state of the art, including (1) highly-deformable and geometrically-complex surfaces whose shape, motion, self-occlusions and self-shadows change through time (e.g., clothing [7]), (2) non-Lambertian surfaces with complex shape and deformation properties (e.g., mm-scale dynamic representations of the human body), and (3) static or moving 3D objects with specular surfaces [8].