I. Introduction
In the past few years, 3D visual communications have developed tremendously, allowing for the real-time capture and transmission of events [1]–[3]. Several space-time representations of such events are possible, each with different characteristics. Multiview-plus-depth video is based on multiple 2D projections of a 3D scene, which can be compressed efficiently, for example with the multiview extension of the High Efficiency Video Coding standard, MV-HEVC [4], [5]. Polygonal meshes, on the other hand, represent 3D scenes with connected polygons [6]. Point clouds also represent 3D scenes directly (Fig. 1), but do so by assigning colors to points in 3D space (voxels) [3].
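To make the voxel-based point-cloud representation concrete, the following is a minimal sketch rather than the representation used in any particular system. It assumes point coordinates normalized to [0, 1), a cubic grid with 2^depth cells per axis, and simple color averaging when several points fall into the same voxel. The function name `voxelize` and the parameter `depth` are hypothetical and chosen for illustration only.

```python
from collections import defaultdict


def voxelize(points, colors, depth=3):
    """Map colored points to occupied voxels on a 2**depth grid.

    Assumptions (not from the source text): coordinates lie in [0, 1),
    colors are integer RGB triples, and colors of points sharing a voxel
    are averaged with integer division.
    """
    size = 1 << depth  # number of voxels along each axis
    # Per-voxel accumulator: [sum_r, sum_g, sum_b, point_count]
    sums = defaultdict(lambda: [0, 0, 0, 0])
    for (x, y, z), (r, g, b) in zip(points, colors):
        # Quantize each coordinate to an integer voxel index, clamped to the grid.
        key = tuple(min(int(c * size), size - 1) for c in (x, y, z))
        acc = sums[key]
        acc[0] += r
        acc[1] += g
        acc[2] += b
        acc[3] += 1
    # Only occupied voxels are stored: integer position -> averaged RGB color.
    return {k: (v[0] // v[3], v[1] // v[3], v[2] // v[3]) for k, v in sums.items()}


points = [(0.10, 0.10, 0.10), (0.12, 0.11, 0.10), (0.90, 0.50, 0.20)]
colors = [(100, 0, 0), (200, 0, 0), (0, 255, 0)]
cloud = voxelize(points, colors, depth=3)
```

In this sketch the first two points fall into the same voxel and are merged, so `cloud` holds two occupied voxels. Storing only occupied voxels reflects the sparsity of typical point clouds, in which most of the 3D grid is empty.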