Loading [MathJax]/extensions/MathZoom.js
PointGrid: A Deep Network for 3D Shape Understanding | IEEE Conference Publication | IEEE Xplore

PointGrid: A Deep Network for 3D Shape Understanding


Abstract:

Volumetric grid is widely used for 3D deep learning due to its regularity. However the use of relatively lower order local approximation functions such as piece-wise cons...Show More

Abstract:

Volumetric grid is widely used for 3D deep learning due to its regularity. However the use of relatively lower order local approximation functions such as piece-wise constant function (occupancy grid) or piece-wise linear function (distance field) to approximate 3D shape means that it needs a very high-resolution grid to represent finer geometry details, which could be memory and computationally inefficient. In this work, we propose the PointGrid, a 3D convolutional network that incorporates a constant number of points within each grid cell thus allowing the network to learn higher order local approximation functions that could better represent the local geometry shape details. With experiments on popular shape recognition benchmarks, PointGrid demonstrates state-of-the-art performance over existing deep learning methods on both classification and segmentation.
Date of Conference: 18-23 June 2018
Date Added to IEEE Xplore: 16 December 2018
ISBN Information:

ISSN Information:

Conference Location: Salt Lake City, UT, USA
University of Missouri - Columbia
University of Missouri - Columbia

1. Introduction

Deep learning has become a universal tool for many visual recognition tasks ranging from classification to segmentation, especially ConvNets for 2D images [28], [48], [50], [16], [10], [34], [37], [45], [18] thanks to its weight sharing and other kernel optimizations of 2D convolutions. It is therefore natural that a lot of researchers currently aim at the adaptation of deep ConvNets to 3D models. Such adaptation is, however, non-trivial due to the nature of 3D data representations. Currently the 3D geometry shape representation consists of point, mesh and volumetric grid. Mesh is extremely irregular and hence it is very hard to design a framework to directly learn from it. Point is flexible but it is unorganized. Volumetric grid is regular, which enables many researchers to utilize either occupancy grid or distance field as a mean of data representation and learn 3D convolutional networks from it.

University of Missouri - Columbia
University of Missouri - Columbia
Contact IEEE to Subscribe

References

References is not available for this document.