1. Introduction
Detecting objects directly from the 3D point cloud is challenging yet imperative in many computer vision tasks, such as autonomous navigation, path planning for robotics, as well as some AR applications. The goal of 3D object detection is to localize all valid shapes and recognize their semantic label simultaneously, which puts forward high requirements for understanding the whole input scene.
Illustration of the importance of DisARM. (b) It is easy to mistake the cabinet as a table when the point cloud is incomplete and featureless. (c) Redundant relations are usually incomplete and lose the important displacement information of the target object. (d) The network can recognize and locate the cabinet easily with the help of DisARM which provides valid surrounding environment information.