I. Introduction
Feature description is the corner stone of numerous visual measurement tasks: geometric visual reconstruction [well known as structure from motion (SfM)] [1], simultaneous localization and mapping (SLAM) [2], [3], visual localization [4], detection and tracking [5], and so on. Early efforts paid attention to the design of compact, general, and efficient handcrafted descriptors relying on local intensity comparison [6]–[9] and gradient statistics [10]–[13]. Although some well-designed handcrafted descriptors, such as SIFT [10] and SURF [12], have been widely used in various visual tasks for their simplicity and practical performance, they disregard useful patterns hidden in the data and, thus, fail to provide reliable matches in a complex environment scenarios [14].