ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation

Abstract:

Image keypoints and descriptors play a crucial role in many visual measurement tasks. In recent years, deep neural networks have been widely used to improve the performance of keypoint and descriptor extraction. However, the conventional convolution operations do not provide the geometric invariance required for the descriptor. To address this issue, we propose the sparse deformable descriptor head (SDDH), which learns the deformable positions of supporting features for each keypoint and constructs deformable descriptors. Furthermore, SDDH extracts descriptors at sparse keypoints instead of a dense descriptor map, which enables efficient extraction of descriptors with strong expressiveness. In addition, we relax the neural reprojection error (NRE) loss from dense to sparse to train the extracted sparse descriptors. Experimental results show that the proposed network is both efficient and powerful in various visual measurement tasks, including image matching, 3-D reconstruction, and visual relocalization.

Published in: IEEE Transactions on Instrumentation and Measurement (Volume: 72)

Article Sequence Number: 5014016

Date of Publication: 28 April 2023

DOI: 10.1109/TIM.2023.3271000

I. Introduction

Efficient and robust extraction of image keypoints and descriptors is critical to many resource-constrained visual measurement applications, such as simultaneous localization and mapping (SLAM) [1], computational photography [2], and visual place recognition [3]. Early methods for keypoint detection and descriptor extraction relied on human heuristics [4], [5], [6]. However, these handcrafted methods are not sufficiently efficient or robust. To address these issues, many data-driven approaches based on deep neural networks (DNNs) have emerged in recent years. Initially, DNNs were used to extract descriptors of image patches at predefined keypoints [7]. Subsequently, the mainstream approach became the extraction of keypoints and descriptors with a single network [8], [9], [10], which can often extract more robust keypoints and more discriminative descriptors than handcrafted methods [11]. We refer to these methods as map-based methods because they estimate a score map and a descriptor map using two heads: the score map head (SMH) and the descriptor map head (DMH). Keypoints are then extracted from the score map, and descriptors are extracted from the descriptor map.
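To make the map-based pipeline concrete, the following is a minimal sketch in plain Python, not the implementation of any cited method. It assumes the network has already produced a score map and a dense descriptor map, keeps pixels that are local maxima of the score map above a threshold, and reads the descriptor stored at each kept pixel. The function name `extract_from_maps`, the `threshold` and `radius` parameters, and the toy inputs are all hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: the function name, threshold, and window radius
# are assumptions for this example, not details taken from the paper.

def extract_from_maps(score_map, desc_map, threshold=0.5, radius=1):
    """Pick local maxima of the score map and read descriptors at those pixels.

    score_map: H x W nested list of floats (a stand-in for the SMH output).
    desc_map:  H x W nested list of descriptor vectors (a stand-in for the DMH output).
    """
    height, width = len(score_map), len(score_map[0])
    keypoints, descriptors = [], []
    for y in range(height):
        for x in range(width):
            score = score_map[y][x]
            if score < threshold:
                continue
            # Simple non-maximum suppression: keep the pixel only if no
            # neighbor in the (2 * radius + 1)^2 window has a higher score.
            window = [
                score_map[ny][nx]
                for ny in range(max(0, y - radius), min(height, y + radius + 1))
                for nx in range(max(0, x - radius), min(width, x + radius + 1))
            ]
            if score >= max(window):
                keypoints.append((x, y))
                descriptors.append(desc_map[y][x])
    return keypoints, descriptors


# Toy 4 x 4 score map with two clear peaks.
score_map = [
    [0.1, 0.2, 0.1, 0.0],
    [0.2, 0.9, 0.3, 0.1],
    [0.1, 0.3, 0.2, 0.7],
    [0.0, 0.1, 0.4, 0.3],
]
# Toy descriptor map: each pixel stores its own (x, y) as a 2-D "descriptor".
desc_map = [[[float(x), float(y)] for x in range(4)] for y in range(4)]

keypoints, descriptors = extract_from_maps(score_map, desc_map)
print(keypoints)    # [(1, 1), (3, 2)]
print(descriptors)  # [[1.0, 1.0], [3.0, 2.0]]
```

Note that in this sketch the descriptor map must be computed for every pixel even though only two descriptors are eventually used, which is the cost that extracting descriptors only at sparse keypoints is meant to avoid.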
