Graph-Based Contrastive Learning for Description and Detection of Local Features

Abstract:

In task environments full of repetitive textures, state-of-the-art description and detection methods for local features suffer greatly from “pseudo-negatives,” which introduce inconsistent optimization objectives during training. To address this problem, this article develops GCLFeat, a self-supervised graph-based contrastive learning framework for training local feature models. The proposed approach alleviates pseudo-negatives in three ways: 1) designing a graph neural network (GNN) that mines the local transformational invariance across different views and global textual knowledge within individual images; 2) generating dense correspondence annotations from a diverse natural dataset with a self-supervised paradigm; and 3) adopting a keypoints-aware sampling strategy to compute the loss across the whole dataset. The experimental results show that the unsupervised framework outperforms state-of-the-art supervised baselines on diverse downstream benchmarks, including image matching, 3-D reconstruction, and visual localization. The code will be made public and available at https://github.com/RealZihaoWang/GCLFeat.

Published in: IEEE Transactions on Neural Networks and Learning Systems ( Volume: 35, Issue: 4, April 2024)

Page(s): 4839 - 4851

Date of Publication: 30 September 2022

PubMed ID: 36178997

DOI: 10.1109/TNNLS.2022.3208837

I. Introduction

Establishing accurate correspondences among sequential images plays a crucial role in many computer vision tasks, including wide-baseline stereo [1], image retrieval [2], large-scale visual localization [3], structure-from-motion [4], and 3-D reconstruction [5]. Such correspondences are generally estimated by matching local features, a process that can be subdivided into keypoint detection and description. Learning-based description is supervised with contrastive learning, which repels negative pairs (noncorresponding keypoints) while attracting positive pairs (corresponding keypoints).
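
To make the attract/repel idea concrete, the following is a minimal sketch of a contrastive loss over keypoint descriptors. It assumes an InfoNCE-style formulation with a hypothetical `temperature` parameter; this is a common choice for illustration and is not taken from the article, whose exact loss is not given in this excerpt. The function name `info_nce_loss` and the toy descriptors are also illustrative assumptions.

```python
import math


def l2_normalize(v):
    """Scale a descriptor to unit length so dot products are cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def info_nce_loss(desc_a, desc_b, temperature=0.1):
    """Illustrative InfoNCE-style loss (an assumption, not the article's loss).

    desc_a[i] and desc_b[i] describe the same keypoint in two views
    (positive pair); every desc_b[j] with j != i is treated as a negative.
    """
    a = [l2_normalize(d) for d in desc_a]
    b = [l2_normalize(d) for d in desc_b]
    total = 0.0
    for i, anchor in enumerate(a):
        logits = [dot(anchor, cand) / temperature for cand in b]
        # Numerically stable log-sum-exp over all candidates.
        m = max(logits)
        log_denominator = m + math.log(sum(math.exp(l - m) for l in logits))
        # Low when the positive dominates, high when a negative does.
        total += log_denominator - logits[i]
    return total / len(a)


# Toy 2-D descriptors for two keypoints seen in view A.
view_a = [[1.0, 0.0], [0.0, 1.0]]

# View B descriptors in matching order: positives are similar, loss is small.
aligned = info_nce_loss(view_a, [[0.9, 0.1], [0.1, 0.9]])

# View B descriptors in swapped order: positives are dissimilar, loss is large.
swapped = info_nce_loss(view_a, [[0.1, 0.9], [0.9, 0.1]])
```

In this sketch, a negative whose descriptor is nearly identical to the positive (as can happen with repetitive textures) would still be pushed away from the anchor, which is the kind of conflicting objective that the abstract attributes to pseudo-negatives.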
