KFNet: Learning Temporal Camera Relocalization Using Kalman Filtering

Abstract:

Temporal camera relocalization estimates the pose with respect to each video frame in sequence, as opposed to one-shot relocalization, which focuses on a still image. Even though the time dependency has been taken into account, current temporal relocalization methods still generally underperform the state-of-the-art one-shot approaches in terms of accuracy. In this work, we improve the temporal relocalization method by using a network architecture that incorporates Kalman filtering (KFNet) for online camera relocalization. In particular, KFNet extends the scene coordinate regression problem to the time domain in order to recursively establish 2D and 3D correspondences for the pose determination. The network architecture design and the loss formulation are based on Kalman filtering in the context of Bayesian learning. Extensive experiments on multiple relocalization benchmarks demonstrate that KFNet ranks at the top of both one-shot and temporal relocalization approaches in accuracy.

Published in: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Date of Conference: 13-19 June 2020

Date Added to IEEE Xplore: 05 August 2020

DOI: 10.1109/CVPR42600.2020.00497

Conference Location: Seattle, WA, USA

1. Introduction

Camera relocalization serves as a subroutine in applications including SLAM [15], augmented reality [9] and autonomous navigation [45]. It estimates the 6-DoF pose of a query RGB image in a known scene coordinate system. Current relocalization approaches mostly focus on one-shot relocalization for a still image. They can be mainly categorized into three classes [13], [50]: (1) the relative pose regression (RPR) methods, which determine the relative pose w.r.t. the database images [3], [29], (2) the absolute pose regression (APR) methods, which regress the absolute pose through PoseNet [25] and its variants [23], [24], [60], and (3) the structure-based methods, which establish 2D-3D correspondences with Active Search [48], [49] or Scene Coordinate Regression (SCoRe) [52] and then solve for the pose with PnP algorithms [18], [42]. In particular, SCoRe has recently been widely adopted to learn per-pixel scene coordinates from dense training data for a scene, due to its ability to form dense and accurate 2D-3D matches even in texture-less scenes [5], [6]. As extensively evaluated in [5], [6], [50], the structure-based methods generally show better pose accuracy than the RPR and APR methods, because they explicitly exploit the rules of projective geometry and the scene structures [50].
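
To make the role of 2D-3D correspondences concrete, the following is a minimal sketch of the quantity a PnP solver tries to minimize: the reprojection error of a candidate pose under a pinhole camera model. It is not the paper's implementation. The function names, the intrinsics (FX, FY, CX, CY), the world-to-camera pose convention (R, t), and the toy data are all assumptions chosen for illustration.

```python
import math

def project(point_w, R, t, fx, fy, cx, cy):
    """Project a 3D scene coordinate into the image (pinhole model).

    R (3x3 nested lists) and t (length 3) are assumed to map scene
    coordinates into the camera frame: x_cam = R @ x_world + t.
    """
    xc = [sum(R[i][j] * point_w[j] for j in range(3)) + t[i] for i in range(3)]
    if xc[2] <= 0:
        raise ValueError("point is behind the camera")
    return (fx * xc[0] / xc[2] + cx, fy * xc[1] / xc[2] + cy)

def mean_reprojection_error(pixels, scene_points, R, t, fx, fy, cx, cy):
    """Average pixel distance between observed and reprojected points."""
    errors = []
    for (u, v), p in zip(pixels, scene_points):
        u_hat, v_hat = project(p, R, t, fx, fy, cx, cy)
        errors.append(math.hypot(u - u_hat, v - v_hat))
    return sum(errors) / len(errors)

# Hypothetical intrinsics and a toy scene (illustrative values only).
FX, FY, CX, CY = 500.0, 500.0, 320.0, 240.0
R_true = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
t_true = [0.0, 0.0, 0.0]
scene_points = [(0.0, 0.0, 2.0), (1.0, 0.0, 2.0), (0.0, 1.0, 4.0), (-0.5, -0.5, 2.0)]

# Simulate perfect 2D-3D correspondences by projecting with the true pose.
pixels = [project(p, R_true, t_true, FX, FY, CX, CY) for p in scene_points]

# The true pose explains the correspondences; a shifted pose does not.
error_true = mean_reprojection_error(pixels, scene_points, R_true, t_true, FX, FY, CX, CY)
t_wrong = [0.1, 0.0, 0.0]
error_wrong = mean_reprojection_error(pixels, scene_points, R_true, t_wrong, FX, FY, CX, CY)
print(error_true, error_wrong)
```

In a structure-based pipeline, the scene points would come from a matching or regression step rather than from a known list, and a PnP solver (typically inside a RANSAC loop) would search for the R and t that keep this error small for as many correspondences as possible.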
