Loading [MathJax]/extensions/MathMenu.js
LatentBKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces With Quantifiable Uncertainty | IEEE Journals & Magazine | IEEE Xplore

LatentBKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces With Quantifiable Uncertainty


Abstract:

This letter introduces a novel probabilistic mapping algorithm, LatentBKI, which enables open-vocabulary mapping with quantifiable uncertainty. Traditionally, semantic ma...Show More

Abstract:

This letter introduces a novel probabilistic mapping algorithm, LatentBKI, which enables open-vocabulary mapping with quantifiable uncertainty. Traditionally, semantic mapping algorithms focus on a fixed set of semantic categories which limits their applicability for complex robotic tasks. Vision-Language (VL) models have recently emerged as a technique to jointly model language and visual features in a latent space, enabling semantic recognition beyond a predefined, fixed set of semantic classes. LatentBKI recurrently incorporates neural embeddings from VL models into a voxel map with quantifiable uncertainty, leveraging the spatial correlations of nearby observations through Bayesian Kernel Inference (BKI). LatentBKI is evaluated against similar explicit semantic mapping and VL mapping frameworks on the popular Matterport3D and Semantic KITTI data sets, demonstrating that LatentBKI maintains the probabilistic benefits of continuous mapping with the additional benefit of open-dictionary queries. Real-world experiments demonstrate applicability to challenging indoor environments.
Published in: IEEE Robotics and Automation Letters ( Volume: 10, Issue: 4, April 2025)
Page(s): 3102 - 3109
Date of Publication: 06 February 2025

ISSN Information:

Funding Agency:


I. Introduction

Robots require informative world models to autonomously navigate the world, commonly known as maps. Mapping methods represent the geometry of the robot's surroundings and often include semantic information relevant to robotic task success. While some works have proposed mapless autonomous navigation [1], [2], maps are commonly used in robotics due to the ability to leverage temporal information within an interpretable world model.

Contact IEEE to Subscribe

References

References is not available for this document.