I. Introduction
Autonomous robot exploration and active simultaneous localization and mapping (SLAM) have played important roles in different exploration tasks of unknown environments such as subterranean caves, exoplanets, maritime activities, etc [1], [2], [3], [4], and [5]. Early exploration methods such as the probabilistic roadmap (PRM) [6], the rapidly-exploring random trees (RRT) [7], and their derivatives (e.g., RRT*/PRM* [8], Neural RRT* [9]) aimed to explore an interested area rapidly to find a near-optimal/optimal path given kinodynamic constraints of the robot or prior map information.