I. Introduction
Unmanned aerial vehicles (UAVs) have emerged as attractive solutions for several applications that require high maneuverability and scalability, such as distributed inspection and surveillance [1]. In particular, as defined in [2], nano-UAVs have proven to be safe to operate near people due to their reduced weight, i.e., below 50 g, which makes them an excellent choice for navigating through indoor or cramped environments [3]. Furthermore, they are agile, small enough to fit in the palm of a hand, and their cost-effective hardware facilitates swarm formations. An Internet of Robotic Things (IoRT) swarm [4] allows for a decreased latency and a higher probability of reaching the mission objective due to the intrinsic redundancy of having more than one drone [5]. Finding gas leaks [6], localizing survivors in mines [7], remote health monitoring of COVID-19 patients [4], or machine-to-machine cooperation with the Internet of Things (IoT) devices [8] are only a few examples where nano-UAVs and IoRT [9], with a single agent or a swarm, can be employed. Within such applications, UAVs have to perceive the environment and compute their next movements to enable optimal mission strategy [10]. However, enabling optimal planning requires good knowledge of the map of the environment as demonstrated in [10].