I. Introduction
Efficient and collision-free navigation in multi-robot systems is fundamental to advancing mobility. The problem, generally referred to as Multi-Robot Path Planning (MRPP) or Multi-Agent Path Finding (MAPF), aims at generating collision-free paths leading robots from their origins to designated destinations. Current approaches can be classified as either coupled or decoupled, depending on the structure of the state space that is searched. While coupled approaches are able to ensure the optimality and completeness of the solution, they involve centralized components, and tend to scale poorly with the number of robots [1], [2]. Decoupled approaches, on the other hand, compute trajectories for each robot separately, and re-plan only in case of conflicts [3], [4], [5]. This can significantly reduce the computational complexity of the planning task, but generally produces suboptimal and incomplete solutions. Balancing optimality and completeness with the complexity of computing a solution, however, is still an open research problem [6], [7].