I. Introduction
Multi-agent pathfinding (MAPF) is a challenging problem with topical applications in robotics, video games, logistics, etc. Typically, in MAPF, agents are confined to a graph, and at each timestep, an agent can either move to an adjacent vertex or stay put [1]. The task of each agent is to reach a predefined goal vertex. If the graph is undirected, the solution can be found in polynomial time [2] while finding the optimal solution with respect to a range of the objective functions is NP-hard [3]. Moreover, if the graph is directed, even the decision variant of MAPF is intractable [4].