I. Introduction
Graph neural networks (GNNs) are one of the key tools to extract features from networked data [2], [3], [4], [5] with applications in wireless communications [6], [7], [8], multi-agent coordination [9], [10], [11], [12] and recommendation systems [13], [14], [15]. GNNs employ a multi-layered architecture with each layer comprising a graph filter bank and a nonlinearity [16], where the key operation is to shift the signal over the underlying graph and aggregate the information of neighboring nodes to extract features. With the distributed nature of graph filters, GNNs can compute outputs with only local neighborhood information. This makes them well-suited for decentralized tasks over networks, where each node takes actions by only communicating with its neighbors [17], [18]. GNNs utilize the underlying graph topology as prior knowledge for feature extraction and compute actions with extra neighborhood information through message exchanges, which yield superior performance compared to non-graph based approaches in decentralized settings [8], [9], [17], [19], [20], [21], [22], [23].