I. Introduction
This work seeks to improve the learning rate of a cognitive radar network (CRN) by introducing a central coordinator (CC) to provide limited feedback. Specifically in this work we address the role of a central coordinator within a cognitive radar network using an online learning strategy to achieve coordination as well as optimize radar tracking and spectrum sharing performance. Generally, radar networks achieve superior tracking performance than is possible for a single high-powered radar node [2]. This is due in part to the increased spatial diversity [3] and spectral agility [4]. Distributed nodes can cover a greater area to perform detection, and can exploit more spatial degrees of freedom to more accurately estimate target parameters. However, to obtain this superior performance, the individual radar nodes which comprise the radar network must coordinate with each other to efficiently use the available spectrum and avoid causing harmful interference inside or outside the network. At the root, this problem is caused by a fundamental need to both explore the available channels and simultaneously exploit the best channels (in terms of tracking performance).