Policy Iteration Q-Learning for Data-Based Two-Player Zero-Sum Game of Linear Discrete-Time Systems | IEEE Journals & Magazine | IEEE Xplore