A Generalized Minimax Q-Learning Algorithm for Two-Player Zero-Sum Stochastic Games

A Generalized Minimax Q-Learning Algorithm for Two-Player Zero-Sum Stochastic Games | IEEE Journals & Magazine | IEEE Xplore