Policy Iteration Q-Learning for Linear Itô Stochastic Systems With Markovian Jumps and Its Application to Power Systems | IEEE Journals & Magazine | IEEE Xplore