Learning Optimal Control Policy for Unknown Discrete-Time Systems | IEEE Journals & Magazine | IEEE Xplore