An Efficient Off-Policy Reinforcement Learning Algorithm for the Continuous-Time LQR Problem | IEEE Conference Publication | IEEE Xplore