Gaussian Process Temporal-Difference Learning with Scalability and Worst-Case Performance Guarantees | IEEE Conference Publication | IEEE Xplore