A Generic Markov Decision Process Model and Reinforcement Learning Method for Scheduling Agile Earth Observation Satellites | IEEE Journals & Magazine | IEEE Xplore