Discretizing Continuous Action Space With Unimodal Probability Distributions for On-Policy Reinforcement Learning | IEEE Journals & Magazine | IEEE Xplore