The Reinforce Policy Gradient Algorithm Revisited | IEEE Conference Publication | IEEE Xplore