Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks | IEEE Conference Publication | IEEE Xplore