Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task | IEEE Conference Publication | IEEE Xplore