Multi-Agent Proximal Policy Optimization for a Deadlock Capable Transport System in a Simulation-Based Learning Environment | IEEE Conference Publication | IEEE Xplore