Formation Control Optimization via Virtual Leader Exploration with Deep Reinforcement Learning for Unmanned Aerial Vehicles | IEEE Conference Publication | IEEE Xplore