End-to-End Multi-Person Audio/Visual Automatic Speech Recognition | IEEE Conference Publication | IEEE Xplore