Learning Visual Voice Activity Detection with an Automatically Annotated Dataset | IEEE Conference Publication