Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation | IEEE Conference Publication | IEEE Xplore