Unpaired Image-to-Speech Synthesis With Multimodal Information Bottleneck | IEEE Conference Publication | IEEE Xplore