Improving Sequence-to-sequence Voice Conversion by Adding Text-supervision | IEEE Conference Publication | IEEE Xplore