Generating Accurate and Diverse Audio Captions through Variational Autoencoder Framework | IEEE Journals & Magazine | IEEE Xplore