Learning Semantic Information from Raw Audio Signal Using Both Contextual and Phonetic Representations | IEEE Conference Publication