End-to-end audio-scene classification from raw audio: Multi time-frequency resolution CNN architecture for efficient representation learning | IEEE Conference Publication | IEEE Xplore