Aggregating Frame-Level Information in the Spectral Domain With Self-Attention for Speaker Embedding | IEEE Journals & Magazine | IEEE Xplore