Using audio and visual cues for speaker diarisation initialisation | IEEE Conference Publication | IEEE Xplore