Video Question Answering Using CLIP-Guided Visual-Text Attention | IEEE Conference Publication | IEEE Xplore