Gaussian Kernel-Based Cross Modal Network for Spatio-Temporal Video Grounding | IEEE Conference Publication