Less is More: CLIPBERT for Video-and-Language Learning via Sparse Sampling | IEEE Conference Publication | IEEE Xplore