HiT: Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval | IEEE Conference Publication