Video-and-Language (VidL) models and their cognitive relevance | IEEE Conference Publication | IEEE Xplore