Language as Queries for Referring Video Object Segmentation | IEEE Conference Publication | IEEE Xplore