Language Query-Based Transformer With Multiscale Cross-Modal Alignment for Visual Grounding on Remote Sensing Images | IEEE Journals & Magazine | IEEE Xplore