Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch | IEEE Conference Publication | IEEE Xplore