1. Introduction
Visual Place Recognition (VPR) consists of determining the geographical location of a place depicted in a given image, by comparing its visual features to a database of previously visited places. The dynamic and ever-changing nature of real-world environments pose significant challenges for VPR [33], [57]. Factors such as varying lighting conditions, seasonal changes and the presence of dynamic elements such as vehicles and pedestrians introduce considerable variability into the appearance of a place. Additionally, changes in viewpoint and image scale can expose previously obscured areas, further complicating the recognition process. These challenges are exacerbated by the operational constraints of VPR systems, which often need to operate in real-time and under limited memory. Consequently, there is a compelling need for efficient algorithms capable of generating compact yet robust representations.
Recall@1 performance comparison between our proposed technique, Bag-of-Queries (BoQ), and current state of the art methods, Conv-AP [3], CosPlace [11], MixVPR [4] and Eigen-Places [12]. ResNet-50 is used as backbone for all techniques. BoQ consistently achieves better performance in various environment conditions such as viewpoint changes (Pitts-250k [44], MapillarySLS [50]), seasonal changes (Nordland [53]), historical locations (AmsterTime [51]) and extreme lightning and weather conditions (SVOX [10]).