1. Introduction
The predominant image retrieval methods [1], [32], [34] based on deep learning, typically involve mapping both query and gallery images into a shared feature space that is highly discriminative. Within this space, gallery images are then ranked according to their relevance to the query image. How-ever, this feature mapping process often relies on large neural networks, which pose practical challenges for deployment on edge devices in real-world scenarios. Consequently, this ne-cessitates uploading query images to cloud-based platforms for feature extraction, resulting in dependencies on network connectivity and additional computational overhead.