1. INTRODUCTION
In the field of remote sensing, the extensive archive of Moderate Resolution Imaging Spectroradiometer (MODIS) satellite imagery offers opportunities for both scientific exploration and practical applications. MODIS provides comprehensive coverage of the Earth's surface every one to two days. This results in a vast amount of data, which, while invaluable, also presents significant challenges due to its sheer volume and complexity. A critical task in this domain is image similarity search, which is essential for applications such as pattern recognition or change or anomaly detection. Traditional methods for image similarity search often rely on handcrafted features or supervised machine learning, which may not fully capture the intricate patterns and structures in the data, particularly when the classes of interest are not fully known beforehand. Moreover, obtaining labels for supervised learning is cumbersome and expensive.