Loading [MathJax]/extensions/MathMenu.js
A Framework for Large Scale Semantic Similarity Search on Satellite Imagery | IEEE Conference Publication | IEEE Xplore

A Framework for Large Scale Semantic Similarity Search on Satellite Imagery


Abstract:

Searching for Earth Science phenomena in large archives of Earth Observation Satellite Imagery data requires elaborate processing and spatio-temporal indexing of the imag...Show More

Abstract:

Searching for Earth Science phenomena in large archives of Earth Observation Satellite Imagery data requires elaborate processing and spatio-temporal indexing of the images into categories of the said phenomena. Manual tagging is laborious as it needs constant monitoring through vast volumes of satellite data, the volume and velocity of which is ever-increasing. A complete re-indexing is also needed when a new phenomenon of interest is to be searched through the data archive. Previous efforts to automate tagging have leveraged Machine Learning (ML) techniques to classify images into phenomena of interest. In this method, multiple ML algorithms, each specifically trained for detecting a particular phenomenon, are used for spatio-temporal indexing. While doing so negates the need for human indexing, the process of creating ML models for identifying a class of phenomena involves significant time and computation overhead. Moreover, ML algorithms require vast amounts of extremely scarce labeled data. Furthermore, the computation needed for re-indexing the data whenever a new phenomenon is added to be tagged is not negligible. We propose an alternative, data-driven framework to search through vast amounts of satellite data, that eliminates the need for manual indexing, labeling, or creating purpose-built ML classifiers. The proposed method leverages Self-Supervised Learning (SSL) techniques to obtain feature vectors that are used for search and retrieval of satellite images. An Approximate Nearest Neighbors (ANN) algorithm is used to cluster and retrieve images exhibiting similar features, and by extension, similar Earth Science phenomena. Our unique contribution in this work is the orchestration of the methodology with various cloud services that facilitates searching through millions of images within a short span of time. To showcase the framework, we created a web interface to search through 21 years worth of daily satellite imagery with global coverage. In this paper,...
Date of Conference: 16-21 July 2023
Date Added to IEEE Xplore: 20 October 2023
ISBN Information:

ISSN Information:

Conference Location: Pasadena, CA, USA

1. INTRODUCTION

In the field of remote sensing, the extensive archive of Moderate Resolution Imaging Spectroradiometer (MODIS) satellite imagery offers opportunities for both scientific exploration and practical applications. MODIS provides comprehensive coverage of the Earth's surface every one to two days. This results in a vast amount of data, which, while invaluable, also presents significant challenges due to its sheer volume and complexity. A critical task in this domain is image similarity search, which is essential for applications such as pattern recognition or change or anomaly detection. Traditional methods for image similarity search often rely on handcrafted features or supervised machine learning, which may not fully capture the intricate patterns and structures in the data, particularly when the classes of interest are not fully known beforehand. Moreover, obtaining labels for supervised learning is cumbersome and expensive.

Contact IEEE to Subscribe

References

References is not available for this document.