I. Introduction
SATELLITE imaging sensors can now acquire images with a spatial resolution of up to 0.41 m. These images, which are usually called very high resolution (VHR) images, have abundant spatial and structural patterns. However, due to the huge volume of the image data, it is difficult to directly access the VHR data containing the scenes of interest. Due to the complex composition and the large number of land-cover types, efficient representation and recognition of the scenes from VHR data have become challenging problems, which have drawn great interest in the remote sensing field [1].