I. Introduction
Classification of hyperspectral images is very challenging due to the limited reference data and the high dimensionality of this kind of data. It is common to collect hundreds of bands simultaneously and to provide hyperspectral images with rich spatial and spectral information benefiting from recent technological evolution [1]. Every pixel is represented as a vector corresponding to reflected light collected by sensors with specific wavelengths [2]–[4]. These vectors form the spectral data with high dimensionality. The classification of hyperspectral images is referred to as assigning pixels to given classes based on some labeled pixels, i.e., the training samples. Conventional pixelwise classifiers treat hyperspectral images as vectors of spectral data with no spatial organization [5]. These methods usually suffer from the low classification accuracy and the inhomogeneous appearance of the classification map. This phenomenon is considered to be caused by the high dimensionality of hyperspectral images and the limited training samples [2], [3]. In fact, the pixels of hyperspectral images are usually highly correlated because of the potential of neighboring pixels belonging to the same class. This property implies dependence among pixels and usually is called spatial information. Spatial information shows great potential to provide additional information during the classification process and thus improves the classification performance greatly. Various methods have been developed for the purpose of integrating the spatial information and the spectral data.