1. INTRODUCTION
Over the past few years, comprehensive systems of classification, such as decision tree and hierarchical classifier, have been widely applied in classification of remote sensing images [1]. For example, Monteiro and Murphy [2] took into account the improvement of each split of the tree ensemble and calculated a relative measure of the input information. Li et.al.[3] adopted pure pixel index (PPI) to extract endmembers as training samples and used C4.5 decision tree algorithm to classify. Bakos and Gamba [4] introduced a novel methodology to build a multistage hierarchical data processing approach that is able to combine the advantages of different processing chains, which may be best suited for special classes. Jensen et.al.[5] introduced a framework for interpreting a broad range of such regularization matrices in the linear and quadratic discriminant analysis (LDA and QDA, respectively) classifier settings. Suju et.al.[6] used the binary hierarchical classifier to propose a knowledge transfer framework that leverages the information extracted from the existing labeled data to classify spatially separate and multi-temporal test data. Bruzzone et.al.[7] researched the issue of bidirectional reflectance and pointed out the naive use of a classifier which is trained on the available ground-truth data from one region on data that are from spatially or temporally different areas without accounting for the variability of the class signatures, would result in poor classification accuracies.