1. INTRODUCTION
Speech recognizers trained in clean conditions normally perform poorly in noisy environments, due to the mismatch between training and testing data. Numerous efforts have been made to improve either the front-ends [1] [2] [3] or the back-ends [4] [5] of the recognizers for noisy environments. Certain progress have been made, but the overall performance is still not satisfactory, due to several issues. First, many methods impractically require noise information beforehand. Second, methods adapted to certain noise conditions may not generalize well to different noises or even different noise levels. And third, non-stationary noises and low signal-to-noise ratio (SNR) cases is still an open problem.