1. INTRODUCTION
The aim of artificial bandwidth extension (BWE) is to improve the quality of narrowband (NB) telephony speech (0 - 4 kHz) by artificially extending the signal to wideband (WB), i.e., a bandwidth of 0 - 8 kHz. This is achieved by estimating the missing upper band (UB) between 4 and 8 kHz based on the NB spectrum. In most approaches, the signal is first transformed to the frequency domain and then decomposed into a spectral envelope and an excitation signal. Both these parts are then extended separately, which simplifies the estimation problem. The extension of the excitation signal is typically achieved with rather simple methods like spectral shifting. This is perfectly sufficient, as the introduced degradation of the speech quality is rather low [1], [2]. In early BWE approaches, the estimation of WB features was often performed with codebooks, Gaussian mixture models (GMM), or hidden Markov models (HMM).