I. Introduction
The ordinary Least Square Method yields unbiased parameters estimates although the observation vector can be noisy [1], [2]. However, the parameter estimates become biased when the data matrix has noise terms [2], Ch. 7. Equivalently, in the linear regression model, if the regression variables have noise terms, the estimate of the parameter vector loses its consistency. The noise terms in the data matrix may arise, for instance, in system identification or adaptive filtering, where model-order deficiencies or signal noises yield an overdetermined set of equations whose coefficients are noisy [2], Ch. 7–8. Other examples are inverse plant estimation, impulse response estimation, adaptive inverse control, adaptive channel equalization, adaptive infinite-impulse-response (IIR) filtering using the equation-error method, weight-vector updating in neural networks, etc.