I. Introduction
Load forecasting, especially short term load forecasting, has gradually become an important part of the development of smart grids. Improving prediction accuracy of short term loads will assist dispatchers to improve safe and stable operation of the power system. The energy consumption of residential buildings accounts for a large part of the energy use [1]. Their loads are highly correlated with residents' behaviors, which are always fluctuating randomly [2], making prediction tasks much more challenging.