I. Introduction
In modern society, the importance of smart grids is increasingly prominent. By integrating advanced network technologies and energy management techniques, smart grids not only enhance energy efficiency but also improve network security and demand-side management capabilities [1]. Among the numerous applications of smart grids, Short-Term Load Forecasting (STLF) plays a crucial role and is widely used in power supply scheduling, electricity pricing, renewable energy integration, and reducing grid maintenance costs [2]. However, due to the time series and nonlinear characteristics of load data, STLF faces significant challenges [3].