I. Introduction
The ongoing rapid deployment of IoT and edge device systems has led to the generation of a large amount of time series data [1]. Due to incomplete monitoring of sensors, these time series often contain a large number of missing values, making them difficult to directly utilize [2]. Effectively modeling time series with missing values for prediction is a challenging problem [3]. In this context, Multivariate Time Series Forecasting (MTSF) offers a more comprehensive approach by simultaneously considering the interdependencies among multiple time series. However, when missing values are prevalent, traditional MTSF models struggle to capture temporal and spatial dependencies from past to future effectively [4]. To this end, modeling MTSFMV involves two steps: (i) performing imputation around missing values to alleviate data sparsity, and (ii) extracting temporal and spatial dependencies using known and imputed values. Ignoring either of these two steps will lead to degraded performance [5].