I. Introduction
Sensors are increasingly deployed in complex real-world systems. Their readings form Multivariate Time Series (MTS), which are in turn used to understand and operate the host systems. For instance, the PEMS [1] dataset consists of traffic data from critical locations in a transportation system, and the Electricity [2] dataset records the electricity consumption of key clients in a power system. Consequently, MTS forecasting has become fundamental to these systems, enabling applications such as traffic management [3], emergency management [4], and resource optimization [5].