I. INTRODUCTION
With the rapid advancement of multimedia technology, the demand for storing and transmitting video is constantly increasing, which in turn calls for more efficient video coding techniques. The Versatile Video Coding (VVC) standard [1] was officially released by the Joint Video Experts Team (JVET) in 2020 and achieves a bit-rate reduction of approximately 50% compared to its predecessor, High Efficiency Video Coding (HEVC) [2], at the same decoded video quality. Meanwhile, the recent rapid development of deep learning has had a tremendous impact on video coding, and deep learning-based coding methods have been actively researched in both academia and the standardization community [3]–[11]. These methods are usually integrated into the hybrid video coding framework to improve the coding performance of a particular module, such as intra prediction [12], [13], inter prediction [14]–[16], in-loop filtering [17], [18], post-processing [19], or rate control [20].