I. Introduction
With the rapid development of information technology, various video applications have been widely used in our daily lives, such as network broadcasting, video conferencing and smartphone communications. Meanwhile, users are demanding higher video quality and various ultra-high definition (UHD) video applications are becoming popular, such as 4K/8K video and virtual reality (VR) video, causing the explosive growth of visual data. The previous generation of coding standard, namely High Efficiency Video Coding (HEVC) [1], has gradually failed to meet the market demand for more efficient coding efficiency targets for high-resolution video. As a result, the Joint Video Exploration Team (JVET) has developed the latest generation of video coding standard, i.e., Versatile Video Coding (VVC) [2]. Thanks to a range of new tools, such as the quad-tree plus multi-type tree (QTMT) structure of coding unit (CU) partitioning and the additional intra prediction mode (IPM), VVC achieves significantly higher coding efficiency compared with HEVC [3]. However, its computational complexity is also greatly increased, making VVC unsuitable for real-time applications [4].