Abstract:
Networked 360^\circ video has become increasingly popular. Despite the immersive experience for users, its sheer data volume, even with the latest H.266 coding and view...Show MoreMetadata
Abstract:
Networked 360^\circ video has become increasingly popular. Despite the immersive experience for users, its sheer data volume, even with the latest H.266 coding and viewport adaptation, remains a significant challenge to today's networks. Recent studies have shown that integrating deep learning into video coding can significantly enhance compression efficiency, providing new opportunities for high-quality video streaming. In this work, we conduct a comprehensive analysis of the potential and issues in applying neural codecs to 360^\circ video streaming. We accordingly present \mathsf {NETA}, a synergistic streaming scheme that merges neural compression with traditional coding techniques, seamlessly implemented within an edge intelligence framework. To address the non-trivial challenges in the short viewport prediction window and time-varying viewing directions, we propose implicit-explicit buffer-based prefetching grounded in content visual saliency and bitrate adaptation with smart model switching around viewports. A novel Lyapunov-guided deep reinforcement learning algorithm is developed to maximize user experience and ensure long-term system stability. We further discuss the concerns towards practical development and deployment and have built a working prototype that verifies \mathsf {NETA}’s excellent performance. For instance, it achieves a 27% increment in viewing quality, a 90% reduction in rebuffering time, and a 64% decrease in quality variation on average, compared to state-of-the-art approaches.
Published in: IEEE Transactions on Multimedia ( Volume: 27)
Funding Agency:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Deep Learning ,
- Learning Algorithms ,
- User Experience ,
- Short Window ,
- Deep Reinforcement Learning ,
- Immersive Experience ,
- Video Coding ,
- Visual Saliency ,
- Deep Reinforcement Learning Algorithm ,
- Compression Efficiency ,
- Prediction Window ,
- Neural Compression ,
- Deep Neural Network ,
- Computational Resources ,
- Network State ,
- Quality Of Experience ,
- Computational Requirements ,
- Peak Signal-to-noise Ratio ,
- Markov Decision Process ,
- Video Content ,
- Neural Decoding ,
- User Quality Of Experience ,
- Decoding Model ,
- Saliency Map ,
- Live Streaming ,
- Video Compression ,
- Decoding Time ,
- Inaccurate Predictions ,
- Head-mounted Display ,
- Edge Nodes
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Deep Learning ,
- Learning Algorithms ,
- User Experience ,
- Short Window ,
- Deep Reinforcement Learning ,
- Immersive Experience ,
- Video Coding ,
- Visual Saliency ,
- Deep Reinforcement Learning Algorithm ,
- Compression Efficiency ,
- Prediction Window ,
- Neural Compression ,
- Deep Neural Network ,
- Computational Resources ,
- Network State ,
- Quality Of Experience ,
- Computational Requirements ,
- Peak Signal-to-noise Ratio ,
- Markov Decision Process ,
- Video Content ,
- Neural Decoding ,
- User Quality Of Experience ,
- Decoding Model ,
- Saliency Map ,
- Live Streaming ,
- Video Compression ,
- Decoding Time ,
- Inaccurate Predictions ,
- Head-mounted Display ,
- Edge Nodes
- Author Keywords