Loading [MathJax]/extensions/MathZoom.js
Video Frame Interpolation via Direct Synthesis with the Event-based Reference | IEEE Conference Publication | IEEE Xplore

Video Frame Interpolation via Direct Synthesis with the Event-based Reference


Abstract:

Video Frame Interpolation (VFI) has witnessed a surge in popularity due to its abundant downstream applications. Event-based VFI (E-VFI) has recently propelled the ad-van...Show More

Abstract:

Video Frame Interpolation (VFI) has witnessed a surge in popularity due to its abundant downstream applications. Event-based VFI (E-VFI) has recently propelled the ad-vancement of VFI. Thanks to the high temporal resolution benefits, event cameras can bridge the informational void present between successive video frames. Most state-of-the-art E-VFI methodologies follow the conventional VFI paradigm, which pivots on motion estimation between consecutive frames to generate intermediate frames through a process of warping and refinement. However, this reliance engenders a heavy dependency on the quality and consis-tency of keyframes, rendering these methods susceptible to challenges in extreme real-world scenarios, such as missing moving objects and severe occlusion dilemmas. This study proposes a novel E-VFI framework that directly synthesize intermediate frames leveraging event-based reference, obviating the necessity for explicit motion estimation and substantially enhancing the capacity to handle motion occlusion. Given the sparse and inher-ently noisy nature of event data, we prioritize the relia-bility of the event-based reference, leading to the development of an innovative event-aware reconstruction strategy for accurate reference generation. Besides, we implement a bi-directional event-guided alignment from keyframes to the reference using the introduced E-PCD module. Finally, a transformer-based decoder is adopted for prediction re-finement. Comprehensive experimental evaluations on both synthetic and real-world datasets underscore the superiority of our approach and its potential to execute high-quality VFI tasks.
Date of Conference: 16-22 June 2024
Date Added to IEEE Xplore: 16 September 2024
ISBN Information:

ISSN Information:

Conference Location: Seattle, WA, USA

Funding Agency:


1. Introduction

Video frame interpolation (VFI) is an important direction in current computer vision research and finds widespread applications in various domains, including slow-motion generation [1], [16], [46], and video compression [44]. It is primarily used to generate non-existent intermediate frames in video sequences, enabling various effects such as video smoothing and high frame rate conversion. Existing VFI methods mostly employ motion-based strategies, estimating pixel-level motion from keyframes and using warping techniques to obtain interpolated frames. However, these methods have certain limitations when dealing with occlusion and non-linear motion, often struggling to accurately predict motion in complex scenes, thus affecting the quality of interpolated frames.

Qualitative comparison of the occlusion handling. (a) and (b) estimate occlusion mappings by frames, (c) use events for occlusion judgments, and (d) make occlusion judgments for optical flow at the feature level. Our method (e) achieves the best results by no longer estimating the occlusion mapping but giving a direct structural reference.

Contact IEEE to Subscribe

References

References is not available for this document.