I. Introduction
The detection and classification of objects in motion is a classical problem in many fields ranging across health, manufacturing, surveillance, and autonomous transport applications. Traditionally, this problem is addressed from an image processing perspective, where images rendered from various sensing modalities are processed using algorithms that differentiate background and foreground features [1]–[3]. Additionally, the idea of using a system that combines different sensor technologies to tackle this issue has been explored for intelligent vehicles [4], [5] and robotic systems [6].