FSFM: A Feature Square Tower Fusion Module for Multimodal Object Detection

Abstract:

As application demands grow, single-modal data can no longer provide sufficient information for object detection. Processing multimodal information effectively and fusing the modality-specific information of different data sources is a research hotspot in data processing. To this end, this article proposes a feature square tower fusion module, called FSFM, which realizes multimodal feature fusion by aggregating multilevel feature information and is used for object detection. First, the feature square tower strategy is put forward and embedded into the multimodal feature fusion framework; the multilevel features of the two modalities are fused by top-down feature aggregation. Second, a feature constraint module is constructed by minimizing the foreground and background classification losses, constraining the infrared features to make them more salient. Third, a weighted feature fusion strategy based on second-order statistics (SOS) is proposed to guarantee strong discrimination of the fused features in different scenarios. Finally, Faster R-CNN is applied to detect objects from the fused features. To illustrate the effectiveness of the method, object detection experiments are conducted on the multimodal datasets $\text{M}^{3}$FD and multispectral. The results show that the proposed network achieves better fusion detection performance.
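The abstract does not give the exact form of the SOS-based weighting, so the following is only a minimal sketch of what a fusion weighted by second-order statistics could look like. It assumes, purely for illustration, that the statistic is the per-channel spatial variance of each modality's feature map and that the two weights are normalized to sum to one. The function name `sos_weighted_fusion` and its parameters are hypothetical and are not taken from the paper.

```python
import numpy as np


def sos_weighted_fusion(feat_vis, feat_ir, eps=1e-8):
    """Fuse two (C, H, W) feature maps with per-channel variance weights.

    Illustrative assumption only: the per-channel spatial variance stands
    in for the "second-order statistics" named in the abstract. The
    paper's actual weighting may differ.
    """
    if feat_vis.shape != feat_ir.shape:
        raise ValueError("feature maps must have the same shape")

    # Second-order statistic per channel: variance over the spatial axes.
    var_vis = feat_vis.var(axis=(1, 2))
    var_ir = feat_ir.var(axis=(1, 2))

    # Normalize so the visible and infrared weights sum to 1 per channel.
    w_vis = (var_vis / (var_vis + var_ir + eps))[:, None, None]
    w_ir = 1.0 - w_vis

    return w_vis * feat_vis + w_ir * feat_ir


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    visible = rng.normal(size=(4, 8, 8))
    infrared = rng.normal(scale=3.0, size=(4, 8, 8))
    fused = sos_weighted_fusion(visible, infrared)
    print(fused.shape)
```

Under this assumption, the modality whose features vary more within a channel contributes more to that channel of the fused map. In a detection pipeline the fused map would then be passed to the detector.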

Published in: IEEE Transactions on Instrumentation and Measurement (Volume: 72)

Article Sequence Number: 2506611

Date of Publication: 13 February 2023

DOI: 10.1109/TIM.2023.3244210

I. Introduction

With the continuous development of sensor technology, researchers are collecting an ever wider variety of data. Single-modal data, which carry limited information, can no longer meet the accuracy demands of each task. Multimodal fusion technology [1], by contrast, fuses data about the same object from multiple sensors within a unified framework, thereby providing sufficient information for model training. In recent years, the fusion of visible and infrared images [2] has been a research hotspot of multimodal fusion: visible images characterize the brightness of objects and infrared images characterize their heat, so together they provide more information about the object and the background. This fusion therefore has broad application prospects in object detection, video surveillance, location tracking, and so on [3], [4], [5].
