
Multi-Graph Based Hierarchical Semantic Fusion for Cross-Modal Representation


Abstract:

The main challenge in cross-modal retrieval is how to efficiently achieve semantic alignment and reduce the heterogeneity gap. However, existing approaches ignore the multi-grained semantic knowledge that can be learned from different modalities. To this end, this paper proposes a novel end-to-end cross-modal representation method, termed Multi-Graph based Hierarchical Semantic Fusion (MG-HSF). The method integrates multi-graph hierarchical semantic fusion with cross-modal adversarial learning: it captures both fine-grained and coarse-grained semantic knowledge from cross-modal samples and generates modality-invariant representations in a common subspace. To evaluate its performance, extensive experiments are conducted on three benchmarks. The experimental results show that our method is superior to the state of the art.
Date of Conference: 05-09 July 2021
Date Added to IEEE Xplore: 09 June 2021

Conference Location: Shenzhen, China


1. Introduction

With the advent of the big data era, the amount of multi-modal data (e.g., image, text, audio, video, 3D model) on the Internet is growing explosively. This trend brings unprecedented challenges for accurate and efficient cross-modal retrieval [1]. As a hot spot in the multimedia field, cross-modal retrieval aims to retrieve objects of one modality in response to a query from another modality. This technology can be applied in many scenarios, such as multimedia search, recommendation systems, and visual question answering (VQA).
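To make the design described in the abstract concrete, below is a minimal PyTorch sketch of the general idea: a graph convolution over a fine-grained semantic graph (e.g., image regions or words) and a coarse-grained one (e.g., global concepts), fused into a common subspace, with a modality discriminator providing the adversarial signal. This is not the authors' implementation; every module name, dimension, graph construction, and the plain dense GCN layer is an illustrative assumption.

import torch
import torch.nn as nn
import torch.nn.functional as F

class GCNLayer(nn.Module):
    # One dense graph-convolution step: H' = ReLU(A_hat @ H @ W),
    # where A_hat is assumed to be a normalized adjacency matrix.
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.linear = nn.Linear(in_dim, out_dim)

    def forward(self, h, adj):
        return F.relu(adj @ self.linear(h))

class HierarchicalFusion(nn.Module):
    # Runs a GCN over fine-grained nodes and another over coarse-grained
    # nodes, mean-pools each, and projects the concatenation into the
    # common subspace. Pooling and dimensions are illustrative choices.
    def __init__(self, fine_dim, coarse_dim, common_dim):
        super().__init__()
        self.fine_gcn = GCNLayer(fine_dim, common_dim)
        self.coarse_gcn = GCNLayer(coarse_dim, common_dim)
        self.project = nn.Linear(2 * common_dim, common_dim)

    def forward(self, fine_feats, fine_adj, coarse_feats, coarse_adj):
        fine = self.fine_gcn(fine_feats, fine_adj).mean(dim=0)
        coarse = self.coarse_gcn(coarse_feats, coarse_adj).mean(dim=0)
        return self.project(torch.cat([fine, coarse], dim=-1))

class ModalityDiscriminator(nn.Module):
    # Classifies which modality an embedding came from; training the
    # fusion encoders to fool it pushes both modalities toward a shared,
    # modality-invariant distribution (the adversarial component).
    def __init__(self, common_dim):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(common_dim, 64), nn.ReLU(),
                                 nn.Linear(64, 2))  # 2 classes: image / text

    def forward(self, z):
        return self.net(z)

# Toy usage: one image with 5 region nodes and 3 global-concept nodes.
fusion = HierarchicalFusion(fine_dim=128, coarse_dim=64, common_dim=32)
disc = ModalityDiscriminator(common_dim=32)
z_img = fusion(torch.randn(5, 128), torch.eye(5),
               torch.randn(3, 64), torch.eye(3))
modality_logits = disc(z_img.unsqueeze(0))  # adversarial training signal

In a full training loop, the discriminator and the fusion encoders would be updated in alternation (or via a gradient-reversal layer), so that embeddings from both modalities become indistinguishable to the discriminator while a classification or ranking loss preserves their semantics.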

References
1.
Yang Wang, "Survey on deep multi-modal data analytics: Collaboration rivalry and fusion", 2020.
2.
David R Hardoon, Sandor Szedmak and John Shawe-Taylor, "Canonical correlation analysis: An overview with application to learning methods", Neural computation, vol. 16, no. 12, pp. 2639-2664, 2004.
3.
David M Blei, Andrew Y Ng and Michael I Jordan, "Latent Dirichlet allocation", Journal of Machine Learning Research, vol. 3, pp. 993-1022, 2003.
4.
Nikhil Rasiwasia, Jose Costa Pereira, Emanuele Coviello, Gabriel Doyle, Gert RG Lanckriet, Roger Levy, et al., "A new approach to cross-modal multimedia retrieval", Proceedings of the 18th ACM international conference on Multimedia, pp. 251-260, 2010.
5.
Cheng Jin, Wenhui Mao, Ruiqi Zhang, Yuejie Zhang and Xiangyang Xue, "Cross-modal image clustering via canonical correlation analysis", Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, pp. 151-159, 2015.
6.
Viresh Ranjan, Nikhil Rasiwasia and CV Jawahar, "Multi-label cross-modal retrieval", Proceedings of the IEEE International Conference on Computer Vision, pp. 4094-4102, 2015.
7.
David M Blei and Michael I Jordan, "Modeling annotated data", Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 127-134, 2003.
8.
Jing Yu, Yonghui Cong, Zengchang Qin and Tao Wan, "Cross-modal topic correlations for multimedia retrieval", Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), pp. 246-249, 2012.
9.
Lin Wu, Yang Wang and Ling Shao, "Cycle-consistent deep generative hashing for cross-modal retrieval", IEEE Transactions on Image Processing, vol. 28, no. 4, pp. 1602-1612, 2019.
10.
Alex Krizhevsky, Ilya Sutskever and Geoffrey E Hinton, "ImageNet classification with deep convolutional neural networks", Advances in Neural Information Processing Systems, 2012.
11.
Lei Zhu, Jiayu Song, Xiangxiang Wei, Hao Yu and Jun Long, "CAESAR: concept augmentation based semantic representation for cross-modal retrieval", Multimedia Tools and Applications, pp. 1-31, 2020.
12.
Yang Wang, Wenjie Zhang, Lin Wu, Xuemin Lin, Meng Fang and Shirui Pan, "Iterative views agreement: An iterative low-rank based structured optimization method to multi-view spectral clustering", Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI), pp. 2153-2159, 2016.
13.
Yunchao Gong, Qifa Ke, Michael Isard and Svetlana Lazebnik, "A multi-view embedding space for modeling internet images, tags and their semantics", International Journal of Computer Vision, vol. 106, no. 2, pp. 210-233, 2014.
14.
Yang Wang, Xuemin Lin, Lin Wu and Wenjie Zhang, "Effective multi-query expansions: Collaborative deep networks for robust landmark retrieval", IEEE Transactions on Image Processing, vol. 26, no. 3, pp. 1393-1404, 2017.
15.
Yunchao Wei, Yao Zhao, Canyi Lu, Shikui Wei, Luoqi Liu, Zhenfeng Zhu, et al., "Cross-modal retrieval with CNN visual features: A new baseline", IEEE Transactions on Cybernetics, vol. 47, no. 2, pp. 449-460, 2016.
16.
Erkun Yang, Cheng Deng, Wei Liu, Xianglong Liu, Dacheng Tao and Xinbo Gao, "Pairwise relationship guided deep hashing for cross-modal retrieval", Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017.
17.
Yale Song and Mohammad Soleymani, "Polysemous visual-semantic embedding for cross-modal retrieval", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1979-1988, 2019.
18.
Xuanwu Liu, Guoxian Yu, Carlotta Domeniconi, Jun Wang, Yazhou Ren and Maozu Guo, "Ranking-based deep cross-modal hashing", Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 4400-4407, 2019.
19.
Lu Jin, Kai Li, Hao Hu, Guo-Jun Qi and Jinhui Tang, "Semantic neighbor graph hashing for multimodal retrieval", IEEE Transactions on Image Processing, vol. 27, no. 3, pp. 1405-1417, 2018.
20.
Ruiqing Xu, Chao Li, Junchi Yan, Cheng Deng and Xianglong Liu, "Graph convolutional network hashing for cross-modal retrieval", Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI), pp. 982-988, 2019.
21.
Yangtao Wang, Yanzhao Xie, Yu Liu, Ke Zhou and Xiaocui Li, "Fast graph convolution network based multi-label image recognition via cross-modal fusion", Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp. 1575-1584, 2020.
22.
Xiaoze Jiang, Siyi Du, Zengchang Qin, Yajing Sun and Jing Yu, "KBGN: Knowledge-Bridge Graph Network for adaptive vision-text reasoning in visual dialogue", Proceedings of the 28th ACM International Conference on Multimedia, pp. 1265-1273, 2020.
23.
Aashish Kumar Misraa, Ajinkya Kale, Pranav Aggarwal and Ali Aminian, "Multi-modal retrieval using graph neural networks", CoRR, vol. abs/2010.01666, 2020.
24.
Bokun Wang, Yang Yang, Xing Xu, Alan Hanjalic and Heng Tao Shen, "Adversarial cross-modal retrieval", Proceedings of the 25th ACM international conference on Multimedia, pp. 154-162, 2017.
25.
Li He, Xing Xu, Huimin Lu, Yang Yang, Fumin Shen and Heng Tao Shen, "Unsupervised cross-modal retrieval through adversarial learning", 2017 IEEE International Conference on Multimedia and Expo (ICME), pp. 1153-1158, 2017.
26.
Jian Zhang, Yuxin Peng and Mingkuan Yuan, "Unsupervised generative adversarial cross-modal hashing", Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), pp. 539-546, 2018.
27.
Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo and Yantao Zheng, "NUS-WIDE: a real-world web image database from National University of Singapore", Proceedings of the ACM international conference on image and video retrieval, pp. 1-9, 2009.
28.
Cyrus Rashtchian, Peter Young, Micah Hodosh and Julia Hockenmaier, "Collecting image annotations using Amazon's Mechanical Turk", Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, pp. 139-147, 2010.
29.
Liangli Zhen, Peng Hu, Xu Wang and Dezhong Peng, "Deep supervised cross-modal retrieval", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 10394-10403, 2019.
30.
Xin Huang, Yuxin Peng and Mingkuan Yuan, "MHTN: Modal-adversarial hybrid transfer network for cross-modal retrieval", IEEE Transactions on Cybernetics, vol. 50, no. 3, pp. 1047-1059, 2020.
