Conferences >2022 IEEE International Women...

Bengali Caption Generation for Images Using Deep Learning

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Automatic caption generation from images has evolved into an active research topic that requires Natural Language Processing (NLP) and Computer Vision (CV) to comprehend ...Show More

Metadata

Abstract:

Automatic caption generation from images has evolved into an active research topic that requires Natural Language Processing (NLP) and Computer Vision (CV) to comprehend the image input and represent it in text. This can assist visually impaired people by generating text captions of images to understand their surroundings. In this study, we have presented a Long Short-Term Memory (LSTM) based Recurrent Neural Network (RNN) approach, which can generate natural language for an image. A dataset containing 8,000 images and a total of 37611 captions are utilized for training our model. Besides, VVG16 is employed to extract features from images. Finally, performance is evaluated, which shows an accuracy of 66% and BLEU-1, BLEU-2, BLEU-3, and BLEU-4 scores of 0.40, 0.18, 0.11, and 0.03, respectively.

Published in: 2022 IEEE International Women in Engineering (WIE) Conference on Electrical and Computer Engineering (WIECON-ECE)

Date of Conference: 30-31 December 2022

Date Added to IEEE Xplore: 19 June 2023

ISBN Information:

DOI: 10.1109/WIECON-ECE57977.2022.10150494

Conference Location: Naya Raipur, India

Contents

I. Introduction

Deep learning is an exciting field where human activity can be simulated using a machine. Data is provided to the machine then the machine can learn from the pattern and relation and predict the output similar to the input data. Image caption generation is a problem in deep learning, where a machine can learn from input data and generate captions for images that describe the image the most. This image captioning system can solve many problems, i.e., blind people can benefit from this system, where an image can be described using a deep learning model and then converted to audio data.

References is not available for this document.

Bengali Caption Generation for Images Using Deep Learning

Abstract:

Metadata

Abstract:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Bengali Caption Generation for Images Using Deep Learning

Alerts

Abstract:

Metadata

Abstract:

I. Introduction

References