I. Introduction
Generating realistic images from textual descriptions is a compelling and rapidly evolving area at the intersection of natural language processing and computer vision. Text-to-image synthesis, which produces images from natural-language inputs, has attracted growing interest in fields such as content creation, e-commerce, virtual reality, and accessibility for the blind and visually impaired [1]. Among the most successful models for this task are Generative Adversarial Networks (GANs), which learn from paired datasets of images and their textual descriptions to produce highly realistic images [2], [3].
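To make the conditional-GAN formulation referenced above concrete, the sketch below outlines a minimal text-conditioned generator and discriminator in PyTorch. The layer sizes, the use of a pre-computed sentence embedding, and the class names (TextConditionalGenerator, TextConditionalDiscriminator) are illustrative assumptions for exposition only, not the architecture of any model cited in this paper.

```python
# Minimal sketch of a text-conditional GAN (illustrative; sizes and names are assumptions).
import torch
import torch.nn as nn

class TextConditionalGenerator(nn.Module):
    """Maps a noise vector concatenated with a sentence embedding to an image."""
    def __init__(self, noise_dim=100, text_dim=256, img_channels=3, img_size=64):
        super().__init__()
        self.img_size = img_size
        self.net = nn.Sequential(
            nn.Linear(noise_dim + text_dim, 512),
            nn.ReLU(inplace=True),
            nn.Linear(512, img_channels * img_size * img_size),
            nn.Tanh(),  # pixel values scaled to [-1, 1]
        )

    def forward(self, noise, text_emb):
        x = torch.cat([noise, text_emb], dim=1)
        return self.net(x).view(-1, 3, self.img_size, self.img_size)

class TextConditionalDiscriminator(nn.Module):
    """Scores how real an image looks given its paired text embedding."""
    def __init__(self, text_dim=256, img_channels=3, img_size=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(img_channels * img_size * img_size + text_dim, 512),
            nn.LeakyReLU(0.2, inplace=True),
            nn.Linear(512, 1),  # single real/fake logit
        )

    def forward(self, img, text_emb):
        x = torch.cat([img.flatten(start_dim=1), text_emb], dim=1)
        return self.net(x)

if __name__ == "__main__":
    # One adversarial step on a batch of (real image, text embedding) pairs.
    G, D = TextConditionalGenerator(), TextConditionalDiscriminator()
    bce = nn.BCEWithLogitsLoss()
    real_img = torch.rand(4, 3, 64, 64) * 2 - 1   # stand-in for real images
    text_emb = torch.randn(4, 256)                # stand-in for sentence embeddings
    noise = torch.randn(4, 100)
    fake_img = G(noise, text_emb)
    d_loss = bce(D(real_img, text_emb), torch.ones(4, 1)) + \
             bce(D(fake_img.detach(), text_emb), torch.zeros(4, 1))
    g_loss = bce(D(fake_img, text_emb), torch.ones(4, 1))
```

In practice the fully connected layers shown here would be replaced with convolutional and transposed-convolutional blocks, but the conditioning mechanism (concatenating a text embedding with the noise vector and with the image features) is the same.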