Loading [MathJax]/extensions/MathZoom.js
Text To Image Conversion Using Generative Adversarial NetworK | IEEE Conference Publication | IEEE Xplore

Text To Image Conversion Using Generative Adversarial NetworK


Abstract:

Text-to-image synthesis is a task of making practical photos that fit the textual content descriptions. This is a hard hassle that requires both herbal language know-how ...Show More

Abstract:

Text-to-image synthesis is a task of making practical photos that fit the textual content descriptions. This is a hard hassle that requires both herbal language know-how and pc vision capabilities. Existing AI structures aren't capable of achieve this aim but. However, recent advances in deep mastering have enabled the development of powerful fashions which can study meaningful textual content capabilities and generate realistic photos of precise categories, along with faces, album covers, and room interiors. In this paper, we gift a new deep structure and GAN formula that can successfully integrate those advances in textual content and photograph modeling, and translate visual principles from text to pictures. We show that our version can generate attainable pics of birds and flowers from particular textual content descriptions.
Date of Conference: 29-30 June 2024
Date Added to IEEE Xplore: 08 November 2024
ISBN Information:
Conference Location: Bali, Indonesia
Department of Computer Science and Engineering, B V Raju Institute of Technology, Medak, India
Department of Computer Science and Engineering, B V Raju Institute of Technology, Medak, India
Department of Computer Science and Engineering, B V Raju Institute of Technology, Medak, India
B V Raju Institute of Technology, Narsapur, Telangana State, India
B V Raju Institute of Technology, Narsapur, Telangana State, India

I. Introduction

The method of making practical image from textual content descriptions holds big importance across diverse fields like photograph modifying and laptop-aided layout. Generative Adversarial Networks (GANs) have currently shown promising effects in producing real-global pics. Specifically, conditional-GANs, whilst guided with the aid of text descriptions, can produce pix carefully associated with the provided textual facts. Yet, achieving high-decision, photograph-sensible pics from text poses tremendous demanding situations in GAN training. Attempts to comprise more upsampling layers into cutting-edge GAN fashions, aiming for better resolutions like 256×256, regularly result in schooling instability and nonsensical outputs.

Department of Computer Science and Engineering, B V Raju Institute of Technology, Medak, India
Department of Computer Science and Engineering, B V Raju Institute of Technology, Medak, India
Department of Computer Science and Engineering, B V Raju Institute of Technology, Medak, India
B V Raju Institute of Technology, Narsapur, Telangana State, India
B V Raju Institute of Technology, Narsapur, Telangana State, India
Contact IEEE to Subscribe

References

References is not available for this document.