Image to Text and Speech Synthesis

Shravya; Keerthi SH; Laxmi V

Volume 10 Issue 5
May-2023
eISSN: 2349-5162

7.95 impact factor calculated by Google scholar

Published Paper ID:
JETIR2305946

Registration ID:
516639

Generating textual descriptions of images is an important topic in computer vision and natural language processing, and a number of deep-learning techniques have been proposed for it. Several existing technologies perform image-to-text conversion, including optical character recognition (OCR) systems, which use computer vision algorithms to identify and extract text from images. Text-to-speech (TTS) systems, in turn, convert textual information into audible speech. Text-to-image conversion, on the other hand, relies on generative models such as deep neural networks, which learn to generate images from textual descriptions. Our proposed methodology converts images into text and speech, and also converts text into images. This process has various applications, such as helping visually impaired individuals understand visual content or generating visual representations of textual information. A Generative Adversarial Network (GAN) based text-to-image generator is used to generate images, and an image captioning method trained on real images is used to generate the captions. The Flickr8k dataset is used. The models are evaluated through both qualitative and quantitative analysis on commonly used evaluation metrics. A text-to-speech synthesizer is an application that converts text into spoken words: it analyzes and processes the text using Natural Language Processing (NLP) and then uses Digital Signal Processing (DSP) to convert the processed text into a synthesized speech representation.
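The two-stage structure of a text-to-speech synthesizer described in the abstract (an NLP front end followed by a DSP back end) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the sample rate, and the digit lookup are all hypothetical, and the DSP stage emits one sine tone per word as a stand-in for real speech synthesis. It only shows how a caption string could flow through text normalization and then waveform generation.

```python
import math
import re
import struct
import wave

SAMPLE_RATE = 16000  # Hz; an assumed value for this sketch

# Hypothetical lookup used by the text-normalization step.
DIGIT_WORDS = ["zero", "one", "two", "three", "four",
               "five", "six", "seven", "eight", "nine"]


def normalize(text: str) -> list[str]:
    """NLP front end: lowercase, spell out single digits, split into word tokens."""
    spelled = re.sub(r"\d", lambda m: f" {DIGIT_WORDS[int(m.group())]} ", text.lower())
    return re.findall(r"[a-z']+", spelled)


def synthesize(tokens: list[str], tone_ms: int = 120, gap_ms: int = 50) -> list[int]:
    """DSP back end (placeholder): one sine tone per token, then a short silence.

    A real synthesizer would map tokens to phonemes and acoustic features;
    here the pitch simply depends on the word length.
    """
    tone_len = SAMPLE_RATE * tone_ms // 1000
    gap_len = SAMPLE_RATE * gap_ms // 1000
    samples: list[int] = []
    for token in tokens:
        freq = 200.0 + 20.0 * len(token)
        samples += [int(12000 * math.sin(2 * math.pi * freq * i / SAMPLE_RATE))
                    for i in range(tone_len)]
        samples += [0] * gap_len
    return samples


def write_wav(path: str, samples: list[int]) -> None:
    """Store 16-bit mono PCM samples in a WAV file."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(SAMPLE_RATE)
        wav.writeframes(struct.pack(f"<{len(samples)}h", *samples))


def caption_to_speech(caption: str, path: str) -> int:
    """Run both stages on a caption and return the number of samples written."""
    samples = synthesize(normalize(caption))
    write_wav(path, samples)
    return len(samples)
```

In the pipeline the abstract outlines, the caption produced by the image captioning model would be the input to `caption_to_speech`; a production system would replace `synthesize` with an actual speech synthesis engine.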

Natural Language Processing, Digital Signal Processing, Image captioning, Speech generation, Generative Adversarial Network.

"Image to Text and Speech Synthesis", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN: 2349-5162, Vol. 10, Issue 5, page no. j1-j5, May-2023, Available: http://www.jetir.org/papers/JETIR2305946.pdf

Published In: Volume 10 | Issue 5 | Year May-2023

DOI (Digital Object Identifier):

Country: Bangalore, Karnataka, India

Area: Engineering

ISSN Number: 2349-5162

Publisher: IJ Publication

Published in:

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975
