UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014

Published in:

Volume 11 Issue 5
May-2024
eISSN: 2349-5162

Impact factor: 7.95 (calculated by Google Scholar)

Published Paper ID: JETIR2405448
Registration ID: 539908
Page Number: e427-e431


Title

Caption Generator of Image to Text using Deep Learning

Abstract

This paper presents an approach to image captioning that combines Convolutional Neural Networks (CNNs) with Long Short-Term Memory (LSTM) networks. A pre-trained ResNet-50 model serves as the encoder, extracting high-level features from input images, and an LSTM-based decoder turns those features into coherent, contextually relevant sequential descriptions. The model is trained on the Flickr8k dataset, which comprises 8,000 images, each paired with five human-written captions; this variety helps the model learn diverse contexts and generate captions that describe different aspects of an image. The LSTM network is trained to capture temporal dependencies and relationships among the extracted features so that the generated captions are accurate and contextually rich.
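The abstract does not say how the five captions per image are turned into training examples for the LSTM decoder. One common arrangement, assumed here and not taken from the paper, is to wrap each caption in start and end markers and emit one (image, caption prefix, next word) example per position. The sketch below shows that step in plain Python; the names `tokenize`, `build_vocab`, `training_pairs`, the marker strings, and the toy image ids and captions are all hypothetical stand-ins, and in a real pipeline the image id would be replaced by the feature vector produced by the ResNet-50 encoder.

```python
from collections import Counter

# Hypothetical marker tokens; the paper does not name the ones it uses.
START, END = "<start>", "<end>"


def tokenize(caption):
    """Lowercase a caption, keep alphabetic words, and add start/end markers."""
    words = [w for w in caption.lower().split() if w.isalpha()]
    return [START] + words + [END]


def build_vocab(captions_by_image, min_count=1):
    """Map each sufficiently frequent word to an integer id (0 is left for padding)."""
    counts = Counter(
        word
        for captions in captions_by_image.values()
        for caption in captions
        for word in tokenize(caption)
    )
    words = sorted(w for w, c in counts.items() if c >= min_count)
    return {word: idx for idx, word in enumerate(words, start=1)}


def training_pairs(image_id, caption, vocab):
    """Yield (image_id, prefix_ids, next_word_id) for every position in a caption."""
    ids = [vocab[w] for w in tokenize(caption) if w in vocab]
    for i in range(1, len(ids)):
        yield image_id, ids[:i], ids[i]


# Toy stand-in for Flickr8k (assumed data, not from the dataset itself).
captions_by_image = {
    "img_001": ["A dog runs on the grass", "A brown dog is running"],
    "img_002": ["A child plays in the snow"],
}

vocab = build_vocab(captions_by_image)
pairs = list(training_pairs("img_001", "A dog runs on the grass", vocab))

for image_id, prefix, target in pairs:
    print(image_id, prefix, "->", target)
```

Each example asks the decoder to predict the next word given the image and the words so far, which is the sense in which the LSTM learns temporal dependencies. At inference time the same loop runs in reverse: the decoder starts from the start marker and appends its own predictions until it emits the end marker.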

Key Words

Deep Learning, CNN, ResNet50, VGG16, InceptionV3, Xception, MobileNet, BLEU Score, Long Short-Term Memory (LSTM).

Cite This Article

"Caption Generator of Image to Text using Deep Learning", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN: 2349-5162, Vol. 11, Issue 5, pp. e427-e431, May 2024. Available: http://www.jetir.org/papers/JETIR2405448.pdf


Publication Details

Country: West Godavari District, Andhra Pradesh, India .
Area: Engineering
Publisher: IJ Publication

