UGC Approved Journal no 63975(19)
New UGC Peer-Reviewed Rules

ISSN: 2349-5162 | ESTD Year : 2014
Volume 12 | Issue 12 | December 2025

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 12 Issue 5
May-2025
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR2505709


Registration ID:
562138

Page Number

g94-g100

Share This Article


Jetir RMS

Title

Image Caption Generator

Abstract

Image caption generation is the art and science of automatically producing natural language descriptions for images in a way that closely reflects human interpretation. This technique has applications in various fields such as assistive technologies, content-based image retrieval, autonomous systems, and human-computer interaction. Image captioning combines computer vision and natural language processing to generate descriptive sentences based on the contents of an image. Common approaches include the use of Convolutional Neural Networks (CNNs) for feature extraction and Long Short- Term Memory (LSTM) networks for sequence generation. CNNs are responsible for identifying and encoding spatial features from images, while LSTMs decode these features into grammatically coherent and contextually relevant sentences. The primary objective of image captioning is to accurately capture the semantic content of an image and generate a human-like description. Secondary objectives include maintaining fluency, grammatical correctness, and contextual awareness in the generated captions. Image caption generators offer a powerful tool for automated image understanding and annotation. Advances in attention mechanisms, transformer models, and multimodal learning techniques continue to enhance the accuracy and descriptiveness of caption generation systems. However, ongoing research is essential to address challenges such as bias, hallucination in captions, and performance on complex or ambiguous visual scenes, ensuring the long- term utility and fairness of captioning models.

Key Words

I m a g e C a p t i o n i n g , C N N , L S T M , D e e p L e a r n i n g , F e a t u r e E x t r a c t i o n , S e q u e n c e G e n e r a t i o n .

Cite This Article

"Image Caption Generator", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.12, Issue 5, page no.g94-g100, May-2025, Available :http://www.jetir.org/papers/JETIR2505709.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"Image Caption Generator", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.12, Issue 5, page no. ppg94-g100, May-2025, Available at : http://www.jetir.org/papers/JETIR2505709.pdf

Publication Details

Published Paper ID: JETIR2505709
Registration ID: 562138
Published In: Volume 12 | Issue 5 | Year May-2025
DOI (Digital Object Identifier):
Page No: g94-g100
Country: Nagole,Hyderabad, Telangana, India .
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

000132

Print This Page

Current Call For Paper

Jetir RMS