UGC Approved Journal no 63975(19)
New UGC Peer-Reviewed Rules

ISSN: 2349-5162 | ESTD Year : 2014
Volume 12 | Issue 10 | October 2025

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 12 Issue 8
August-2025
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR2508275


Registration ID:
568016

Page Number

c546-c550

Share This Article


Jetir RMS

Title

Advancements in Vision-Language Models for Zero-Shot Image Understanding

Abstract

Vision-language models (VLMs) have transformed computer vision by enabling zero-shot image understanding, allowing models to generalize to unseen tasks with- out task-specific training. This paper reviews recent advancements in VLMs, focusing on architectures, pretraining strategies, and applications in zero-shot image classification, object detection, and visual reasoning. We propose a framework integrating contrastive learning, multimodal prompt tuning, and baseline prompts to enhance performance. Experiments on ImageNet, MS COCO, and Visual Genome demonstrate superior accuracy and robustness. We address ethical challenges, such as dataset biases, and propose mitigation strategies. Future directions include scalable and fair VLMs for real-world applications.

Key Words

Vision-Language Models, Zero-Shot Learning, Computer Vision, Multimodal Learning, Ethical AI

Cite This Article

"Advancements in Vision-Language Models for Zero-Shot Image Understanding", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.12, Issue 8, page no.c546-c550, August-2025, Available :http://www.jetir.org/papers/JETIR2508275.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"Advancements in Vision-Language Models for Zero-Shot Image Understanding", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.12, Issue 8, page no. ppc546-c550, August-2025, Available at : http://www.jetir.org/papers/JETIR2508275.pdf

Publication Details

Published Paper ID: JETIR2508275
Registration ID: 568016
Published In: Volume 12 | Issue 8 | Year August-2025
DOI (Digital Object Identifier):
Page No: c546-c550
Country: chengalpattu, tamil nadu , India .
Area: Science
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

000246

Print This Page

Current Call For Paper

Jetir RMS