UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014
Volume 13 | Issue 3 | March 2026


Published in:

Volume 12 Issue 10
October-2025
eISSN: 2349-5162

UGC and ISSN approved | UGC Approved Journal no. 63975 | Impact factor 7.95 (calculated by Google Scholar)

Unique Identifier

Published Paper ID: JETIR2510476
Registration ID: 570813
Page Number: e634-e640


Title

Voice-Guided Object Detection: A Comprehensive Survey

Abstract

Voice-guided object detection is an emerging domain that integrates computer vision, human-computer interaction, and natural language processing (NLP). Such a system scans the surroundings, interprets the objects in them, and informs the user through voice. It assists blind and visually impaired people in navigating roads and indoor environments. It is also used in a variety of applications, including robotics, autonomous driving, and context-aware computing. Several challenges remain to be addressed, including context understanding, real-time processing of data from the surroundings, and identification of objects from a variety of domains. This paper provides a detailed review of the techniques used for object detection and voice guidance, covering approaches based on deep learning (DL), speech-to-vision modelling, and multimodal data fusion. It also identifies challenges, such as generalization and noise reduction, in designing and developing a robust system for voice-guided object detection.

Key Words

Voice-guided object detection, Multimodal learning, Speech–vision integration, Human–computer interaction, Assistive technology, Deep learning, Natural language processing, Visual grounding, Audio–visual perception, Transformer models, Context-aware systems, Real-time object detection, Multimodal fusion, Accessibility, Autonomous systems.

Cite This Article

"Voice-Guided Object Detection: A Comprehensive Survey", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN: 2349-5162, Vol. 12, Issue 10, page no. e634-e640, October-2025, Available: http://www.jetir.org/papers/JETIR2510476.pdf


Publication Details

Published Paper ID: JETIR2510476
Registration ID: 570813
Published In: Volume 12 | Issue 10 | October-2025
DOI (Digital Object Identifier): https://doi.org/10.56975/jetir.v12i10.570813
Page No: e634-e640
Country: Kolhapur, Maharashtra, India
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication

