MULTI-MODAL FUSION FOR ENHANCED IMAGE AND SPEECH RECOGNITION IN AI SYSTEMS

Dr. Valli Madhavi Koti; Meduri Sridhar Sarma; Dr. K Jaya Sudha

Share us

Journal of Emerging Technologies and Innovative Research
( An International Scholarly Open Access Journal, Peer-reviewed, Refereed, Crossref DOI Journal )
Impact factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal

UGC Approved Journal no 63975(19)
New UGC Peer-Reviewed Rules

ISSN: 2349-5162 | ESTD Year : 2014
Volume 13 | Issue 2 | February 2026

JETIREXPLORE- Search Thousands of research papers

Contact Us
Click Here

WhatsApp Contact
Click Here

Published in:

Volume 10 Issue 12
December-2023
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIRGA06042

Registration ID:
530124

Page Number

375-384

Post-Publication

Share This Article

Important Links:

Current Issue

Jetir RMS

Title

MULTI-MODAL FUSION FOR ENHANCED IMAGE AND SPEECH RECOGNITION IN AI SYSTEMS

Abstract

QThis research investigates the integration of multi-modal information, specifically images and speech, to enhance the recognition capabilities of artificial intelligence (AI) systems. Adopting an interpretive philosophy and employing a deductive approach, the study explores the potential of dynamic attention mechanisms, semi-supervised learning, and cross-domain adaptation techniques. A descriptive research design is employed, utilizing secondary data collection from reputable academic sources. The research critically evaluates the feasibility and applicability of hardware optimization for efficient multi-modal processing, considering factors like specialized processors and parallel computing. The study presents a thorough analysis of dynamic attention mechanisms, emphasizing their role in dynamically allocating attention across different modalities based on contextual relevance. Additionally, it delves into semi-supervised learning techniques, showcasing their ability to leverage both labeled and unlabeled data for improved recognition performance. Cross-domain adaptation techniques are explored to facilitate the seamless deployment of multi-modal fusion models in diverse real-world scenarios.

Key Words

AI systems, knowledge, connecting, integrating, multi-modal classification, aural, visual information

Cite This Article

"MULTI-MODAL FUSION FOR ENHANCED IMAGE AND SPEECH RECOGNITION IN AI SYSTEMS", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.10, Issue 12, page no.375-384, December-2023, Available :http://www.jetir.org/papers/JETIRGA06042.pdf

ISSN

2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"MULTI-MODAL FUSION FOR ENHANCED IMAGE AND SPEECH RECOGNITION IN AI SYSTEMS", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.10, Issue 12, page no. pp375-384, December-2023, Available at : http://www.jetir.org/papers/JETIRGA06042.pdf

Publication Details

Published Paper ID: JETIRGA06042

Registration ID: 530124

Published In: Volume 10 | Issue 12 | Year December-2023

DOI (Digital Object Identifier):

Page No: 375-384

Country: -, -, India .

Area: Engineering

ISSN Number: 2349-5162

Publisher: IJ Publication

Published Paper URL :: https://www.jetir.org/view?paper=JETIRGA06042

Published Paper PDF: https://www.jetir.org/papers/JETIRGA06042

Download Paper / Preview Article

Downlaod Paper

Downlaod eCertificate, Confirmation Letter

Download Paper

Downlaod Paper
Downlaod eCertificate, Confirmation Letter

Preview This Article

Downlaod
Click here for Article Preview

Download PDF

Downloads

000327

Print This Page

Impact Factor:

7.95

Impact Factor Calculation click here

Current Call For Paper

Volume 13 | Issue 2
February 2026

Call for Paper
Cilck Here For More Info

Important Links:

Current Issue

Archive

Call for Paper

Submit Manuscript online

Jetir RMS

For Authors:

- Sample Paper Format

- Submit Paper Online

- Call For Paper

- Check Your Paper Status

- Undetaking Form

- Donation

- FAQ

Publications

- Current Issue

- Past Issue

- Special Issues

Proposals:

- Join as Reviewer

- Conference Proposal

- Editorial Board

- Join as JETIR Team

- Join as Volenteer

- Join RMS Program

Policies:

- All Journal Policy related information

Article Correction Policy

Payment Terms & Conditions

Privacy Policy

Disclaimer

REFUND POLICY

Peer Review Policy or Peer Review Statement

COPE Ethics and malpractice statement

Open Access Policy

Approval, Association and indexing

Copyright Infringement Claims

Impact Factor Calculation

FAQ

Contact Us

Home |

Contact Us

Follow Us on

Copyright © - All Rights Reserved - JETIR

Developed by JETIR

Contact Us Click Here

WhatsApp Contact Click Here

Published in:

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

Unique Identifier

Page Number

Post-Publication

Share This Article

Important Links:

Jetir RMS

Title

Authors

Abstract

Key Words

Cite This Article

ISSN

Cite This Article

Publication Details

Download Paper / Preview Article

Download Paper

Preview This Article

Download PDF

Downloads

Print This Page

Impact Factor: 7.95 Impact Factor Calculation click here

Impact Factor:

7.95

Impact Factor Calculation click here

Current Call For Paper

Call for Paper Cilck Here For More Info

Important Links:

Jetir RMS

Contact Us
Click Here

WhatsApp Contact
Click Here

Impact Factor:

7.95

Impact Factor Calculation click here

Call for Paper
Cilck Here For More Info