Multi-Modal Speech Emotion Recognition: Integrating Transformer Models and Contextual  Analysis

Shaikh Mohd Ashfaque; Saif H Shaikh; Hemal Naik; Krishna Handa; Atharv Rasal

Share us

Journal of Emerging Technologies and Innovative Research
( An International Scholarly Open Access Journal, Peer-reviewed, Refereed, Crossref DOI Journal )
Impact factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal

UGC Approved Journal no 63975(19)
New UGC Peer-Reviewed Rules

ISSN: 2349-5162 | ESTD Year : 2014
Volume 13 | Issue 3 | March 2026

JETIREXPLORE- Search Thousands of research papers

Contact Us
Click Here

WhatsApp Contact
Click Here

Published in:

Volume 12 Issue 4
April-2025
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR2504D45

Registration ID:
560234

Page Number

n347-n360

Post-Publication

Share This Article

Important Links:

Current Issue

Jetir RMS

Title

Multi-Modal Speech Emotion Recognition: Integrating Transformer Models and Contextual Analysis

Abstract

Speech Emotion Recognition (SER) is an interdisciplinary area that enhances machine understanding of human emotions through voice. It holds vital significance for fields such as healthcare, assistive technologies, intelligent virtual assistants, and affective computing. This paper introduces a comprehensive SER system leveraging deep learning and context-awareness using the Wav2Vec2 transformer model. Our solution uses raw speech input, avoids traditional feature engineering, and improves emotion classification accuracy by integrating speech transcription with sentiment analysis. Using the RAVDESS dataset, we trained and evaluated our model to classify eight emotions with real-time inference capability. Results show 78% overall accuracy with high performance for emotions like Calm, Disgust, and Surprise. This system showcases the future potential of context-enriched emotional AI systems.

Key Words

Audio Classification, Context-Aware Analysis, Deep Learning, Human-Computer Interaction, Real-Time Systems, Speech Emotion Recognition, Transformer Models, Wav2Vec2.

Cite This Article

"Multi-Modal Speech Emotion Recognition: Integrating Transformer Models and Contextual Analysis", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.12, Issue 4, page no.n347-n360, April-2025, Available :http://www.jetir.org/papers/JETIR2504D45.pdf

ISSN

2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"Multi-Modal Speech Emotion Recognition: Integrating Transformer Models and Contextual Analysis", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.12, Issue 4, page no. ppn347-n360, April-2025, Available at : http://www.jetir.org/papers/JETIR2504D45.pdf

Publication Details

Published Paper ID: JETIR2504D45

Registration ID: 560234

Published In: Volume 12 | Issue 4 | Year April-2025

DOI (Digital Object Identifier):

Page No: n347-n360

Country: Mumbai, Maharashtra, India .

Area: Engineering

ISSN Number: 2349-5162

Publisher: IJ Publication

Published Paper URL :: https://www.jetir.org/view?paper=JETIR2504D45

Published Paper PDF: https://www.jetir.org/papers/JETIR2504D45

Download Paper / Preview Article

Downlaod Paper

Downlaod eCertificate, Confirmation Letter

Download Paper

Downlaod Paper
Downlaod eCertificate, Confirmation Letter

Preview This Article

Downlaod
Click here for Article Preview

Download PDF

Downloads

000181

Print This Page

Impact Factor:

7.95

Impact Factor Calculation click here

Current Call For Paper

Volume 13 | Issue 3
March 2026

Call for Paper
Cilck Here For More Info

Important Links:

Current Issue

Archive

Call for Paper

Submit Manuscript online

Jetir RMS

For Authors:

- Sample Paper Format

- Submit Paper Online

- Call For Paper

- Check Your Paper Status

- Undetaking Form

- Donation

- FAQ

Publications

- Current Issue

- Past Issue

- Special Issues

Proposals:

- Join as Reviewer

- Conference Proposal

- Editorial Board

- Join as JETIR Team

- Join as Volenteer

- Join RMS Program

Policies:

- All Journal Policy related information

Article Correction Policy

Payment Terms & Conditions

Privacy Policy

Disclaimer

REFUND POLICY

Peer Review Policy or Peer Review Statement

COPE Ethics and malpractice statement

Open Access Policy

Approval, Association and indexing

Copyright Infringement Claims

Impact Factor Calculation

FAQ

Contact Us

Home |

Contact Us

Follow Us on

Copyright © - All Rights Reserved - JETIR

Developed by JETIR

Contact Us Click Here

WhatsApp Contact Click Here

Published in:

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

Unique Identifier

Page Number

Post-Publication

Share This Article

Important Links:

Jetir RMS

Title

Authors

Abstract

Key Words

Cite This Article

ISSN

Cite This Article

Publication Details

Download Paper / Preview Article

Download Paper

Preview This Article

Download PDF

Downloads

Print This Page

Impact Factor: 7.95 Impact Factor Calculation click here

Impact Factor:

7.95

Impact Factor Calculation click here

Current Call For Paper

Call for Paper Cilck Here For More Info

Important Links:

Jetir RMS

Contact Us
Click Here

WhatsApp Contact
Click Here

Impact Factor:

7.95

Impact Factor Calculation click here

Call for Paper
Cilck Here For More Info