Abstract
Speech Emotion Recognition (SER) has emerged as a crucial domain within human-computer interaction (HCI), enabling machines to identify and respond to users' emotional states. Unlike traditional text-based sentiment analysis, SER relies on auditory cues, making it more complex due to the dynamic and nuanced nature of speech. With the proliferation of deep learning, especially architectures such as Long Short-Term Memory (LSTM) networks and Convolutional Neural Networks (CNNs), significant strides have been made in extracting emotional patterns from raw audio data. This review examines recent advances in SER, focusing on methodologies that bypass extensive feature engineering by operating directly on raw waveform data. We analyze ten state-of-the-art studies that have contributed novel techniques, including attention mechanisms, hybrid CNN-LSTM models, and end-to-end learning paradigms. Each method is evaluated with respect to the dataset used, reported performance metrics (e.g., accuracy and F1-score), and its limitations. A key insight from the review is the increasing reliance on raw audio inputs, which eliminates the dependency on handcrafted features such as Mel-frequency cepstral coefficients (MFCCs) or spectrograms. However, existing approaches still struggle with generalization, dataset imbalance, and speaker variability. The paper also presents a brief overview of a proposed LSTM-based architecture designed to enhance robustness across diverse speech signals. Our findings highlight gaps in current research and suggest directions for future work, with particular emphasis on multilingual datasets and unsupervised learning techniques for SER.
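To make the hybrid CNN-LSTM paradigm surveyed above concrete, the following is a minimal illustrative sketch of such a model operating on raw waveforms, not the architecture of any specific reviewed study or of the proposed method. The class name `RawAudioCNNLSTM`, the layer sizes, and the eight-class output are all assumptions chosen for illustration.

```python
import torch
import torch.nn as nn

class RawAudioCNNLSTM(nn.Module):
    """Illustrative hybrid CNN-LSTM classifier for raw-waveform SER.

    A 1-D convolutional front end learns local acoustic patterns directly
    from the signal (no MFCCs or spectrograms), and an LSTM models the
    temporal dynamics of the resulting feature sequence. All sizes here
    are assumptions for illustration, not values from the reviewed papers.
    """

    def __init__(self, num_emotions: int = 8):
        super().__init__()
        # Convolutional front end: downsample the waveform while learning
        # local filters (replacing handcrafted feature extraction).
        self.conv = nn.Sequential(
            nn.Conv1d(1, 64, kernel_size=80, stride=4), nn.ReLU(),
            nn.MaxPool1d(4),
            nn.Conv1d(64, 128, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool1d(4),
        )
        # Recurrent back end: capture longer-range emotional dynamics.
        self.lstm = nn.LSTM(input_size=128, hidden_size=128,
                            num_layers=2, batch_first=True)
        self.classifier = nn.Linear(128, num_emotions)

    def forward(self, waveform: torch.Tensor) -> torch.Tensor:
        # waveform: (batch, samples), e.g. a few seconds at 16 kHz
        x = self.conv(waveform.unsqueeze(1))  # (batch, 128, frames)
        x = x.transpose(1, 2)                 # (batch, frames, 128)
        _, (h_n, _) = self.lstm(x)            # final hidden state
        return self.classifier(h_n[-1])       # (batch, num_emotions)


# Example: classify a batch of four 2-second utterances at 16 kHz.
model = RawAudioCNNLSTM(num_emotions=8)
logits = model(torch.randn(4, 32000))
print(logits.shape)  # torch.Size([4, 8])
```

The design mirrors the end-to-end pattern discussed in the review: the network consumes samples rather than precomputed features, so the convolutional filters take over the role that MFCC extraction plays in traditional pipelines.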
Keywords
Speech Emotion Recognition (SER), Raw Audio, LSTM, Deep Learning, CNN, Attention Mechanism, Human-Computer Interaction, Emotion Detection, End-to-End Learning, Neural Networks