UGC Approved Journal no 63975(19)
New UGC Peer-Reviewed Rules

ISSN: 2349-5162 | ESTD Year : 2014
Volume 13 | Issue 3 | March 2026

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 13 Issue 3
March-2026
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR2603757


Registration ID:
577492

Page Number

h448-h453

Share This Article


Jetir RMS

Title

A Comprehensive Review on Deep Learning Techniques for Audio and Speech Signal Processing.

Abstract

This paper presents a comprehensive review of state-of-the-art deep learning techniques applied to audio signal processing, with a focus on speech, music, and environmental sound analysis. Traditional approaches such as Hidden Markov Models (HMM) and Gaussian Mixture Models (GMM) have been widely used but often struggle with real-world noise, speaker variations, and acoustic complexity. Deep learning architectures including Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Long Short-Term Memory (LSTM) networks, and Generative Adversarial Networks (GANs) have demonstrated remarkable improvements by automatically learning meaningful features from raw audio signals or log-mel spectrograms. hese models have driven significant advancements in automatic speech recognition, music classification, source separation, and audio enhancement. The paper also explores end-to-end neural frameworks that integrate feature extraction and classification within a single system, improving both accuracy and adaptability. Despite notable progress, challenges remain in terms of data availability, computational cost, and interpretability. This study highlights ongoing innovations addressing these issues and provides insights into how deep learning continues to transform the landscape of modern audio and speech processing, with a focus on cross-domain learning and efficient architectures for real-time applications.

Key Words

Deep Learning,Audio Signal Processing,Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN),Speech Recognition,Environmental,SoundClassification,Source Separation,Audio Enhancement,Audio Synthesis,Feature Extraction,Time-Frequency Representation

Cite This Article

"A Comprehensive Review on Deep Learning Techniques for Audio and Speech Signal Processing.", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.13, Issue 3, page no.h448-h453, March-2026, Available :http://www.jetir.org/papers/JETIR2603757.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"A Comprehensive Review on Deep Learning Techniques for Audio and Speech Signal Processing.", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.13, Issue 3, page no. pph448-h453, March-2026, Available at : http://www.jetir.org/papers/JETIR2603757.pdf

Publication Details

Published Paper ID: JETIR2603757
Registration ID: 577492
Published In: Volume 13 | Issue 3 | Year March-2026
DOI (Digital Object Identifier):
Page No: h448-h453
Country: West Godavari, Andhra Pradesh, India .
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

0000

Print This Page

Current Call For Paper

Jetir RMS