UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014
Call for Paper
Volume 11 | Issue 5 | May 2024

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 6 Issue 5
May-2019
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIRBW06034


Registration ID:
207226

Page Number

164-168

Share This Article


Jetir RMS

Title

Characterizing Speakers Using Spectrograms

Abstract

Deep learning models are good at seeing but not so good at hearing. So, when it comes to audio or speech data, we don’t get expected results. Basically, our need was to train a deep learning model, which is able to classify the speeches on the basis of various characteristics of the speaker. In this paper, we tried to visualize differences among speakers on the basis of his Liveliness, Speech Rate and Vocal Depth in the form of images. We did this by taking the audio signals to spatial domain by using a method called Short Term Fourier Transformation. To better visualize the differences we used concept of Mel Spectrograms, which better correlates with human hearing. And we successfully visualized the differences with the help of mel spectrograms. And these differences were very significant and considerable which can be observed by anyone. These observations can be proven very useful in training deep learning models that can evaluate speakers.

Key Words

Spectrograms, Mel Spectrograms, CNNs, Characterizing Speakers

Cite This Article

"Characterizing Speakers Using Spectrograms", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.6, Issue 5, page no.164-168, May-2019, Available :http://www.jetir.org/papers/JETIRBW06034.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"Characterizing Speakers Using Spectrograms", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.6, Issue 5, page no. pp164-168, May-2019, Available at : http://www.jetir.org/papers/JETIRBW06034.pdf

Publication Details

Published Paper ID: JETIRBW06034
Registration ID: 207226
Published In: Volume 6 | Issue 5 | Year May-2019
DOI (Digital Object Identifier): http://doi.one/10.1729/Journal.23855
Page No: 164-168
Country: -, -, - .
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

0002908

Print This Page

Current Call For Paper

Jetir RMS