UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014

Published in:

Volume 12 Issue 11
November-2025
eISSN: 2349-5162

Impact factor: 7.95 (calculated by Google Scholar)

Published Paper ID: JETIR2511154
Registration ID: 570917
Page Number: b427-b432


Title

End-to-End Video Speech Transcription and Language Translation Using Deep Learning

Abstract

The Intelligent Multimedia Summarization and Translation System is a unified platform for summarizing video, audio, text, and PDF content, built with Python, deep learning, and NLP frameworks. It uses CNNs and Transformers for video analysis, context-aware NLP for text and audio, and robust pipelines for document parsing, and it delivers accurate, coherent summaries through a web-based interface. A key enhancement is the integration of Google Translate APIs, which translate the generated summaries into languages such as English, Hindi, French, Spanish, and German. Combining summarization with translation makes the summaries accessible to a multilingual audience, supports diverse application domains, and helps address information overload with a scalable solution.

Key Words

Multimedia Summarization, Deep Learning, Natural Language Processing, Video Summarization, Audio Transcription, PDF Summarization, Multilingual Translation

Cite This Article

"End-to-End Video Speech Transcription and Language Translation Using Deep Learning", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN: 2349-5162, Vol. 12, Issue 11, pp. b427-b432, November 2025. Available: http://www.jetir.org/papers/JETIR2511154.pdf


An international, scholarly, open-access, peer-reviewed, refereed journal; multidisciplinary, monthly, and multilanguage. Impact factor 7.95, calculated by Google Scholar and Semantic Scholar.


Publication Details

Published Paper ID: JETIR2511154
Registration ID: 570917
Published In: Volume 12 | Issue 11 | Year November-2025
DOI (Digital Object Identifier):
Page No: b427-b432
Country: Bengaluru, Karnataka, India
Area: Science & Technology
ISSN Number: 2349-5162
Publisher: IJ Publication

