UGC Approved Journal no 63975(19)
New UGC Peer-Reviewed Rules

ISSN: 2349-5162 | ESTD Year : 2014
Volume 12 | Issue 11 | November 2025

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 12 Issue 11
November-2025
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR2511239


Registration ID:
571510

Page Number

c321-c330

Share This Article


Jetir RMS

Title

AI-Powered Video-RAG for Interactive E-Learning

Abstract

In recent years, artificial intelligence (AI) and multimodal learning technologies have revolutionized the field of e-learning by enabling systems that can understand and generate contextually relevant educational content. However, traditional video-based learning systems still face several limitations, particularly in the areas of information retrieval, personalization, and learner interaction. This research presents an AI-Powered Video Retrieval-Augmented Generation (Video-RAG) system designed specifically for interactive e-learning environments. Inspired by state-of-the-art multimodal retrieval and generative frameworks such as VideoRAG (2025), the proposed model integrates multiple modalities — including video, audio, and textual information — to create a dynamic and adaptive learning experience. The system architecture comprises three core modules: (1) Speech-to-Text Conversion using OpenAI’s Whisper model, which accurately transcribes lecture videos into structured textual data; (2) Multimodal Retrieval utilizing CLIP (Contrastive Language–Image Pretraining) to align visual frames and textual segments for efficient context retrieval; and (3) Generative Response Synthesis, powered by a fine-tuned large language model (LLM), that produces coherent, context-aware answers to user queries. This integration of retrieval and generation capabilities allows students to interactively query lecture content and obtain highly relevant explanations, summaries, or clarifications in real time. The system was evaluated on educational video datasets covering diverse academic domains, including computer science, artificial intelligence, and data science. Quantitative experiments show a significant improvement in retrieval precision, context relevance, and response quality compared to traditional keyword-based or static video learning systems. User studies also reveal enhanced learner engagement, reduced cognitive overload, and higher satisfaction with the system’s adaptive feedback mechanism. Furthermore, the proposed Video-RAG model demonstrates scalability and flexibility, supporting both synchronous and asynchronous learning setups. Overall, this research bridges a crucial gap between multimodal AI systems and interactive e-learning applications by introducing a retrieval-augmented generation framework optimized for long-duration educational videos. The results affirm the potential of Video-RAG architectures to transform digital education by fostering intelligent, context-aware, and personalized learning experiences. The system serves as a foundation for future advancements in AI-driven educational assistants, capable of integrating emotional understanding, adaptive assessment, and domain-specific knowledge generation.

Key Words

AI, Video-RAG, E-Learning, Multimodal Retrieval, Whisper, CLIP, Interactive Learning

Cite This Article

"AI-Powered Video-RAG for Interactive E-Learning", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.12, Issue 11, page no.c321-c330, November-2025, Available :http://www.jetir.org/papers/JETIR2511239.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"AI-Powered Video-RAG for Interactive E-Learning", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.12, Issue 11, page no. ppc321-c330, November-2025, Available at : http://www.jetir.org/papers/JETIR2511239.pdf

Publication Details

Published Paper ID: JETIR2511239
Registration ID: 571510
Published In: Volume 12 | Issue 11 | Year November-2025
DOI (Digital Object Identifier):
Page No: c321-c330
Country: pune, maharashtra, India .
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

00065

Print This Page

Current Call For Paper

Jetir RMS