UGC Approved Journal no 63975(19)
New UGC Peer-Reviewed Rules

ISSN: 2349-5162 | ESTD Year : 2014
Volume 12 | Issue 10 | October 2025

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 12 Issue 10
October-2025
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR2510349


Registration ID:
570680

Page Number

d356-d360

Share This Article


Jetir RMS

Title

Hybrid Models for Big Data Processing-Combining Hadoop, Spark, and Machine Learning Frameworks

Abstract

Abstract: People utilize MapReduce implementations to process and Analyze massive volumes of data efficiently. Search engines like Google and other data-intensive applications rely on these frameworks to aggregate and manage vast datasets from heterogeneous sources. Such data is essential for enabling data-driven decision-making, predictive modelling, and the delivery of intelligent services. With the exponential growth of data, scalability, flexibility, and ease of use have become critical for high-performance analytics. MapReduce remains one of the most widely adopted frameworks for large-scale data processing due to its scalability, fault tolerance, and simplicity of programming. However, despite these advantages, MapReduce demonstrates certain performance limitations, particularly in iterative and computationally intensive analytical workloads often required by modern Machine Learning algorithms. Consequently, integrating Machine Learning techniques with MapReduce and Hadoop ecosystems has emerged as a powerful approach for enhancing data processing efficiency, model training, and intelligent pattern discovery. This study explores the intersection of Big Data analytics and Machine Learning within MapReduce-based environments. It investigates performance optimization strategies, hybrid frameworks such as Spark MLlib and Mahout, and discusses how distributed ML algorithms can overcome the limitations of traditional MapReduce processing. The paper aims to provide a comprehensive understanding of these advancements, guide the planning and administration of Big Data and ML-driven projects, and identify promising avenues for future research in scalable intelligent data analytics

Key Words

Keywords- Big Data, Big Data Analytics, MapReduce, Hadoop, NoSQL, HBase, Hive.

Cite This Article

"Hybrid Models for Big Data Processing-Combining Hadoop, Spark, and Machine Learning Frameworks", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.12, Issue 10, page no.d356-d360, October-2025, Available :http://www.jetir.org/papers/JETIR2510349.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"Hybrid Models for Big Data Processing-Combining Hadoop, Spark, and Machine Learning Frameworks", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.12, Issue 10, page no. ppd356-d360, October-2025, Available at : http://www.jetir.org/papers/JETIR2510349.pdf

Publication Details

Published Paper ID: JETIR2510349
Registration ID: 570680
Published In: Volume 12 | Issue 10 | Year October-2025
DOI (Digital Object Identifier): https://doi.org/10.56975/jetir.v12i10.570680
Page No: d356-d360
Country: Dist: Hanamkonda, Telangana, India .
Area: Science & Technology
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

00024

Print This Page

Current Call For Paper

Jetir RMS