UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014
Call for Paper
Volume 11 | Issue 4 | April 2024

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 9 Issue 1
January-2022
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR2201125


Registration ID:
318888

Page Number

b182-b192

Share This Article


Jetir RMS

Title

DATA ANALYTICS DONE BY REAL TIME DATA USING HADOOP AND SPARK

Authors

Abstract

Data Analytics Done by Real Time Data using Hadoop and Spark is a Business Analytics Application. Its main goal is to migrate and transfer the data from client and Application server to HIVE with the help of Spark. This Application is generally handled by Data Engineer. Develop the data mindset and analytical skills to make informed business decisions here the main decision which I made is deals with data which can be migrated and transfer using Avro format and finally comes in parquet format which is default format of spark is dump in hive. Generally we have Client Server, Application Server, Spark and Hive. My Project Deals with these software’s only. In Client Server data is stored in form of Key Value pair and Maximum deals with MySQL. In Application Server is a Key Data. Spark is used for the speed access and manipulate of the data with the help of .jar files .Spark make use of HBASE to convert Avro format to data frames .These Data Frames are dump into HIVE in tabular format (parquet). Benefit is Client can monitor the transfer through log files. Finally internal data serialization and data migration is done.

Key Words

Hadoop, Real Time, Data Engineer, Business Analytics, Key Value, Avro Format, Spark, Hive, HBase, Application Server, Client Server, MySQL.

Cite This Article

"DATA ANALYTICS DONE BY REAL TIME DATA USING HADOOP AND SPARK", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.9, Issue 1, page no.b182-b192, January-2022, Available :http://www.jetir.org/papers/JETIR2201125.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"DATA ANALYTICS DONE BY REAL TIME DATA USING HADOOP AND SPARK", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.9, Issue 1, page no. ppb182-b192, January-2022, Available at : http://www.jetir.org/papers/JETIR2201125.pdf

Publication Details

Published Paper ID: JETIR2201125
Registration ID: 318888
Published In: Volume 9 | Issue 1 | Year January-2022
DOI (Digital Object Identifier):
Page No: b182-b192
Country: Visakhapatnam, Andhra Pradesh, India .
Area: Science & Technology
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

000315

Print This Page

Current Call For Paper

Jetir RMS