UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014
Call for Paper
Volume 11 | Issue 5 | May 2024

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 5 Issue 9
September-2018
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR1809485


Registration ID:
188755

Page Number

540-543

Share This Article


Jetir RMS

Title

A Review Paper on Big Data and Hadoop Technology

Authors

Abstract

The term ‘Big Data’ describes innovative techniques and technologies to capture, store, distribute, manage and analyse petabyte- or larger-sized datasets with high-velocity and different structures. The word ‘Big Data’ designates advanced methods and tools to capture, store, distribute, manage and investigate petabyte or larger sized datasets with high velocity and different arrangements. Big data can be organized, unstructured or semi organized, resulting in incapability of predictable data management methods. Put another way, big data is the realization of greater business intelligence by storing, processing, and analysing data that was previously ignored due to the limitations of traditional data management technologies. Hadoop is the main podium for organizing Big Data, and cracks the tricky of creating it convenient for analytics determinations. Hadoop is an open source software project that allows the distributed handling of large datasets across bunches of service servers. It is considered to scale up from a single server to thousands of technologies, with a very high degree of fault tolerance. Big data can be structured, unstructured or semi-structured, resulting in incapability of conventional data management methods. Data is generated from various different sources and can arrive in the system at various rates. In order to process these large amounts of data in an inexpensive and efficient way, parallelism is used. Big Data is a data whose scale, diversity, and complexity require new architecture, techniques, algorithms, and analytics to manage it and extract value and hidden knowledge from it. Hadoop is the core platform for structuring Big Data, and solves the problem of making it useful for analytics purposes. Hadoop is an open source software project that enables the distributed processing of large data sets across clusters of commodity servers. It is designed to scale up from a single server to thousands of machines, with a very high degree of fault tolerance.

Key Words

Big Data, Hadoop, Map Reduce, HDFS, Hadoop Components

Cite This Article

"A Review Paper on Big Data and Hadoop Technology", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.5, Issue 9, page no.540-543, September-2018, Available :http://www.jetir.org/papers/JETIR1809485.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"A Review Paper on Big Data and Hadoop Technology", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.5, Issue 9, page no. pp540-543, September-2018, Available at : http://www.jetir.org/papers/JETIR1809485.pdf

Publication Details

Published Paper ID: JETIR1809485
Registration ID: 188755
Published In: Volume 5 | Issue 9 | Year September-2018
DOI (Digital Object Identifier):
Page No: 540-543
Country: --, -, - .
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

0002848

Print This Page

Current Call For Paper

Jetir RMS