UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014
Call for Paper
Volume 11 | Issue 5 | May 2024

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 5 Issue 7
July-2018
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR1807054


Registration ID:
184353

Page Number

336-344

Share This Article


Jetir RMS

Title

Big Data and Web : DISC in Web Analytics For Large Data Popularity of Online News on Mapreduce Clusters

Abstract

Big data is gathering and analyzing the data. Without analytics, it’s just a bunch of data with limited business use. The big amounts of data storing from various sources the significance of analytics has enormously grown making the companies to tap the bunch of data that was considered useless all these years. The importance of big data is given preference as bound to provide results on the fly. The MapReduce paradigm has long been a staple of big data computational strategies. With the development of the Internet with website, people communicate online news articles every day. The percentage of message communication any news article indicates how popular the news is. The objective of the paper is to find the best model and set of feature to predict the popularity of online news, using machine learning techniques. The dataset is collected from UCI Mashable online news website. For the feature set preprocessing is done using unsupervised learning method AddExpression-E 0.0 expression algorithm. We have implemented two classififiers using ZeroR and RepTree classification. We have successfully implemented two clustering algorithms KMeans clustering and Canopy cluster algorithm. Their performances are recorded and compared. The ZeroR gives the best result with less time complexity of 0.05 seconds and canopy clustering took 2.09 seconds. The research can be used in any organization to choose the classifier and cluster algorithm for process the dataset.

Key Words

Big Data, Web, Machine learning; Classification; Clusters

Cite This Article

"Big Data and Web : DISC in Web Analytics For Large Data Popularity of Online News on Mapreduce Clusters", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.5, Issue 7, page no.336-344, July-2018, Available :http://www.jetir.org/papers/JETIR1807054.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"Big Data and Web : DISC in Web Analytics For Large Data Popularity of Online News on Mapreduce Clusters", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.5, Issue 7, page no. pp336-344, July-2018, Available at : http://www.jetir.org/papers/JETIR1807054.pdf

Publication Details

Published Paper ID: JETIR1807054
Registration ID: 184353
Published In: Volume 5 | Issue 7 | Year July-2018
DOI (Digital Object Identifier): http://doi.one/10.1729/IJCRT.17975
Page No: 336-344
Country: District uttara Kannada, Karnataka, INDIA .
Area: Science & Technology
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

0002938

Print This Page

Current Call For Paper

Jetir RMS