UGC Approved Journal no 63975(19)
New UGC Peer-Reviewed Rules

ISSN: 2349-5162 | ESTD Year : 2014
Volume 12 | Issue 9 | September 2025

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 6 Issue 3
March-2019
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR1903N89


Registration ID:
503847

Page Number

668-674

Share This Article


Jetir RMS

Title

A Clustering Approach to Reduce Outliers in Large Data Sets

Abstract

Detection of outliers from large datasets is a complicated issue in data mining nowadays. Discovering outliers is very useful in the detection of unrevealed and unpredicted data in various areas like credit cards fraud detection, criminal behaviours, etc. Many outlier detection algorithms like depth-based outlier detection algorithm, K-Means, Distance-based, Iterative K-Means, etc. are already being used in various surveys, research and review articles. When used on small datasets, these algorithms performance is almost the same, but as the size increases from 50 documents to 1000, their efficiency starts varying. A new algorithm is proposed considering the bulkiness of data and complexity of time. Clustering of the data is done in a typical manner to optimize the process of reducing outliers. The proposed approach is an enhanced version of depth based clustering algorithm, which is implemented on real-world data sets and compared with other algorithms based on various parameters.

Key Words

Big Data, K -Means, Clustering, Depth Based Outlier Detection, Outlier Detection, Cosine Similarity.

Cite This Article

"A Clustering Approach to Reduce Outliers in Large Data Sets", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.6, Issue 3, page no.668-674, March-2019, Available :http://www.jetir.org/papers/JETIR1903N89.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"A Clustering Approach to Reduce Outliers in Large Data Sets", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.6, Issue 3, page no. pp668-674, March-2019, Available at : http://www.jetir.org/papers/JETIR1903N89.pdf

Publication Details

Published Paper ID: JETIR1903N89
Registration ID: 503847
Published In: Volume 6 | Issue 3 | Year March-2019
DOI (Digital Object Identifier):
Page No: 668-674
Country: -, -, India .
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

000117

Print This Page

Current Call For Paper

Jetir RMS