UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014
Call for Paper
Volume 11 | Issue 5 | May 2024

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 6 Issue 4
April-2019
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR1904E57


Registration ID:
206270

Page Number

339-345

Share This Article


Jetir RMS

Title

CLASSIFICATION OF TEXT DOCUMENTS USING JACCARD AND EUCLIDEAN DISTANCE SIMILARITY MEASURES BASED ON WORD SCORES GENERATED BY RAKE ALGORITHM

Abstract

Day by day the availabilityof text documents are growing in an exponential manner. This growth of text documents throwing challenges to the user and research community in terms of classification. In our paper an attempt was made in finding the keywords in automation way using Rake algorithm on multiple text documents. The Rake algorithm finds more number of Candidate Keywords along with their word scores. In this only stop words was considered. The keywords were sub divided in to High, Middle and low level frequency key words based on word scores and then the similarity measures Jaccard and Euclidean Distance were applied to classify the documents. Comparisons were made among the three approaches High, Middle and low frequency keywords and concluded the results.

Key Words

TEXT CLASSIFICATION, KEYWORD EXTRACTION, RAKE, SIMILARITY MEASURE

Cite This Article

"CLASSIFICATION OF TEXT DOCUMENTS USING JACCARD AND EUCLIDEAN DISTANCE SIMILARITY MEASURES BASED ON WORD SCORES GENERATED BY RAKE ALGORITHM ", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.6, Issue 4, page no.339-345, April-2019, Available :http://www.jetir.org/papers/JETIR1904E57.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"CLASSIFICATION OF TEXT DOCUMENTS USING JACCARD AND EUCLIDEAN DISTANCE SIMILARITY MEASURES BASED ON WORD SCORES GENERATED BY RAKE ALGORITHM ", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.6, Issue 4, page no. pp339-345, April-2019, Available at : http://www.jetir.org/papers/JETIR1904E57.pdf

Publication Details

Published Paper ID: JETIR1904E57
Registration ID: 206270
Published In: Volume 6 | Issue 4 | Year April-2019
DOI (Digital Object Identifier):
Page No: 339-345
Country: Hyderabad, TELANGANA, India .
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

0002842

Print This Page

Current Call For Paper

Jetir RMS