Published in:

Volume 5 Issue 8
August-2018
eISSN: 2349-5162

Unique Identifier

JETIR1808813

Page Number

422-426

Title

A SURVEY ON INFORMATION RETRIEVAL USING VARIOUS TECHNIQUES

ISSN

2349-5162

Cite This Article

"A SURVEY ON INFORMATION RETRIEVAL USING VARIOUS TECHNIQUES", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN: 2349-5162, Vol. 5, Issue 8, pp. 422-426, August 2018. Available: http://www.jetir.org/papers/JETIR1808813.pdf

Abstract

Structured data typically follows a predefined format. Semi-structured and unstructured data do not; they include documents, emails, social media posts, images, videos, etc. Text extraction is a critical stage in analyzing journal papers. Journal papers are generally in PDF format, which is semi-structured data. They are organized into sections such as Introduction, Methodology, Experimental, Result, and Conclusion, which makes a paper easy to analyze according to the topic a reader is interested in. The main purpose of section extraction is to find a representative subset of the data that contains the information of the entire set. Research papers can be extracted using approaches such as machine learning and NLP. In this paper we present a review of various techniques for extracting data from a PDF document. Data consolidation is then used to combine the extracted data into structured data. This makes the knowledge extraction process easy to manage and analyze.
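
The idea of section extraction followed by data consolidation can be illustrated with a minimal Python sketch. It is not the method of the surveyed paper, only a simple rule-based illustration. It assumes the text has already been extracted from the PDF into a plain string, and that section headings appear on their own lines. The function name `split_into_sections` and the heading list are hypothetical, chosen from the section names mentioned in the abstract.

```python
import re

# Hypothetical heading list, taken from the section names mentioned in the abstract.
SECTION_HEADINGS = ["Introduction", "Methodology", "Experimental", "Result", "Conclusion"]


def split_into_sections(text, headings=SECTION_HEADINGS):
    """Group the lines of already-extracted paper text under the heading they follow."""
    # A line counts as a heading if, after an optional number prefix such as "1." or "2)",
    # it consists only of a known heading word, optionally pluralized (case-insensitive).
    pattern = re.compile(
        r"^\s*(?:\d+[.)]?\s*)?(" + "|".join(map(re.escape, headings)) + r")s?\s*$",
        re.IGNORECASE,
    )
    sections = {}
    current = None  # text before the first recognized heading is ignored
    for line in text.splitlines():
        match = pattern.match(line)
        if match:
            current = match.group(1).title()
            sections.setdefault(current, [])
        elif current is not None and line.strip():
            sections[current].append(line.strip())
    # Consolidation step: one structured record with a single text field per section.
    return {name: " ".join(lines) for name, lines in sections.items()}


sample = """A Sample Paper
1. Introduction
PDF files are semi-structured.
They mix text and layout.
2. Methodology
We compare extraction tools.
"""
record = split_into_sections(sample)
print(record["Introduction"])
```

A rule-based splitter like this only works when headings are clean and predictable; the machine learning and NLP approaches named in the abstract would be needed for papers with irregular layouts.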

Key Words

Information extraction, Text Mining, NLP, Machine Learning Methods

Publication Details

Published Paper ID: JETIR1808813
Registration ID: 187521
Published In: Volume 5 | Issue 8 | Year August-2018
DOI (Digital Object Identifier):
Page No: 422-426
ISSN Number: 2349-5162
