ISSN: 2349-5162 | Impact Factor: 5.87

JETIREXPLORE- Search Thousands of research papers



Published in:

Volume 2 Issue 4
April-2015
eISSN: 2349-5162

Unique Identifier

JETIR1504084

Page Number

1292-1296

Share This Article


Jetir RMS

Indexing Partner


Title

A Survey on Text Mining and Sentiment Analysis for Unstructured Web Data

Download Paper

Abstract

Unstructured data refers to information that doesn’t have a pre-defined data archetype. Unstructured information is typically textual data, but may also contain numerical data, and factual details. This results in data that is obscure, irregular and ambiguous, thus making it difficult to analyse using conventional computing means. Much of the data in the web, in the form of blogs, news, social media platforms is unstructured. But they serve as a potential vast source of information, if processed efficiently. In this paper, the basics of harnessing unstructured data from the web and the techniques to process it are discussed. The concepts of web crawling, text mining and natural language processing are discussed in brief, to give an outline of how web data is processed and analysed. Sentiment Analysis, which is a major aspect of present day NLP, is also described, along with issue of mining from Twitter, which has emerged as the most important data source for NLP in the recent past. The paper concludes with a brief outline of the use of web data mining and analysis, and the potential for future growth in the field.

Key Words

Data Mining, Natural Language Processing (NLP), Sentiment Analysis, Text Mining, Web Crawling

Cite This Article

"A Survey on Text Mining and Sentiment Analysis for Unstructured Web Data", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.2, Issue 4, page no. pp1292-1296, April-2015, Available at : www.jetir.org & http://www.jetir.org/JETIR1504084

Publication Details

Published Paper ID: JETIR1504084
Registration ID: 150376
Published In: Volume 2 | Issue 4 | Year April-2015
DOI (Digital Object Identifier):
Page No: 1292-1296
ISSN Number: 2349-5162

Download Paper


UGC and ISSN Approved Journal | Call For Paper
(Volume 4 | Issue 11 | November 2017 |Impact factor 5.87)

Call For Paper | Volume 4 | Issue 11 | Impact factor 5.87


Important Dates Related to Publication Procedure:
  • » Paper Submission Till: 29 November 2017.
  • » UGC and ISSN Approved Journal
  • » Review (Acceptance/Rejection) Notification: Within 02-04 Days.
  • » Paper Publish:Within 02-07 Days after submitting the all documents.
  • » Frequency: Monthly (12 issue Annually)
  • » Journal Type : Open Access
  • » Publication Charges : 1300 INR
  • »Indexing In Google Scholar, ResearcherID Thomson Reuters, Mendeley : reference manager, Academia.edu, arXiv.org, Research Gate, CiteSeerX, DocStoc, ISSUU, Scribd, and many more | High Impact Factor: 5.87 Digital object identifier (DOI) and Hard Copy of certificate Provided.
Publication and Indexing Patner:




Preview This Article


Downlaod

Download PDF

Downloads

000708

Print This Page

Impact Factor

Impact factor: 5.87

Jetir RMS