ISSN: 2349-5162 | Impact Factor: 5.87

JETIREXPLORE- Search Thousands of research papers



Published in:

Volume 4 Issue 5
May-2017
eISSN: 2349-5162

Unique Identifier

JETIR1705019

Page Number

83-84

Share This Article


Jetir RMS

Indexing Partner


Title

Information Extraction From Images Using Pytesseract and NLTK

Download Paper

Abstract

Images are used in various fields such as advertisements, business purpose, and spreading awareness. Text data present in these images contain useful and helpful information like contact details, hyperlinks, QR codes. Extraction of this information involves detection, localization, tracking, extraction, enhancement, and recognition of the text from a given image. However, variations of text due to differences in size, style, orientation, and alignment, as well as low image contrast and complex background make the problem of automatic text extraction extremely challenging in the computer vision research area. But the difficulty in implementation proves to be useful and fruitful. This project aims at using computer vision (Pytesseract) to extract useful information like text, contact details and hyperlinks from images. The android based app would allow user to upload a photo and enable user in storing the contact details, set a remainder, provide summary of the content of the image, opening of hyperlinks directly from the app without needing to type the URL inside the browser. Thus, making the images a more productive and making the job of the user more easy and convenient.

Key Words

Text classification, Machine Learning, Android, Text extraction, Pytesseract, NLTK.

Cite This Article

"Information Extraction From Images Using Pytesseract and NLTK", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.4, Issue 5, page no. pp83-84, May-2017, Available at : www.jetir.org & http://www.jetir.org/JETIR1705019

Publication Details

Published Paper ID: JETIR1705019
Registration ID: 170313
Published In: Volume 4 | Issue 5 | Year May-2017
DOI (Digital Object Identifier):
Page No: 83-84
ISSN Number: 2349-5162

Download Paper


UGC and ISSN Approved Journal | Call For Paper
(Volume 4 | Issue 11 | November 2017 |Impact factor 5.87)

Call For Paper | Volume 4 | Issue 11 | Impact factor 5.87


Important Dates Related to Publication Procedure:
  • » Paper Submission Till: 29 November 2017.
  • » UGC and ISSN Approved Journal
  • » Review (Acceptance/Rejection) Notification: Within 02-04 Days.
  • » Paper Publish:Within 02-07 Days after submitting the all documents.
  • » Frequency: Monthly (12 issue Annually)
  • » Journal Type : Open Access
  • » Publication Charges : 1300 INR
  • »Indexing In Google Scholar, ResearcherID Thomson Reuters, Mendeley : reference manager, Academia.edu, arXiv.org, Research Gate, CiteSeerX, DocStoc, ISSUU, Scribd, and many more | High Impact Factor: 5.87 Digital object identifier (DOI) and Hard Copy of certificate Provided.
Publication and Indexing Patner:




Preview This Article


Downlaod

Download PDF

Downloads

000700

Print This Page

Impact Factor

Impact factor: 5.87

Jetir RMS