ISSN: 2349-5162 | Impact Factor: 5.87

JETIREXPLORE- Search Thousands of research papers

Published in:

Volume 4 Issue 5
eISSN: 2349-5162

Unique Identifier


Page Number


Share This Article

Jetir RMS

Indexing Partner


Information Extraction From Images Using Pytesseract and NLTK

Download Paper


Images are used in various fields such as advertisements, business purpose, and spreading awareness. Text data present in these images contain useful and helpful information like contact details, hyperlinks, QR codes. Extraction of this information involves detection, localization, tracking, extraction, enhancement, and recognition of the text from a given image. However, variations of text due to differences in size, style, orientation, and alignment, as well as low image contrast and complex background make the problem of automatic text extraction extremely challenging in the computer vision research area. But the difficulty in implementation proves to be useful and fruitful. This project aims at using computer vision (Pytesseract) to extract useful information like text, contact details and hyperlinks from images. The android based app would allow user to upload a photo and enable user in storing the contact details, set a remainder, provide summary of the content of the image, opening of hyperlinks directly from the app without needing to type the URL inside the browser. Thus, making the images a more productive and making the job of the user more easy and convenient.

Key Words

Text classification, Machine Learning, Android, Text extraction, Pytesseract, NLTK.

Cite This Article

"Information Extraction From Images Using Pytesseract and NLTK", International Journal of Emerging Technologies and Innovative Research ( | UGC and issn Approved), ISSN:2349-5162, Vol.4, Issue 5, page no. pp83-84, May-2017, Available at : &

Publication Details

Published Paper ID: JETIR1705019
Registration ID: 170313
Published In: Volume 4 | Issue 5 | Year May-2017
DOI (Digital Object Identifier):
Page No: 83-84
ISSN Number: 2349-5162

Download Paper

UGC and ISSN Approved Journal | Call For Paper
(Volume 5 | Issue 2 | February 2018 |Impact factor 5.87)

Call For Paper | Volume 5 | Issue 2 | Impact factor 5.87

Important Dates Related to Publication Procedure:
  • » Paper Submission Till: 29 February 2018.
  • » UGC and ISSN Approved Journal
  • » Review (Acceptance/Rejection) Notification: Within 02-04 Days.
  • » Paper Publish:Within 02-07 Days after submitting the all documents.
  • » Frequency: Monthly (12 issue Annually)
  • » Journal Type : Open Access
  • » Publication Charges : 1300 INR
  • »Indexing In Google Scholar, ResearcherID Thomson Reuters, Mendeley : reference manager,,, Research Gate, CiteSeerX, DocStoc, ISSUU, Scribd, and many more | High Impact Factor: 5.87 Digital object identifier (DOI) and Hard Copy of certificate Provided.
Publication and Indexing Patner:

Preview This Article


Download PDF



Print This Page

Impact Factor

Impact factor: 5.87

Jetir RMS