ISSN: 2349-5162 | Impact Factor: 5.87

JETIREXPLORE- Search Thousands of research papers



Published in:

Volume 4 Issue 5
May-2017
eISSN: 2349-5162

Unique Identifier

JETIR1705019

Page Number

83-84

Share This Article


Jetir RMS

Indexing Partner


Title

Information Extraction From Images Using Pytesseract and NLTK

Abstract

Images are used in various fields such as advertisements, business purpose, and spreading awareness. Text data present in these images contain useful and helpful information like contact details, hyperlinks, QR codes. Extraction of this information involves detection, localization, tracking, extraction, enhancement, and recognition of the text from a given image. However, variations of text due to differences in size, style, orientation, and alignment, as well as low image contrast and complex background make the problem of automatic text extraction extremely challenging in the computer vision research area. But the difficulty in implementation proves to be useful and fruitful. This project aims at using computer vision (Pytesseract) to extract useful information like text, contact details and hyperlinks from images. The android based app would allow user to upload a photo and enable user in storing the contact details, set a remainder, provide summary of the content of the image, opening of hyperlinks directly from the app without needing to type the URL inside the browser. Thus, making the images a more productive and making the job of the user more easy and convenient.

Key Words

Text classification, Machine Learning, Android, Text extraction, Pytesseract, NLTK.

Cite This Article

"Information Extraction From Images Using Pytesseract and NLTK", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.4, Issue 5, page no.83-84, May-2017, Available :http://www.jetir.org/papers/JETIR1705019.pdf

Publication Details

Published Paper ID: JETIR1705019
Registration ID: 170313
Published In: Volume 4 | Issue 5 | Year May-2017
DOI (Digital Object Identifier):
Page No: 83-84
ISSN Number: 2349-5162

Preview This Article


Click here for Article Preview

Download PDF

Downloads

00093

Print This Page

Impact Factor

Impact factor: 5.87

Jetir RMS