
thestackcoder/OCR

OCR

Optical Character Recognition for PDF Files (Version 1)

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

-> Python 3.5 or higher (recommended) - not tested on 2.7
-> OCR.Space API key
-> PyPDF2 library (for reading and writing PDF documents)

Instructions

1. Get YOUR_API_KEY from https://ocr.space/ocrapi

2. Copy YOUR_API_KEY into pdf_ocr.py

3. Set the 'path' variable to the path of your PDF file

4. Run the program from a shell or an IDE (e.g., Spyder)

5. One file is created for each page of the document.

6. A text file containing the OCR output is created.
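
The steps above can be pictured with the following minimal sketch. It is not the contents of pdf_ocr.py. It only illustrates one way such a workflow might look, under these assumptions: the function names (split_pdf, extract_text, ocr_page, ocr_pdf), the output file names, and the use of the third-party requests package are all hypothetical. PdfReader and PdfWriter are the class names in PyPDF2 2.0 and later; older releases call them PdfFileReader and PdfFileWriter.

```python
import os

API_URL = "https://api.ocr.space/parse/image"  # OCR.Space parse endpoint
API_KEY = "YOUR_API_KEY"                        # step 2: replace with your key


def split_pdf(pdf_path, out_dir="."):
    """Write each page of pdf_path to its own one-page PDF; return the new paths."""
    # Imported lazily so the parsing helper below works without PyPDF2 installed.
    from PyPDF2 import PdfReader, PdfWriter  # PyPDF2 >= 2.0 names (assumption)

    reader = PdfReader(pdf_path)
    base = os.path.splitext(os.path.basename(pdf_path))[0]
    page_paths = []
    for number, page in enumerate(reader.pages, start=1):
        writer = PdfWriter()
        writer.add_page(page)
        # Hypothetical naming scheme: <name>_page_<n>.pdf
        page_path = os.path.join(out_dir, "{}_page_{}.pdf".format(base, number))
        with open(page_path, "wb") as handle:
            writer.write(handle)
        page_paths.append(page_path)
    return page_paths


def extract_text(response_json):
    """Join the ParsedText fields of a decoded OCR.Space JSON response."""
    if response_json.get("IsErroredOnProcessing"):
        raise RuntimeError(response_json.get("ErrorMessage"))
    results = response_json.get("ParsedResults") or []
    return "\n".join(result.get("ParsedText", "") for result in results)


def ocr_page(page_path, api_key=API_KEY):
    """Upload one single-page PDF to OCR.Space and return the recognized text."""
    import requests  # third-party HTTP client, not listed in the prerequisites

    with open(page_path, "rb") as handle:
        response = requests.post(
            API_URL,
            files={"file": handle},
            data={"apikey": api_key, "language": "eng"},
            timeout=60,
        )
    response.raise_for_status()
    return extract_text(response.json())


def ocr_pdf(pdf_path, output_path="output.txt"):
    """Split the PDF, OCR every page, and write all text to one file."""
    texts = [ocr_page(page_path) for page_path in split_pdf(pdf_path)]
    with open(output_path, "w", encoding="utf-8") as out:
        out.write("\n".join(texts))
    return output_path
```

With a valid key in place, calling ocr_pdf("document.pdf") (a hypothetical file name) would leave one single-page PDF per page plus output.txt in the working directory. Sending one page per request is a design choice for this sketch, made so that a failure on one page is easy to trace back to its file.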

Happy OCRing :)

About

Optical Character Recognition v1: converts PDF files to plain text, especially files with more than one page
