Optical Character Recognition for PDF Files (Version#1)
These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.
-> Python 3.5 or higher (recommended) - not tested on 2.7
-> OCR.Space API
-> PyPDF2 Library (for reading and writing PDF document)
1. Get YOUR_API_KEY from https://ocr.space/ocrapi
2. Copy YOUR_API_KEY to pdf_ocr.py
3. Enter your PDF file path to 'path' variable
4. RUN Program, on shell or through some IDE (e-g, spyder)
5. Number of files will be created, same as the number of pages in document.
5. Text File with output will be created.
Happy OCRing :)