A command line tool written in python that reads a pdf/zip file and outputs a text file using tesseract OCR engine. Given an appropriate alias you can run Input and output OCR samples are available at ...
The course material is organised into two interactive Notebooks, where the participants can actively follow the instructors' examples, as well as explore the provided source code. In this repository, ...