pytesseract-ocr
asyncio tesseract wrapper for Tesseract-OCR
Converts a whole subdirectory with a big (or small) volume of PDF documents to a dataset (pandas DataFrame) with error tracking and choice of features