scanned-documents
BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on scanned forms.
Extract content and logical tree structure from textual documents
Small utility to prepare scanned documents. Supports separating PDF files by separator pages and removing blank pages.
Make your PDFs look like they were scanned