How to use
-
Install Python 3.6 or newer.
-
Install pdfminer.six.
pip install pdfminer.six
-
(Optional) Install extra dependencies for extracting images.
pip install 'pdfminer.six[image]'
-
Install poppler
sudo apt-get update
sudo apt-get install poppler-utils
-
Install Tesseract OCR (the engine that pytesseract calls)
sudo apt-get install tesseract-ocr
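To confirm the steps above worked, a small check script can help. The following is a minimal sketch, not part of this project: the function name `check_environment` is hypothetical, and it assumes that `pdfminer.six` is importable as `pdfminer`, that `poppler-utils` provides the `pdftoppm` command, and that `tesseract-ocr` provides the `tesseract` command.

```python
import importlib.util
import shutil
import sys

# Assumed mapping from the install steps to what each one provides.
PYTHON_MODULES = ["pdfminer"]                   # from `pip install pdfminer.six`
COMMAND_LINE_TOOLS = ["pdftoppm", "tesseract"]  # from poppler-utils and tesseract-ocr


def check_environment(modules=PYTHON_MODULES, tools=COMMAND_LINE_TOOLS):
    """Return a dict mapping each requirement name to True (found) or False."""
    status = {"python>=3.6": sys.version_info >= (3, 6)}
    for name in modules:
        # find_spec returns None when a top-level module is not installed.
        status[name] = importlib.util.find_spec(name) is not None
    for name in tools:
        # shutil.which returns None when the command is not on PATH.
        status[name] = shutil.which(name) is not None
    return status


if __name__ == "__main__":
    for requirement, ok in check_environment().items():
        print(f"{'OK     ' if ok else 'MISSING'}  {requirement}")
```

Any line reported as MISSING points back to the install step that provides it.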