test/README.md

19 lines
355 B
Markdown

How to use
----------
* Install Python 3.6 or newer.
* Install pdfminer.six.
`pip install pdfminer.six`
* (Optionally) install extra dependencies for extracting images.
`pip install 'pdfminer.six[image]'`
* Install poppler
`sudo apt-get update
sudo apt-get install poppler-utils`
* Install pytesseract
`sudo apt-get install tesseract-ocr`