test/README.md

355 B

How to use

  • Install Python 3.6 or newer.

  • Install pdfminer.six.

    pip install pdfminer.six

  • (Optionally) install extra dependencies for extracting images.

    pip install 'pdfminer.six[image]'

  • Install poppler

    sudo apt-get update sudo apt-get install poppler-utils

  • Install pytesseract sudo apt-get install tesseract-ocr