pdfminer.six/README.md

328 B

How to use

  • Install Python 3.6 or newer.

  • Install pdfminer.six.

    pip install pdfminer.six

  • (Optionally) install extra dependencies for extracting images.

    pip install 'pdfminer.six[image]'

  • Install pytesseract.

    sudo apt install tesseract-ocr

  • Install poppler.

    sudo apt install poppler-utils