README.md
How to use
- Install Python 3.6 or newer.
- Install pdfminer.six.

  `pip install pdfminer.six`

- (Optionally) install extra dependencies for extracting images.

  `pip install 'pdfminer.six[image]'`

- Install the Tesseract OCR engine, which pytesseract requires (Debian/Ubuntu).

  `sudo apt install tesseract-ocr`

- Install poppler (Debian/Ubuntu).

  `sudo apt install poppler-utils`
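
After working through the steps above, a quick check can confirm that each piece is discoverable. The following is a minimal sketch, not part of this repository: the helper name `check_setup` is hypothetical, and it assumes that the `pytesseract` Python wrapper is expected alongside the Tesseract engine and that `pdftoppm` is a representative executable from `poppler-utils`. It uses only the standard library and does not import or run any of the tools.

```python
import importlib.util
import shutil


def check_setup():
    """Report which pieces from the install steps can be found.

    Hypothetical helper for illustration; it only looks things up
    and never imports or executes the tools themselves.
    """
    return {
        # pdfminer.six is imported under the package name "pdfminer".
        "pdfminer.six": importlib.util.find_spec("pdfminer") is not None,
        # Assumption: the pytesseract Python wrapper is wanted as well.
        "pytesseract": importlib.util.find_spec("pytesseract") is not None,
        # Executable installed by the tesseract-ocr package.
        "tesseract": shutil.which("tesseract") is not None,
        # Assumption: pdftoppm stands in for the poppler-utils tools.
        "pdftoppm": shutil.which("pdftoppm") is not None,
    }


if __name__ == "__main__":
    for name, found in check_setup().items():
        print(f"{name}: {'ok' if found else 'missing'}")
```

Any entry reported as missing points back to the corresponding install step. The lookup depends on the active Python environment and `PATH`, so run it from the same shell or virtual environment used for the installation.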