zacc806 fd72392833 | ||
---|---|---|
.github | ||
cmaprsrc | ||
docs | ||
pdfminer | ||
samples | ||
tests | ||
tools | ||
venv | ||
.flake8 | ||
.gitignore | ||
CHANGELOG.md | ||
CONTRIBUTING.md | ||
LICENSE | ||
MANIFEST.in | ||
Makefile | ||
README.md | ||
mypy.ini | ||
noxfile.py | ||
requirements.txt | ||
setup.py | ||
some.py | ||
Запрос № 01-7.1_654 от 25.07.2023 (BTS).pdf | ||
Запрос № 01-7.1_654 от 25.07.2023 (BTS).pdf:Zone.Identifier | ||
Исх. № 0145-07-23 от 13.07.2023г. битум ГПК.pdf | ||
Исх. № 0145-07-23 от 13.07.2023г. битум ГПК.pdf:Zone.Identifier | ||
Исх_№_0145_07_23_от_13_07_2023г_ДТ_Бенкала.pdf | ||
Исх_№_0145_07_23_от_13_07_2023г_ДТ_Бенкала.pdf:Zone.Identifier |
README.md
How to use
-
Install Python 3.6 or newer.
-
Install pdfminer.six.
pip install pdfminer.six
-
(Optionally) install extra dependencies for extracting images.
pip install 'pdfminer.six[image]'
-
Install poppler
sudo apt-get update sudo apt-get install poppler-utils
-
Install pytesseract
sudo apt-get install tesseract-ocr