How to use
----------

* Install Python 3.6 or newer.
* Install pdfminer.six.

  `pip install pdfminer.six`
* (Optionally) install extra dependencies for extracting images.

  `pip install 'pdfminer.six[image]'`
* Install pytesseract, along with the Tesseract OCR engine it depends on (the command below is for Debian/Ubuntu).

  `sudo apt install tesseract-ocr`
* Install poppler.

  `sudo apt install poppler-utils`
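Once the steps above are done, a quick check can confirm that everything is reachable. The following is only a minimal sketch, not part of the project: the function name `check_prerequisites` is hypothetical, and it assumes the import names `pdfminer` and `pytesseract` and the executables `tesseract` (from `tesseract-ocr`) and `pdftoppm` (from `poppler-utils`).

```python
import importlib.util
import shutil
import sys

# Assumed names: the `pdfminer` and `pytesseract` import names, and the
# `tesseract` / `pdftoppm` executables provided by the apt packages above.
PYTHON_MODULES = ("pdfminer", "pytesseract")
EXECUTABLES = ("tesseract", "pdftoppm")


def check_prerequisites():
    """Return a dict mapping each prerequisite to True if it was found."""
    found = {"python>=3.6": sys.version_info >= (3, 6)}
    for module in PYTHON_MODULES:
        # find_spec returns None when a top-level module is not installed.
        found[module] = importlib.util.find_spec(module) is not None
    for exe in EXECUTABLES:
        # shutil.which returns None when the executable is not on PATH.
        found[exe] = shutil.which(exe) is not None
    return found


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'ok' if ok else 'MISSING'}")
```

Any entry reported as `MISSING` points back to the corresponding install step. The check only looks for the modules and executables; it does not verify their versions.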