Commit Graph

  • 08af32de8f Checkout code and fix typos. Pieter Marsman 2022-03-15 23:11:48 +0100
  • a1dd2ef5b3 Add github action for releasing to pypi if git tag is added. Pieter Marsman 2022-03-15 23:00:21 +0100
  • 43c8fc8557
    Ignore empty characters when analyzing layout (#689) jwyawney 2022-02-22 15:20:26 -0500
  • 121235e24b
    Raise more specific error if Pillow cannot be imported (#714) Pieter Marsman 2022-02-22 20:20:17 +0100
  • 5c5e3a564b
    Update pdfminer/image.py Pieter Marsman 2022-02-22 20:19:21 +0100
  • a4899d8cd8 Update CHANGELOG.md Pieter Marsman 2022-02-12 14:03:36 +0100
  • 261d8e2283 Update docs Pieter Marsman 2022-02-12 14:02:42 +0100
  • ac03164d7e Improve error message Pieter Marsman 2022-02-12 14:01:06 +0100
  • 7c73770f05 Raise specific warning if Pillow cannot be imported Pieter Marsman 2022-02-12 13:51:14 +0100
  • 5f53ff8156 Blacken Pieter Marsman 2022-02-11 23:11:53 +0100
  • c362e10159 Merge branch 'develop' into 449-remove-empty-lines Pieter Marsman 2022-02-11 23:11:14 +0100
  • 301c21f95b Merge branch 'develop' into 580-memory-leak-fix-with-weak-reference Pieter Marsman 2022-02-11 23:10:26 +0100
  • f904d579a5 Merge branch 'develop' into richardpaulhudson/develop Pieter Marsman 2022-02-11 23:05:35 +0100
  • b9a8920cdf
    Check blackness in github actions (#711) Pieter Marsman 2022-02-11 22:46:51 +0100
  • cd97172e30 Add to checklist for PR Pieter Marsman 2022-02-08 22:29:20 +0100
  • 2e26960e7e Add contributing guidelines on using black Pieter Marsman 2022-02-08 22:24:57 +0100
  • f981164c3f Update github action names Pieter Marsman 2022-02-08 22:18:42 +0100
  • 1961c2c859 Merge branch 'develop' into add-black-formatter Pieter Marsman 2022-02-08 22:16:46 +0100
  • 812a7a8429 Blacken code Pieter Marsman 2022-02-08 22:14:20 +0100
  • 4947344a31 Check blackness in github actions Pieter Marsman 2022-02-08 22:03:15 +0100
  • 830acff94c
    Changed `log.info` to `log.debug` in six files (#690) Pedro Nunes 2022-02-08 17:24:00 -0300
  • 908cffb670 Remove from CHANGELOG.md since no functionality has changed Pieter Marsman 2022-02-08 21:22:09 +0100
  • fb50c54c80 Merge branch 'develop' into pedrounes1/develop Pieter Marsman 2022-02-08 21:18:29 +0100
  • b6568cba1f Fix cicd Pieter Marsman 2022-02-08 21:13:06 +0100
  • 92f33e231e Improve CHANGELOG.md Pieter Marsman 2022-02-08 21:09:19 +0100
  • c3478834c2 Improve CHANGELOG.md Pieter Marsman 2022-02-08 21:08:45 +0100
  • e3573b0cae Merge branch 'develop' into 449-remove-empty-lines Pieter Marsman 2022-02-08 21:07:38 +0100
  • 5fbdfb9fbd Format import Pieter Marsman 2022-02-08 21:07:00 +0100
  • 961652db6d Remove changes to lines that are not actually changed Pieter Marsman 2022-02-08 21:05:39 +0100
  • 329d863a74 Simplify code Pieter Marsman 2022-02-08 21:04:02 +0100
  • 9eb1704266 Simplify code Pieter Marsman 2022-02-08 20:50:56 +0100
  • 484cc4b6b6 Simplify code Pieter Marsman 2022-02-08 20:48:54 +0100
  • 9ee5bec59a Merge branch 'develop' into 580-memory-leak-fix-with-weak-reference Pieter Marsman 2022-02-08 20:24:36 +0100
  • 2254306a52 Update README.md batch for Continuous integration Pieter Marsman 2022-02-02 22:53:17 +0100
  • 1145718f1d Merge branch 'develop' into richardpaulhudson/develop Pieter Marsman 2022-02-02 22:45:24 +0100
  • 81f873e105 Update actions.yml so that it will run for all PR's Pieter Marsman 2022-02-02 22:45:05 +0100
  • 49fb8cb462 Test cicd 2 Pieter Marsman 2022-02-02 22:42:26 +0100
  • f8bcb8ead3 Test cicd Pieter Marsman 2022-02-02 22:34:59 +0100
  • cbfc3aa192 Merge branch 'develop' into richardpaulhudson/develop Pieter Marsman 2022-02-02 22:29:49 +0100
  • b84cfc98e0
    Update development tools: travis ci to github actions, tox to nox, nose to pytest (#704) Pieter Marsman 2022-02-02 22:24:32 +0100
  • 8471af50b3 Remove lru_cache because computation is probably not that heavy, and it might cause bugs, and it is failing the ci pipeline now. Pieter Marsman 2022-02-02 22:23:01 +0100
  • 2016fe2d6e Fix lru_cache Pieter Marsman 2022-02-02 22:21:49 +0100
  • 1adcfa0ba4 Fix imports Pieter Marsman 2022-02-02 22:18:46 +0100
  • 46512aaa7e Replace nose.raises with pytest.raises Pieter Marsman 2022-02-02 22:14:02 +0100
  • 0b481efba1 Merge remote-tracking branch 'origin/develop' into update-tools Pieter Marsman 2022-02-02 22:10:45 +0100
  • d29c18379b Add names for jobs Pieter Marsman 2022-02-02 21:50:48 +0100
  • c4c5354657 Fix error with nox name for mypy Pieter Marsman 2022-02-02 21:47:46 +0100
  • c05f1cb263 Improve actions.yml Pieter Marsman 2022-02-02 21:46:33 +0100
  • 48479c9190
    Update .github/workflows/actions.yml Pieter Marsman 2022-02-02 21:39:20 +0100
  • 1d1602e0c5
    Added feature: page labels (#680) Andrew Baumann 2022-02-01 01:08:05 -0800
  • 7ce329c5d5 fix type errors and cleanup slightly Andrew Baumann 2022-01-31 21:20:11 -0800
  • 8342e02ccd Merge branch 'develop' into page-labels Andrew Baumann 2022-01-31 20:11:01 -0800
  • c371fa7850 Fix line too long in pdfdocument.py Pieter Marsman 2022-02-01 01:55:24 +0100
  • 3d3d3eda30 Replace md5 usage with sha256 knawattranakul 2022-01-31 11:26:07 -0800
  • e0350abcf3 Merge branch 'develop' into update-tools Pieter Marsman 2022-02-01 01:52:47 +0100
  • b19f9e7270
    Remove obsolete returns (#707) Pieter Marsman 2022-02-01 01:49:46 +0100
  • 8aea897940 Remove more empty lines Pieter Marsman 2022-02-01 01:48:23 +0100
  • 88c58219fe Remove empty lines Pieter Marsman 2022-02-01 01:44:21 +0100
  • c7d8b6bb3a Update CHANGELOG.md Pieter Marsman 2022-02-01 01:38:50 +0100
  • 0a56417b36 Remove obsolete returns Pieter Marsman 2022-02-01 01:35:35 +0100
  • 2610ef13af Revert "Remove obsolete returns" Pieter Marsman 2022-02-01 01:36:17 +0100
  • c67abdfab0 Remove obsolete returns Pieter Marsman 2022-02-01 01:35:35 +0100
  • 4b138a6bc5
    Only use xref fallback if `PDFNoValidXRef` is raised and `fallback` is True (#684) Tony(Baojia) Tong 2022-01-31 19:20:52 -0500
  • 3a86ee663d Merge branch 'develop' into TT_fix_fallback Pieter Marsman 2022-02-01 01:19:37 +0100
  • 6adbc1d94a Update changelog.md Pieter Marsman 2022-02-01 01:16:44 +0100
  • 4f01c50fdb Use fallback in except clause Pieter Marsman 2022-02-01 01:15:21 +0100
  • 483a716ac4 PageLabels *is* a NumberTree and should always behave like one. This justifies inheriting its data and behavior. And it simplifies the code a bit more. Pieter Marsman 2022-02-01 01:12:29 +0100
  • 68cef67520 Replace PDFPasswordIncorrect wtih PDFEncryptionWarning knawattranakul 2022-01-31 13:11:02 -0800
  • 010229a4d7 Refactor implementation of get_page_labels() into a NumberTree and PageLabels class. Pieter Marsman 2022-01-31 20:57:17 +0100
  • 78a4dba27e cleanup & respond to review feedback (incomplete) Andrew Baumann 2022-01-30 20:57:31 -0800
  • a061a0de7a Merge branch 'develop' into page-labels Pieter Marsman 2022-01-29 19:43:15 +0100
  • 3ac6bc7c6c Added line to CHANGELOG.md Pieter Marsman 2022-01-29 17:59:13 +0100
  • 13678857b6 Speedup slow tests to save GitHub actions minutes Pieter Marsman 2022-01-29 17:40:28 +0100
  • eebb59019f Remove nose Pieter Marsman 2022-01-29 17:25:30 +0100
  • 6cb2f95113 Run on all commits Pieter Marsman 2022-01-29 16:33:06 +0100
  • 44dd3fcbf8 Add pytest. Pieter Marsman 2022-01-29 16:27:49 +0100
  • 370cc0e2c1 Fix pytest, mypy and flake8 errors Pieter Marsman 2022-01-29 16:26:10 +0100
  • efd4410055 Replace travis with github actions Pieter Marsman 2022-01-29 16:24:36 +0100
  • efc54d4bac Replace tox with nox Pieter Marsman 2022-01-29 16:24:00 +0100
  • dc530f3a6f
    Use logger.warn instead of warnings.warn if warning cannot be prevented by user (#673) htInEdin 2022-01-26 19:41:12 +0000
  • a2b2d4bca6 Add reference to docs describing when to use logger and warnings Pieter Marsman 2022-01-26 20:39:28 +0100
  • 77472b0e3a Use logger.Logger.warn for failed decompression Pieter Marsman 2022-01-26 20:36:22 +0100
  • 6898d9796f Add docs to legacy warnings Pieter Marsman 2022-01-26 20:34:24 +0100
  • 040d26f105 Use logger as name for logger Pieter Marsman 2022-01-26 20:29:34 +0100
  • 767c4ef153 No need for testing if the warning is actually raised. The test_tootls_dumppdf.py are just test cases if these pdfs are supported. Pieter Marsman 2022-01-26 20:27:07 +0100
  • 9d7741f990 Remove patch Pieter Marsman 2022-01-26 20:23:46 +0100
  • 8634d0772f Small textual change Pieter Marsman 2022-01-26 20:21:58 +0100
  • 8473caa983 Update changelog to include pr ref Pieter Marsman 2022-01-26 20:18:15 +0100
  • 5a7a5d2e96 Keep warning classes such that this does not crash code when these warnings are explictly ignored Pieter Marsman 2022-01-26 20:12:14 +0100
  • c4ac514984
    Change log.info into log.debug to make pdfinterp.py less verbose crisptag 2022-01-27 00:27:55 +0530
  • 95dee8d67c
    Fix regression in page layout that sometimes returned text lines out of order (#659) Andrew Baumann 2022-01-26 10:55:08 -0800
  • 5c53867382 Merge branch 'develop' into issue658 Pieter Marsman 2022-01-26 19:54:37 +0100
  • 9a644aae76
    export type annotations in package (#679) Andrew Baumann 2022-01-25 13:11:17 -0800
  • 25503a1ef9
    Merge branch 'develop' into export-types-to-pypi Pieter Marsman 2022-01-25 22:11:07 +0100
  • 24eb15cae5
    fix typos in PR template (#681) Andrew Baumann 2022-01-25 13:08:14 -0800
  • d87bd025dd
    pdf2txt: clean up construction of LAParams from arguments (#682) Andrew Baumann 2022-01-25 13:06:06 -0800
  • c5b015ca6a Also use default values from LAParams for --detect-vertical and --all-texts Pieter Marsman 2022-01-25 21:57:59 +0100
  • 2215fa0405 Add cli argument for line_overlap Pieter Marsman 2022-01-25 21:57:22 +0100
  • d8296f6d0c Improve readability of setting LAParams by explicitly copying them from parsed_args into init of LAParams. And move all parsed_args post processing to the parse_args() method. Pieter Marsman 2022-01-25 21:56:59 +0100
  • 1fa998cb74 Merge branch 'develop' into pdf2txt_laparams Pieter Marsman 2022-01-25 21:42:26 +0100