Commit Graph

  • eefccc2f9d Fix paint_path bug noted in issue #473 Jeremy Singer-Vine 2020-09-26 11:00:02 -0400
  • 5c05e4dcbf Changes from code review Jake Stockwin 2020-09-23 10:00:16 +0100
  • 8099fab57f Deprecate Python 3.4 and 3.5 estshorter 2020-09-21 15:40:56 +0900
  • 3491a66c77 Fix out-of-bound access on some PDFs. Diego Elio Pettenò 2020-08-31 11:46:37 +0100
  • d8148be5a0 flake8 complaints fixed Ev2geny 2020-09-18 19:34:39 +0200
  • ae17f6ed86 TemporaryFilePath context manager is added to facilitate tests Ev2geny 2020-09-18 19:19:01 +0200
  • 5b7bf555e5 murged with master Ev2geny 2020-09-18 18:55:00 +0200
  • f03657e5c4
    Allow a pathlib.PurePath object as a input to open_filename (#492) estshorter 2020-09-18 04:29:00 +0900
  • 69a8018d1e change how default ascent and descent are applied Nathaniel Bush 2020-09-15 16:50:07 -0700
  • 140a6662e3
    Update utils.py ngioiosa 2020-09-14 14:48:24 -0600
  • 0ea14a5625
    Merge pull request #3 from vannevar-labs/pdfminer_layout_update nate-bush 2020-09-14 06:50:14 -0700
  • cd1a6c7049 rename orig vars Nathaniel Bush 2020-09-14 06:49:40 -0700
  • db64da2406 Documentation estshorter 2020-09-13 19:33:49 +0900
  • 575078b5ef Merge branch 'add_test' into develop estshorter 2020-09-13 19:18:12 +0900
  • 923653894e
    Update CHANGELOG.md Pieter Marsman 2020-09-13 11:39:32 +0200
  • 4ece1faa55 Add tests for open_filename estshorter 2020-09-13 01:21:01 +0900
  • bf3f7719a4
    Merge branch 'develop' into develop estshorter 2020-09-13 00:53:33 +0900
  • 023a0e3596 Raise an exception when open_filename gets an unsupported type estshorter 2020-09-12 22:50:37 +0900
  • 20a51a4816 Add link to the PR estshorter 2020-09-12 22:46:35 +0900
  • 2192646bf2 rename orig_bbox Nathaniel Bush 2020-09-11 11:03:07 -0700
  • 5dd5f0b2a8 pdf miner pr feedback Nathaniel Bush 2020-09-11 10:57:28 -0700
  • 82964b365e set default once Nathaniel Bush 2020-09-10 18:16:26 -0700
  • e58994c8d6 set a default ascent and descent Nathaniel Bush 2020-09-10 17:44:04 -0700
  • b4054ff4cf
    Pass caching parameter to PDFResourceManager in `high_level` functions (#475) David Nicholson 2020-09-10 15:09:07 -0400
  • a21459f8a5
    Merge branch 'develop' into patch-1 Pieter Marsman 2020-09-10 21:08:19 +0200
  • ff29a8536c
    Update CHANGELOG.md Pieter Marsman 2020-09-10 21:07:22 +0200
  • a83f853de7
    Remove unused rijndael encryption implementation (#465) Igor Moura 2020-09-10 14:28:00 -0300
  • 04796283e3
    Update CHANGELOG.md Pieter Marsman 2020-09-10 19:26:19 +0200
  • 4d8b5975cb
    Add section to documentation with howto for AcroForm fields extraction (#458) typhoon71 2020-09-10 19:18:41 +0200
  • d19e12edd5 Lifan's layout update Nathaniel Bush 2020-09-09 19:24:34 -0700
  • c686a14e2d Add changelog entry for: Handle XREFs with missing startxref after trailer Ben Murphy 2020-09-09 16:27:15 +0100
  • cd1d683f5e Handle XREFs with missing startxref after trailer Ben Murphy 2020-09-09 15:29:16 +0100
  • a790d1609c
    Cast a pathlib object to string for py3.4/3.5 estshorter 2020-09-09 22:43:03 +0900
  • 8365ee9074
    Fix a wrong function name estshorter 2020-09-09 22:29:35 +0900
  • 7eec14935e Add test for open_filename with pathlib estshorter 2020-09-09 22:14:15 +0900
  • b93c179faf open_filename accepts a pathlib.PurePath object estshorter 2020-09-09 17:52:52 +0900
  • 1611aff0ec hopefully better aproach pgajdos 2020-09-08 18:55:04 +0200
  • 618164a0d0 fix typo pgajdos 2020-09-08 18:48:01 +0200
  • cff19d6138 tools are in . pgajdos 2020-09-08 18:45:18 +0200
  • 728f6ddd29 blank line at the end of the file pgajdos 2020-09-08 18:34:46 +0200
  • 78d8cf82d0 formatting issues pgajdos 2020-09-08 18:32:22 +0200
  • aa96648c98 do not depend on nose pgajdos 2020-09-08 18:19:48 +0200
  • a67b141298 do not depend on nose pgajdos 2020-09-08 18:07:06 +0200
  • 7f990a0781
    Fix issue #136 zypriafl 2020-09-04 12:24:49 +0200
  • ff568f2d6a
    Updated CHANGELOG.md to reflect quick fix David Nicholson 2020-09-01 14:04:41 -0400
  • 1ec75f8e2c one extra comment to code is added Ev2geny 2020-09-01 00:18:51 +0200
  • 26d09bfe7e Issue #469 is fixed Ev2geny 2020-08-31 23:21:53 +0200
  • ca870a4228 Fix not being able to pass boxes flow as None to pdf2txt Jake Stockwin 2020-08-25 09:59:03 +0100
  • ca555d49da
    Reverted params back to fix errors David Nicholson 2020-08-21 10:36:44 -0400
  • fb1bdd73e6
    Updated high_level.py David Nicholson 2020-08-21 10:14:43 -0400
  • 2d996c9ae2 - "params" should evaluate to *False* or be a dictionary, but it was a *PDFObjRef* instead! ignacio 2020-08-17 15:30:11 +0200
  • 0ab1f0b696 Made PEP8 compliant. All flake8 errors eliminated Ev2geny 2020-08-11 19:03:50 +0200
  • efb35986ed
    Update CHANGELOG.md Ev2geny 2020-08-11 12:01:58 +0200
  • af63e90d95 functionality added to optionally separate words by more than one space evgeny 2020-08-11 11:52:23 +0200
  • 0d20c8ec8e tests for qnt_spaces_between_words=2 added evgeny 2020-08-11 10:29:55 +0200
  • da73cd1d38 qnt_spaces_between_words parameter is added to LAParams. All tests pass evgeny 2020-08-11 01:30:15 +0200
  • dd693650c6
    Use Python3 Emmanuel Arias 2020-08-09 20:23:21 -0300
  • cc51e18fb6
    Update pdfdocument.py typhoon71 2020-08-03 20:16:46 +0200
  • 374a52ba62
    Update pdfdocument.py typhoon71 2020-08-03 20:14:06 +0200
  • 8b2bd67324
    Update pdfdocument.py typhoon71 2020-08-03 20:08:07 +0200
  • f6d902e973
    Update docs/source/howto/acro_forms.rst typhoon71 2020-08-03 15:15:31 +0200
  • 5a277d90f0
    Update pdfdocument.py typhoon71 2020-08-03 15:12:51 +0200
  • 333a117158
    Add current PR link to CHANGELOG.md Igor Moura 2020-08-01 10:41:13 -0300
  • 1540e16f4c Remove unused rijndael encryption Igor Moura 2020-08-01 10:32:14 -0300
  • d2bd3cb702
    Update index.rst typhoon71 2020-07-26 18:45:35 +0200
  • 5f39581b0a
    Update README.md typhoon71 2020-07-26 18:43:37 +0200
  • 0215ea59b3
    Update index.rst typhoon71 2020-07-26 18:42:44 +0200
  • 0b44f77714 Move changelog line for #438 to current release 20200726 Pieter Marsman 2020-07-26 15:14:15 +0200
  • 391fe149ca Release 20200726 Pieter Marsman 2020-07-26 15:10:36 +0200
  • 66856a1016 Replace internal usage of PDFTextExtractionNotAllowedError (deprecated) with PDFTextExtractionNotAllowed Pieter Marsman 2020-07-26 15:09:32 +0200
  • 99f0c09869
    Restore PDFTextExtractionNotAllowed exception (#461) Philippe Ombredanne 2020-07-26 15:06:04 +0200
  • efc8f12ff4 Add line to CHANGELOG.md Pieter Marsman 2020-07-26 15:03:26 +0200
  • 4ff7061daf Use PDFTextExtractionNotAllowed and prepare PDFTextExtractionNotAllowedError to be removed in the future Pieter Marsman 2020-07-26 15:01:35 +0200
  • 2286307bd3
    Restore PDFTextExtractionNotAllowed Philippe Ombredanne 2020-07-26 11:51:39 +0200
  • 626092196b
    Fixed typo typhoon71 2020-07-24 11:40:41 +0200
  • 64f3b08391
    Delete tmp typhoon71 2020-07-24 11:37:27 +0200
  • 216a44de4e
    PDF file with AcroForms typhoon71 2020-07-24 11:36:04 +0200
  • ab2ab9af5d
    Delete AcroForm_TEST_compiled.pdf typhoon71 2020-07-24 11:34:56 +0200
  • 193b1c087f
    Delete AcroForm_TEST.pdf typhoon71 2020-07-24 11:34:38 +0200
  • 86ae7c3b5e
    Create tmp typhoon71 2020-07-24 11:31:30 +0200
  • 190be42d01
    PDF with AcroForm (samples) typhoon71 2020-07-24 11:27:30 +0200
  • d8aaa27a4c
    Proper processing of ComboBox typhoon71 2020-07-24 11:23:01 +0200
  • 4f65242750
    Always try to get CMap, even if name is not recognized (#438) Pieter Marsman 2020-07-23 20:27:38 +0200
  • bb77c163af Add comment Pieter Marsman 2020-07-23 20:26:59 +0200
  • a09312768d
    Update README.md typhoon71 2020-07-22 16:06:20 +0200
  • 4447d53b2c
    reverted changes typhoon71 2020-07-22 16:02:37 +0200
  • 65eda5236d
    Update acro_forms.rst typhoon71 2020-07-21 17:49:34 +0200
  • 2514e7119e
    Update docs/source/howto/acro_forms.rst typhoon71 2020-07-21 17:42:53 +0200
  • 187ba67f28
    Update docs/source/howto/acro_forms.rst typhoon71 2020-07-21 17:39:13 +0200
  • 703935bb9d
    Update docs/source/howto/acro_forms.rst typhoon71 2020-07-21 17:38:08 +0200
  • 3cebf5ef66 Release 20200720 20200720 Pieter Marsman 2020-07-20 22:05:19 +0200
  • c10cf3cdb8
    Change pycryptodome dependency to the faster, smaller, and industry standard cryptography package (#456) lithiumFlower 2020-07-20 16:00:54 -0400
  • 9d8202997b Fix line length Pieter Marsman 2020-07-20 21:26:59 +0200
  • 21e2aae012 Merge remote-tracking branch 'origin/391-fix-cmap-from-pickle-file' into 391-fix-cmap-from-pickle-file Pieter Marsman 2020-07-20 21:23:33 +0200
  • 26a2b7e201 Add CJK characters to expected output of simple3.pdf Pieter Marsman 2020-07-20 21:23:21 +0200
  • b7cc2b81a3 Optimize imports of pdfdocument.py Pieter Marsman 2020-07-20 21:01:27 +0200
  • 6bacfaeefe Add tests for encrypted documents to nose test suite Pieter Marsman 2020-07-20 20:46:44 +0200
  • f1e35ca757 Revert "add samples to nosetests" Pieter Marsman 2020-07-20 20:40:25 +0200
  • 6661965345
    Update acro_forms.rst typhoon71 2020-07-20 17:13:37 +0200
  • 73a3577031
    Update pdfdocument.py typhoon71 2020-07-20 17:00:38 +0200