Commit Graph

  • f83b3f8d7b Prettify _parse_string_1 Pieter Marsman 2021-08-31 21:57:37 +0200
  • 3e6a9879b1 Remove pdfminer/Changelog Pieter Marsman 2021-08-31 21:13:04 +0200
  • 2ad707ee73 Added test Pieter Marsman 2021-08-31 21:11:26 +0200
  • 8ea9f1091a
    Fix typos in converting_pdf_to_text.rst (#611) MapleCCC 2021-09-01 02:52:13 +0800
  • 8d3ac1a5a1
    Merge branch 'develop' into develop Pieter Marsman 2021-08-31 20:48:31 +0200
  • 46fa21476a
    Raise proper error when bad --output-type is used and fix formatting output of TagExtractor Mingye Wang 2021-09-01 02:46:20 +0800
  • 72af00718a Fix flake8 errors Pieter Marsman 2021-08-31 20:31:12 +0200
  • e5810f49b7 Update CHANGELOG.md Pieter Marsman 2021-08-31 20:27:25 +0200
  • b1310c343c Properly format tag names and tag values. Using utils.make_compat_str() such that the tag value is always a string. Pieter Marsman 2021-08-31 20:25:37 +0200
  • bb97a7e499 Rename _write_outfp to _write Pieter Marsman 2021-08-31 20:24:37 +0200
  • 32efbcdb58 Fix pycharm spacing warning Pieter Marsman 2021-08-31 20:10:43 +0200
  • 40fad9bb5d
    Merge branch 'develop' into patch-1 Pieter Marsman 2021-08-31 20:08:23 +0200
  • 4f4aa14488
    Use if-elif-else structure Pieter Marsman 2021-08-31 20:05:28 +0200
  • 7f54cefe02
    Use visible imports in highlevel.rst documentation (#609) Fiete 2021-08-30 22:17:21 +0200
  • ae5f2d8c55 (Unrelated to this MR) fix line length error Pieter Marsman 2021-08-30 22:15:04 +0200
  • c655c3e043 Optimize imports Pieter Marsman 2021-08-30 22:13:08 +0200
  • 9b8251ad19
    Merge branch 'develop' into highlevel-add-missing-import Pieter Marsman 2021-08-30 22:12:38 +0200
  • 06fead183f (Unrelated to this MR) Remove sys from converter.py Pieter Marsman 2021-08-30 22:09:00 +0200
  • 89e27e5cbd Remove obsolete check for python version; python 2 is not supported anymore Pieter Marsman 2021-08-30 22:03:11 +0200
  • 52cc6d7ede Replace testsetup with visible imports in documentation Pieter Marsman 2021-08-30 22:02:23 +0200
  • 1d33c026e4
    Updated link to Diátaxis documentation website (#606) Daniele Procida 2021-08-30 21:47:40 +0200
  • 047a246512
    Fix `AttributeError` when dumping a TOC with bytes destinations (#600) estshorter 2021-08-31 04:31:32 +0900
  • 92a3c8ee64
    Merge branch 'develop' into fix_dumpoutline Pieter Marsman 2021-08-30 21:30:58 +0200
  • ce80cfb4a7 Rename e() to escape() and merge two isinstance() checks Pieter Marsman 2021-08-30 21:26:55 +0200
  • a70f08818d
    Fix 594 use null id when encrypted but no id given (#595) Richard Millson 2021-08-29 15:32:14 -0400
  • 4275b2be93
    Merge branch 'develop' into fix-594-KeyError-when-no-ID-in-trailer Pieter Marsman 2021-08-29 21:14:56 +0200
  • 234c466372
    Fix extraction of some cjk characters (#593) wind_chh 2021-08-27 03:05:03 +0800
  • 36f7149011
    Merge branch 'develop' into 566-fix-cjk-characters-extraction Pieter Marsman 2021-08-26 21:04:24 +0200
  • d821fed340
    Fix typos in readthedocs documentation. (#579) X 2021-08-27 03:58:50 +0900
  • 543976f195
    Fix issue of ValueError and KeyError rasied in PDFdocument and PDFparser (#574) Tony(Baojia) Tong 2021-08-26 14:55:02 -0400
  • ea00f56ac6
    Added support for Paeth PNG filter compression (predictor value = 4) (#537) Eduardo Gonzalez Lopez de Murillas 2021-08-26 20:53:13 +0200
  • 0f6f968355 Make if-else consistent Pieter Marsman 2021-08-26 20:48:58 +0200
  • 39090721e8 Fix bug in filter type Up Pieter Marsman 2021-08-26 20:45:48 +0200
  • 5d26851b9d
    Merge branch 'develop' into develop Pieter Marsman 2021-08-26 20:39:01 +0200
  • 47e9ea63cf Fix merge conflict on readme Pieter Marsman 2021-08-26 20:36:10 +0200
  • 443d6d6f7f
    Update pdfinterp.py crisptag 2021-08-24 04:18:58 +0530
  • f3e28a1678 Add line about compressions to README.md Pieter Marsman 2021-08-23 21:39:23 +0200
  • 779121c179 Fix line length issues Pieter Marsman 2021-08-23 21:27:59 +0200
  • 8716e5671f Refactor: use variable names that are very close to the pseudo code and add pieces of the docs to show what is going on. Pieter Marsman 2021-08-23 21:20:01 +0200
  • fbb10b8282 update CHANGELOG Andrew Baumann 2021-08-18 08:23:24 -0700
  • b919f9f58f update CHANGELOG Andrew Baumann 2021-08-18 08:20:27 -0700
  • 9cd7ff9882 rewrap long lines Andrew Baumann 2021-08-17 12:01:15 -0700
  • 3a733b0448 fix the bug Andrew Baumann 2021-08-17 11:45:59 -0700
  • 5e663bba97 add a test Andrew Baumann 2021-08-17 11:45:54 -0700
  • 5e52132332 Fix pdf2txt --boxes-flow=disabled Andrew Baumann 2021-08-16 07:56:33 -0700
  • c90b5fa1e7 Use `above` and `upper_left` as in the pseudo code Pieter Marsman 2021-08-15 18:11:25 +0200
  • 19c1372984
    Fix for when 'trailer' is indented (#535) Jake Stockwin 2021-08-15 16:49:56 +0100
  • a09b984fbe
    Merge branch 'develop' into fix-trailer Pieter Marsman 2021-08-15 17:48:00 +0200
  • 53bd145e66 Add poetry venv management files to gitignore since I started using poetry to manage the python envs for this project Pieter Marsman 2021-08-15 17:46:03 +0200
  • 572e2b16fa Add CHANGELOG.md Pieter Marsman 2021-08-15 17:45:36 +0200
  • a0b8144a49 Remove commented code because no longer used. Pieter Marsman 2021-08-15 17:33:36 +0200
  • a5831d110a flake8 tests Forest Gregg 2021-08-01 22:59:17 -0400
  • 18ffa29387 add description in changelog Forest Gregg 2021-08-01 22:52:11 -0400
  • 6c7ee43d6c Fixes jbig2 writer to write valid jb2 files Forest Gregg 2021-08-01 22:43:36 -0400
  • 7598220084 Add HOCRConverter hudsonr 2021-07-29 22:10:43 +0200
  • 34ca35d9c2 fix trailing whitspace Forest Gregg 2021-07-29 10:44:07 -0400
  • 83fa7c103a Merge branch 'develop' of https://github.com/pdfminer/pdfminer.six into jpx_decode Forest Gregg 2021-07-29 10:40:17 -0400
  • b17d852d30 Merge branch 'develop' of https://github.com/pdfminer/pdfminer.six into tobytes Forest Gregg 2021-07-29 10:38:59 -0400
  • 016239c146
    Fix .paint_path handling of single line segments (#530) Jeremy Singer-Vine 2021-07-27 12:27:32 -0400
  • bc6cc711a7 Update travis badge Pieter Marsman 2021-07-27 18:04:15 +0200
  • d91d26aad4 Update travis badge Pieter Marsman 2021-07-27 18:03:27 +0200
  • fe26079f32 Revert "Trigger travis build" Pieter Marsman 2021-07-27 18:01:10 +0200
  • 41c0518497 Trigger travis build Pieter Marsman 2021-07-27 17:47:15 +0200
  • dfc6990f3f Optimize imports order Pieter Marsman 2021-07-27 17:36:49 +0200
  • 082e5db500 Extract variables from if statement to make it easier to read Pieter Marsman 2021-07-27 17:26:23 +0200
  • 66f5938164 Remove PDFLayoutAnalyzer._create_curve because implementation has become trivial due to refactoring Pieter Marsman 2021-07-27 17:25:58 +0200
  • cb53b1babc One changelog line with ref to mr Pieter Marsman 2021-07-27 16:53:50 +0200
  • d40e9a0cf6 Updated the CHANGELOG.md file with the maxobjects parameter addition change. Ran M. Bittmann 2021-07-26 14:08:07 +0300
  • 014f2aa58e Add maxobjects parameter to limit number of processed objects per page. 0, the default value means no limit, i.e. no change to the original use. Ran M. Bittmann 2021-07-26 12:32:47 +0300
  • 1fda826e12 typo in comment Forest Gregg 2021-07-19 11:15:03 -0400
  • aa3693761d add comment explaining why we are opening and saving JP2 images with PIL Forest Gregg 2021-07-19 11:14:24 -0400
  • a42d8f505c use PIL to open and save jp2 images Forest Gregg 2021-07-19 10:56:12 -0400
  • a11833b80d add changelox for jpx decode fix Forest Gregg 2021-07-19 09:53:30 -0400
  • 99536fbf1a handle jpx decoding Forest Gregg 2021-07-19 09:49:33 -0400
  • 62d3058f08 test for tobytes Forest Gregg 2021-07-14 11:35:30 -0400
  • 6483b32fa3 changelog entry Forest Gregg 2021-07-14 11:22:38 -0400
  • 13fb97c94c array.array.tostring -> array.array.tobytes Forest Gregg 2021-07-14 08:55:31 -0400
  • 106b13b9dc Attempt to handle decompression error on some broken PDF files Sylvain Thénault 2021-07-01 17:22:44 +0200
  • 7d8739984d
    Fixed a few typos Claas Z 2021-06-22 17:39:14 +0200
  • 70467bc88c format code to pass flake8 check huan_cheng 2021-05-28 20:31:43 +0800
  • 84f915b407 add support for Identity-H/V cmap fonts huan_cheng 2021-05-28 20:01:48 +0800
  • 624402ac70 Keep Travis CI happy Henry S. Thompson 2021-05-27 17:13:00 +0100
  • 357726e901 Delete \\\r\n in Literal Strings (ref. section 7.3.4.2 of PDF32000_2008) Henry S. Thompson 2021-05-22 12:24:57 +0100
  • 6c7301b56d CS-6997: vary gridsize for Plane class Andrey Sitaev 2021-05-13 18:47:36 +0300
  • 7e468ecbec feat: Add support for ISO 32000-2 AES256 encryption Raphael Cohen 2021-05-05 18:00:33 +0200
  • 737a10022d I don't understand the CHANGELOG.md format, hope this is good enough Henry S. Thompson 2021-05-05 17:03:45 +0100
  • 74cbc6706d detect TextIOWrapper as non-binary Henry S. Thompson 2021-05-05 16:59:39 +0100
  • 83031a36fb Trim redundant spaces MapleCCC 2021-04-28 04:27:09 +0800
  • 17e9d6dbfe The word "pdfminer.six" as a whole should not be separated by newline, otherwise they are treated as two separated words by renderer, and incorrectly displayed as separated. MapleCCC 2021-04-28 04:26:48 +0800
  • a4374f6b72 Fix typos in converting_pdf_to_text.rst MapleCCC 2021-04-28 04:24:09 +0800
  • 74ce47e19e TagExtractor: eliminate runtime error Mingye Wang 2021-04-24 18:05:51 +0800
  • f0500d837c high_level: emit diagnostic for bad output_type Mingye Wang 2021-04-24 17:55:09 +0800
  • 513592691c add missing import for extract_text_to_fp Fiete Ostkamp 2021-04-24 09:46:54 +0200
  • 401a184779 Updated link to Diátaxis documentation website Daniele Procida 2021-04-10 16:26:22 +0100
  • be9e216ba6 Speedup asciihexdecode Cristian Ciupitu 2021-04-07 22:22:14 +0300
  • 1cecc56da0
    Update CHANGELOG.md estshorter 2021-03-25 21:40:59 +0900
  • 76ea29b61c
    Update CHANGELOG.md estshorter 2021-03-25 21:33:31 +0900
  • 4b02cf5c17
    Fix a bug that a TOC title variable is a bytes type estshorter 2021-03-25 21:31:36 +0900
  • 81ce5c8b3c
    Fix an error when dumping a TOC estshorter 2021-03-25 21:27:07 +0900
  • f649d7130d
    Merge pull request #2 from pdfminer/develop wind_chh 2021-03-15 11:14:26 +0800