840 Commits (e59b1bca2f764de8e023b49c70d64e80c1bd6b78)

Author SHA1 Message Date
Pieter Marsman e4790fdbc2 Add AES as supported encryption method to docs 2020-01-07 18:38:53 +01:00
Pieter Marsman b27d3d0aff Bump version 2020-01-04 18:15:15 +01:00
Pieter Marsman 6eb9957e8a Update docs: at least python 3.4 is needed now 2020-01-04 16:51:54 +01:00
Pieter Marsman 3502dc9f3b
Drop support for legacy Python 2 (#346)
* Drop support for legacy Python 2

* Add python_requires to help pip

* Upgrade Python syntax with pyupgrade

* Upgrade Python syntax with pyupgrade --py3-plus

* Python 3 imports

* Replace six

* Update CONTRIBUTING.md

* Added line to changelog

Co-authored-by: Hugo van Kemenade <hugovk@users.noreply.github.com>
2020-01-04 16:47:07 +01:00
Pieter Marsman f3ab1bc61e
Enforce pep8 coding-style (#345)
* Code Refactor: Use code-style enforcement #312

* Add flake8 to travis-ci

* Remove python 2 3 comment on six library. 891 errors > 870 errors.

* Remove class and functions comments that consist of just the name. 870 errors > 855 errors.

* Fix flake8 errors in pdftypes.py. 855 errors > 833 errors.

* Moving flake8 testing from .travis.yml to tox.ini to ensure local testing before committing

* Cleanup pdfinterp.py and add documentation from PDF Reference

* Cleanup pdfpage.py

* Cleanup pdffont.py

* Clean psparser.py

* Cleanup high_level.py

* Cleanup layout.py

* Cleanup pdfparser.py

* Cleanup pdfcolor.py

* Cleanup rijndael.py

* Cleanup converter.py

* Rename klass to cls if it is the class variable, to be more consistent with standard practice

* Cleanup cmap.py

* Cleanup pdfdevice.py

* flake8 ignore fontmetrics.py

* Cleanup test_pdfminer_psparser.py

* Fix flake8 in pdfdocument.py; 339 errors to go

* Fix flake8 utils.py; 326 errors to go

* pep8 correction for few files in /tools/ 328 > 160 to go (#342)

* pep8 correction for few files in /tools/ 328 > 160 to go

* pep8 correction: 160 > 5 to go

* Fix ascii85.py errors

* Fix error in getting index from target that does not exist

* Remove commented print lines

* Fix flake8 error in pdfinterp.py

* Fix python2 specific error by removing argument from print statement

* Ignore invalid python2 syntax

* Update contributing.md

* Added changelog

* Remove unused import

Co-authored-by: Fakabbir Amin <f4amin@gmail.com>
2019-12-29 21:20:20 +01:00
Martin Hasoň 78f06225b6 Removed duplicated and therefore unused code from pdf2txt.py (#341) 2019-12-09 22:04:05 +01:00
Pieter Marsman 452f0b4ad0 Merge branch 'develop' 2019-11-10 12:59:55 +01:00
Pieter Marsman 803a7d9598 Release 20191110 2019-11-10 12:29:14 +01:00
Pieter Marsman 2bee7d8dcf
Fix wrong ordering of grouping textboxes introduced by #315. The first grouping of textboxes should be skipped if there are intermediate textboxes. (#335)
Fixes #334
2019-11-10 12:18:49 +01:00
Pieter Marsman b63a636512 Merge branch 'develop' 2019-11-07 21:52:58 +01:00
Pieter Marsman 5c6fa8f986 Release 20191107 2019-11-07 21:52:44 +01:00
Pieter Marsman bc034c8e59
Create sphinx documentation for Read the Docs (#329)
Fixes #171
Fixes #199
Fixes #118
Fixes #178
Added: tests for building documentation and example code in documentation
Added: docstrings for commonly used functions and classes
Removed: old documentation
2019-11-07 21:12:34 +01:00
Igor Moura 40aa2533c9 Added: simple wrapper to extract text from pdf (#330)
Fixes #327
2019-11-07 07:54:10 +01:00
Pieter Marsman 027bb62943 Merge branch 'develop' of github.com:pdfminer/pdfminer.six into develop 2019-11-06 21:51:41 +01:00
Pieter Marsman 548b933a84 Add line to CHANGELOG.md for #325 2019-11-06 21:51:34 +01:00
Martin Hasoň ed1b09c6f2 Fix debug logging for pdf2txt.py and dumppdf.py (#325)
Fixes #313
2019-11-06 21:47:19 +01:00
Pieter Marsman 33b16b3f07
Deprecate the use of _py2_no_more_posargs (#328)
Fixes #324
2019-11-02 10:29:39 +01:00
Jianfeng 44b223cf0a Speedup grouping of textboxes (#315)
Changed: using a heap instead of a SortedList and avoid rebuilding the heap in each iteration
Changed: avoid potentially huge number of variable assignments in list comprehension.
Changed: avoid repeatedly evaluating `obj is obj` in list comprehension by storing id(obj).
2019-10-31 09:22:58 +01:00
Pieter Marsman 6cc78ee124
Replace opts by argparse in dumppdf.py (#321)
Also add multi-character argument names
Fixes #175
2019-10-27 21:40:04 +01:00
Pieter Marsman 347c125fb8 Revert "Move old documentation to subfolder"
This reverts commit a2e6c7c0
2019-10-27 14:26:11 +01:00
Pieter Marsman a2e6c7c0c9 Move old documentation to subfolder 2019-10-27 14:21:47 +01:00
Pieter Marsman d88d6020a2
Remove webapp and other (un)helpful application references: django, cgi, and pyinstaller. (#320)
Fixes #314 
Fixes #105
2019-10-26 19:16:37 +02:00
Pieter Marsman 1c4a4167ed
Fix failing test on develop & cleaning up test files (#319) 2019-10-26 18:42:33 +02:00
Pieter Marsman a238a19999
Fix assertionerror when dumping pdf with reference to objid 0 (#318)
Fixes #94 
Added: test to check if `PDFObjectNotFound` error is raised if objid 0 is requested.
2019-10-25 22:49:58 +02:00
Pieter Marsman 5516c8147c Add new python versions for travis 2019-10-23 21:05:30 +02:00
Serj Sintsov cb9cd8ea46 Use named logger instead of root logger (#236) 2019-10-22 20:52:43 +02:00
jbarlow83 733ddf7e57 Added: tests for extracting text from pdfs with Type3 fonts (#205) 2019-10-22 18:15:59 +02:00
Pieter Marsman 373c6e7b97
Added: extraction of JBIG2 encoded images (#311)
And added test for pdf with JBIG2 image.

Fixes #26 
Closes #46
2019-10-22 17:37:06 +02:00
Pieter Marsman 3001fe3a82
Add item to checklist about updating CHANGELOG.md 2019-10-22 17:30:09 +02:00
Pieter Marsman 9153e835bc
Merge pull request #310 from pdfminer/test-pr
Use same format for pull_request_template.md as for bug_report.md
2019-10-21 23:12:44 +02:00
Pieter Marsman 36621c8038 Use same format for pull_request_template.md as for bug_report.md 2019-10-21 23:08:57 +02:00
Pieter Marsman 131611ab6b
Create pull_request_template.md 2019-10-21 23:02:50 +02:00
Pieter Marsman 049919a981 Update issue templates 2019-10-21 22:51:45 +02:00
Pieter Marsman a5a34d53bd
Merge develop into master for release 20191020 2019-10-20 14:27:19 +02:00
Pieter Marsman fa34c8d215 Merge branch 'master' into develop 2019-10-20 14:24:44 +02:00
Pieter Marsman 694aa508c3 Release 20191020 2019-10-20 14:21:48 +02:00
Pieter Marsman 1d3fa415b5 Remove licence from README.md because it is already in the LICENCE file 2019-10-20 14:19:18 +02:00
Pieter Marsman 19c078f0d0 Update CONTRIBUTING.md 2019-10-20 14:18:26 +02:00
Pieter Marsman adc4726e06
Add warning about dropping python2 support (#307)
Fix #303
2019-10-20 13:59:29 +02:00
Pieter Marsman dd7dc7b684 Use keep-a-changelog format for CHANGELOG.md and add unreleased additions, fixes and changes. 2019-10-20 12:32:11 +02:00
Hugo van Kemenade 12bba5b5f7 Only define dependencies in setup.py (#306)
Fixes #299. Closes #300.

Changed: define dependencies in setup.py using install_requires and extra_requires. 
Added: section to CONTRIBUTING.md for initial dev setup.
2019-10-20 11:41:31 +02:00
Pieter Marsman 9fd7172f7b Cleanup utils.py 2019-10-17 12:14:02 +02:00
jet457 7e40fde320 Removing assertion in drange to allow equal inputs (#246) and mimic behaviour of built-in method range
Fixes #66, since it now allows the bbox to have 0 width or 0 height
Added tests for Plane since it is the API that uses drange
2019-10-17 12:04:25 +02:00
D.A.Bashkirtsev 4df6d4e5ca Changed: comparisons for image colorspace literals (#132)
Fixes #131 

Changed: comparisons for image colorspace literals
Added: test for extracting images from pdfs
2019-10-15 16:11:54 +02:00
Pieter Marsman 63b2e09ac3
Merge pull request #203 from jbarlow83/negative-descent
Interpret font Descent as a negative number even if specified as positive
2019-10-13 20:06:52 +02:00
Pieter Marsman eae70b999c
Merge pull request #219 from FelixSchwarz/fixed-wheel-requirements
use conditional requirements to ensure "chardet" is always required on Python 3
2019-10-13 19:57:05 +02:00
Tata Ganesh c6ad95ea87
Merge pull request #298 from tongbaojia/fix_color_graphicstate_dev
Fix stroke color and non-stroke color in PDFGraphicState
2019-10-13 19:27:09 +05:30
Tony Tong 106a09c5bb fix stroke color and non-stroke color in PDFGraphicState 2019-10-12 17:35:46 -04:00
Tata Ganesh f218996fe9
Merge pull request #273 from igormp/develop
Use resolve_all on PdfFont widths and bbox
2019-10-12 21:24:29 +05:30
Tata Ganesh f53fbd98b1
Merge pull request #283 from fakabbir/pdfstream-as-cmap
Pdfstream as cmap
2019-10-12 21:22:52 +05:30