pdfminer.six

Commit Graph

Author	SHA1	Message	Date
Richard Millson	a70f08818d	Fix 594 use null id when encrypted but no id given (#595 ) Co-authored-by: Pieter Marsman <pietermarsman@gmail.com>	2021-08-29 21:32:14 +02:00
Tony(Baojia) Tong	543976f195	Fix issue of ValueError and KeyError rasied in PDFdocument and PDFparser (#574 ) * check obj type * update changelog * Update CHANGELOG.md * fix the bug * fix condition * update changelog * update changelog again * update changelog * update Co-authored-by: Pieter Marsman <pietermarsman@gmail.com> Co-authored-by: Tony Tong <baojia.tong@kensho.com>	2021-08-26 20:55:02 +02:00
Jake Stockwin	19c1372984	Fix for when 'trailer' is indented (#535 ) * Fix for when trailer is indented * Store stripped line * This commit breaks things... * Or maybe this one breaks things? * Remove commented code because no longer used. * Add CHANGELOG.md * Add poetry venv management files to gitignore since I started using poetry to manage the python envs for this project Co-authored-by: Pieter Marsman <pietermarsman@gmail.com>	2021-08-15 17:49:56 +02:00
Pieter Marsman	178a831802	Revert "Fix for when 'trailer' is indented (#513 )" (#534 ) This reverts commit `ec223d1f1d`.	2020-10-25 13:22:42 +01:00
Jake Stockwin	ec223d1f1d	Fix for when 'trailer' is indented (#513 ) * Fix for when 'trailer' is indented Closes #214 * Address CR comments - strip line after parsing * Update CHANGELOG.md Co-authored-by: Pieter Marsman <pietermarsman@gmail.com>	2020-10-24 18:55:07 +02:00
Diego Elio Pettenò	67e2d79591	Fix out-of-bound access on some PDFs. (#483 ) Replace the non-emptiness check with a minimum length check — you can't get the second to last item in a list of less than two items.	2020-10-10 15:18:34 +02:00
Philippe Ombredanne	99f0c09869	Restore PDFTextExtractionNotAllowed exception (#461 ) * Restore PDFTextExtractionNotAllowed Restore PDFTextExtractionNotAllowed exception class as an alias of the new PDFTextExtractionNotAllowedError exception that was introduced in `6a9269b432` Removing PDFTextExtractionNotAllowed is an API breakage that made several tools fail break. Signed-off-by: Philippe Ombredanne <pombredanne@nexb.com> * Use PDFTextExtractionNotAllowed and prepare PDFTextExtractionNotAllowedError to be removed in the future * Add line to CHANGELOG.md Co-authored-by: Pieter Marsman <pietermarsman@gmail.com>	2020-07-26 15:06:04 +02:00
lithiumFlower	c10cf3cdb8	Change pycryptodome dependency to the faster, smaller, and industry standard cryptography package (#456 ) * swap pycryptodome to the faster, smaller, and industry standard crytography io * update changelog * fixlint * Update CHANGELOG.md * from MR, unneeded ex and naming * add samples to nosetests * fix lint * show mismatch * fix lint * typo and newline * Revert "add samples to nosetests" This reverts commit `a49ca302` * Add tests for encrypted documents to nose test suite * Optimize imports of pdfdocument.py Co-authored-by: Oren Tysor <oren@atakama.com> Co-authored-by: Pieter Marsman <pietermarsman@gmail.com>	2020-07-20 22:00:54 +02:00
madhurcodes	6a9269b432	Change Text extraction is not allowed error to warning (#453 ) * Changed error to warning for 'Text extraction is not allowed' * updated changelog * fix lint * made changes suggested in review * Update CHANGELOG.md * Add regression test for failing pdf * Reduce line length to <80 Co-authored-by: Pieter Marsman <pietermarsman@gmail.com>	2020-07-11 16:04:11 +02:00
Pieter Marsman	6e05baf0b7	Dont dump fallback xref by default when using dumppdf.py, adding a flag to enable it Fixes #176 * Add failing test for dumping simple1.pdf and simple3.pdf, because they should raise an error when dumppdf.py tries to dump a pdf without xref's * Raise PDFNoValidXRef with explanation if dumppdf.py is called on a pdf that does not have an xref * Use warning instead of error, because not output xrefs is just fine (there aren't any) but it is something the user should know * Adding changelog * Extend help message	2020-05-23 18:04:34 +02:00
Recursing	0b1741b9bf	Pack the /P (ermissions) entry from the /Encrypt dictionionary in the file trailer, as unsigned long (#352 ) Fixes #186 * Tread the permissions (the /P entry) as unsigned long, fix #186 * handle negative values for p * Extract function for resolving an twos-complement * Add test for issue #352 * Add line to CHANGELOG.md * Only ints can be converted to a uint using two's-complement method * Standardize import style; multiple imports from same module on one line Co-authored-by: Pieter Marsman <pietermarsman@gmail.com>	2020-01-07 21:59:13 +01:00
Pieter Marsman	3502dc9f3b	Drop support for legacy Python 2 (#346 ) * Drop support for legacy Python 2 * Add python_requires to help pip * Upgrade Python syntax with pyupgrade * Upgrade Python syntax with pyupgrade --py3-plus * Python 3 imports * Replace six * Update CONTRIBUTING.md * Added line to changelog Co-authored-by: Hugo van Kemenade <hugovk@users.noreply.github.com>	2020-01-04 16:47:07 +01:00
Pieter Marsman	f3ab1bc61e	Enforce pep8 coding-style (#345 ) * Code Refractor: Use code-style enforcement #312 * Add flake8 to travis-ci * Remove python 2 3 comment on six library. 891 errors > 870 errors. * Remove class and functions comments that consist of just the name. 870 errors > 855 errors. * Fix flake8 errors in pdftypes.py. 855 errors > 833 errors. * Moving flake8 testing from .travis.yml to tox.ini to ensure local testing before commiting * Cleanup pdfinterp.py and add documentation from PDF Reference * Cleanup pdfpage.py * Cleanup pdffont.py * Clean psparser.py * Cleanup high_level.py * Cleanup layout.py * Cleanup pdfparser.py * Cleanup pdfcolor.py * Cleanup rijndael.py * Cleanup converter.py * Rename klass to cls if it is the class variable, to be more consistent with standard practice * Cleanup cmap.py * Cleanup pdfdevice.py * flake8 ignore fontmetrics.py * Cleanup test_pdfminer_psparser.py * Fix flake8 in pdfdocument.py; 339 errors to go * Fix flake8 utils.py; 326 errors togo * pep8 correction for few files in /tools/ 328 > 160 to go (#342) * pep8 correction for few files in /tools/ 328 > 160 to go * pep8 correction: 160 > 5 to go * Fix ascii85.py errors * Fix error in getting index from target that does not exists * Remove commented print lines * Fix flake8 error in pdfinterp.py * Fix python2 specific error by removing argument from print statement * Ignore invalid python2 syntax * Update contributing.md * Added changelog * Remove unused import Co-authored-by: Fakabbir Amin <f4amin@gmail.com>	2019-12-29 21:20:20 +01:00
Pieter Marsman	a238a19999	Fix assertionerror when dumping pdf with reference to objid 0 (#318 ) Fixes #94 Added: test to get check if `PDFObjectNotFound` error is raised if objid 0 is requested.	2019-10-25 22:49:58 +02:00
Philippe Guglielmetti	b010db6049	solves https://github.com/pdfminer/pdfminer.six/issues/65	2017-07-20 21:17:06 +02:00
Hugh Secker-Walker	488545ddc7	Add string expressions to asserts showing local data (#67 )	2017-05-29 09:06:09 +02:00
Philippe Guglielmetti	82af7f0aac	issue #56 reproduced, solution attempt unsucessful	2017-04-19 14:19:14 +02:00
Antonio Ercole De Luca	0fdebc6739	Removing all the "#!/usr/bin/env python" lines, they do not need for … (#34 ) * Removing all the "#!/usr/bin/env python" lines, they do not need for python3, solving issue number: #19. * Restored all the shebangs in the tools and tests folders (because they are real executables) but used "#!/usr/bin/env python" instead of "#!/usr/bin/python" as this blog points out: https://www.peterbe.com/plog/importance-of-env Removed also the shebang from pdfminer/psparser.py file.	2016-11-08 20:01:11 +01:00
Friedrich Lindenberg	1d54ecd31c	Make the logger run in a namespace.	2016-05-20 21:12:05 +02:00
Steve Hair	92c71436b9	Improved settings management	2016-01-10 12:17:38 -05:00
Chris Hager	b686dd0139	pdfminer/settings.py for STRICT and added ENFORCE_CHECK_EXTRACTABLE	2015-11-01 22:28:08 +01:00
cybjit	515687e1bb	more xrange to range	2014-09-16 23:17:31 +02:00
cybjit	ad05121c69	password py3	2014-09-16 22:59:00 +02:00
cybjit	14585987c3	keep password api unicode, latin1 or utf-8 is encoded in handler	2014-09-16 22:58:25 +02:00
cybjit	39942b6642	avoid string formating when not logging	2014-09-12 00:29:31 +02:00
cybjit	cc733c8217	fixes for ARC4	2014-09-07 18:38:22 +02:00
unknown	a6475b61b4	Python 3.4 support added and tested	2014-09-03 13:17:41 +02:00
unknown	846cd18186	Python 3.4 support	2014-09-02 15:49:46 +02:00
unknown	faea7291a8	tests pass under Py 2.7 and 3.4	2014-09-01 14:16:49 +02:00
Yusuke Shinyama	b0e035c24f	Style fix: always have an explicit return.	2014-07-15 21:38:29 +09:00
Yusuke Shinyama	1ccfaff411	String-Bytes distinction (first attempt).	2014-06-30 19:05:56 +09:00
Yusuke Shinyama	8791355e1d	Cleanup imports. Use relative imports.	2014-06-26 18:12:39 +09:00
Yusuke Shinyama	1384a3fe8d	Code cleanup: removed some debug flags.	2014-06-14 15:43:10 +09:00
Yusuke Shinyama	8e14ebf4e1	Use logging module instead of print.	2014-06-14 12:00:49 +09:00
numion	a4997d6f10	Implement revision 4 and 5 encryption handler.	2014-05-19 16:27:43 +02:00
Yusuke Shinyama	7849c8724a	Fixed: PDFXRefStream.get_objids returns invalid objids.	2014-03-28 17:29:26 +09:00
Yusuke Shinyama	57adad55d7	Revert the wrong fix.	2014-03-28 17:24:03 +09:00
Yusuke Shinyama	ee47a6603a	Fixed: issues #45	2014-03-28 00:18:17 +09:00
Yusuke Shinyama	4b2beba398	Code cleanup.	2014-03-24 20:59:24 +09:00
Yusuke Shinyama	f9079e4c0a	Fixed dumppdf.py issues.	2014-03-24 20:55:00 +09:00
Yusuke Shinyama	607be269ab	Applied a patch by Axel Kaiser.	2014-03-24 20:45:35 +09:00
Yusuke Shinyama	d7c4ff28e9	Applied a patch by Axel Kaiser.	2014-03-24 20:39:30 +09:00
Yusuke Shinyama	c97ec3048e	Changed / to // for clarity.	2013-11-26 21:35:16 +09:00
Yusuke Shinyama	b589da51b7	Fix for malformed PDFs.	2013-11-26 21:27:45 +09:00
Yusuke Shinyama	2b56b2eedf	Merged.	2013-11-07 19:50:41 +09:00
Matthew Duggan	2caa5edc25	PEP8: Whitespace changes to match pep8	2013-11-07 17:35:04 +09:00
Matthew Duggan	c1da8b835c	PEP8: Remove trailing whitespace	2013-11-07 16:14:53 +09:00
Matthew Duggan	10a68c83bd	Remove unused imports identified by pyflakes	2013-11-07 16:09:44 +09:00
Yusuke Shinyama	0ea08890d4	renamed: python2 -> python.	2013-10-17 23:05:27 +09:00
Yusuke Shinyama	1455f134c6	Fixed: missing ObjStm due to invalid seek.	2013-10-10 20:10:57 +09:00

1 2

58 Commits (8ea9f1091a7eef307a80483fbdc6265e1fcf925f)