pdfminer.six

Commit Graph

Author	SHA1	Message	Date
fabbox	7eff108fa5	add shebang line to script in tools (#408 ) * add shebang line to script in tools * fix: use shebang line with python 3 * Moved changelog to unreleased Co-authored-by: Pieter Marsman <pietermarsman@gmail.com>	2020-04-28 10:58:42 +02:00
Pieter Marsman	2f7f5d2667	Fallback on backwards-compatible key (F) for embedded files URL's when the unicode URL (UF) does not exist (#338 ) * Fix getting filename when extracting embedded files * Add test for pdf that contains embedded pdf, and fix additional errors in looping over multiple xrefs * Add line to CHANGELOG	2020-01-16 22:11:42 +01:00
Pieter Marsman	3502dc9f3b	Drop support for legacy Python 2 (#346 ) * Drop support for legacy Python 2 * Add python_requires to help pip * Upgrade Python syntax with pyupgrade * Upgrade Python syntax with pyupgrade --py3-plus * Python 3 imports * Replace six * Update CONTRIBUTING.md * Added line to changelog Co-authored-by: Hugo van Kemenade <hugovk@users.noreply.github.com>	2020-01-04 16:47:07 +01:00
Pieter Marsman	f3ab1bc61e	Enforce pep8 coding-style (#345 ) * Code Refractor: Use code-style enforcement #312 * Add flake8 to travis-ci * Remove python 2 3 comment on six library. 891 errors > 870 errors. * Remove class and functions comments that consist of just the name. 870 errors > 855 errors. * Fix flake8 errors in pdftypes.py. 855 errors > 833 errors. * Moving flake8 testing from .travis.yml to tox.ini to ensure local testing before commiting * Cleanup pdfinterp.py and add documentation from PDF Reference * Cleanup pdfpage.py * Cleanup pdffont.py * Clean psparser.py * Cleanup high_level.py * Cleanup layout.py * Cleanup pdfparser.py * Cleanup pdfcolor.py * Cleanup rijndael.py * Cleanup converter.py * Rename klass to cls if it is the class variable, to be more consistent with standard practice * Cleanup cmap.py * Cleanup pdfdevice.py * flake8 ignore fontmetrics.py * Cleanup test_pdfminer_psparser.py * Fix flake8 in pdfdocument.py; 339 errors to go * Fix flake8 utils.py; 326 errors togo * pep8 correction for few files in /tools/ 328 > 160 to go (#342) * pep8 correction for few files in /tools/ 328 > 160 to go * pep8 correction: 160 > 5 to go * Fix ascii85.py errors * Fix error in getting index from target that does not exists * Remove commented print lines * Fix flake8 error in pdfinterp.py * Fix python2 specific error by removing argument from print statement * Ignore invalid python2 syntax * Update contributing.md * Added changelog * Remove unused import Co-authored-by: Fakabbir Amin <f4amin@gmail.com>	2019-12-29 21:20:20 +01:00
Pieter Marsman	bc034c8e59	Create sphinx documentation for Read the Docs (#329 ) Fixes #171 Fixes #199 Fixes #118 Fixes #178 Added: tests for building documentation and example code in documentation Added: docstrings for common used functions and classes Removed: old documentation	2019-11-07 21:12:34 +01:00
Martin Hasoň	ed1b09c6f2	Fix debug logging for pdf2txt.py and dumppdf.py (#325 ) Fixes #313	2019-11-06 21:47:19 +01:00
Pieter Marsman	6cc78ee124	Replace opts by argparse in dumppdf.py (#321 ) Also add multi-character argument names Fixes #175	2019-10-27 21:40:04 +01:00
Attila Szász	938419c476	Align dumppdf tool to modified data structures. (#73 ) * Align dumppdf tool to modified data structures. TOC page numbers should also work now, counting from 1. * Update version number.	2017-07-20 20:46:11 +02:00
Antonio Ercole De Luca	0fdebc6739	Removing all the "#!/usr/bin/env python" lines, they do not need for … (#34 ) * Removing all the "#!/usr/bin/env python" lines, they do not need for python3, solving issue number: #19. * Restored all the shebangs in the tools and tests folders (because they are real executables) but used "#!/usr/bin/env python" instead of "#!/usr/bin/python" as this blog points out: https://www.peterbe.com/plog/importance-of-env Removed also the shebang from pdfminer/psparser.py file.	2016-11-08 20:01:11 +01:00
cybjit	2639b15ef4	guess argv encoding in py2 using sys.stdin.encoding	2014-09-16 23:17:26 +02:00
cybjit	14585987c3	keep password api unicode, latin1 or utf-8 is encoded in handler	2014-09-16 22:58:25 +02:00
cybjit	714423883c	setup logging for pdf2txt and fix dumppdf	2014-09-12 00:29:31 +02:00
unknown	28c2a4e6ad	2.7/3.4 encoding corrected	2014-09-04 10:31:33 +02:00
unknown	a6475b61b4	Python 3.4 support added and tested	2014-09-03 13:17:41 +02:00
Yusuke Shinyama	bb866ae148	Changed: new except syntax (2.6 or above).	2014-06-16 18:50:07 +09:00
Yusuke Shinyama	28e96ba3d0	Use print as a function.	2014-06-15 12:14:33 +09:00
Yusuke Shinyama	340387bfc6	Cleanup: isinstance	2014-03-28 17:50:59 +09:00
Yusuke Shinyama	f9079e4c0a	Fixed dumppdf.py issues.	2014-03-24 20:55:00 +09:00
Alex Rothberg	af8c4a6b8f	- only visit each objid once when dumping all objects	2013-11-18 20:41:09 -05:00
Matthew Duggan	c1da8b835c	PEP8: Remove trailing whitespace	2013-11-07 16:14:53 +09:00
Yusuke Shinyama	32844507ea	Fixed some style issues.	2013-10-19 08:41:01 +09:00
Yusuke Shinyama	28cb424f8f	Merge pull request #21 from eug48/master dumppdf: support for extracting embedded files using the -E option	2013-10-18 16:23:09 -07:00
Yusuke Shinyama	6ca9ac5434	chmod fix.	2013-10-17 23:06:07 +09:00
Yusuke Shinyama	0ea08890d4	renamed: python2 -> python.	2013-10-17 23:05:27 +09:00
Yusuke Shinyama	f85c374cae	Separated PDFPage to pdfpage.py.	2013-10-10 19:54:55 +09:00
Yusuke Shinyama	c926874d20	API Change: the PDFDocument cstr now takes PDFParser. set_parser() is removed.	2013-10-10 18:40:06 +09:00
Yusuke Shinyama	2221163b94	Split pdfparser.py and pdfdocument.py.	2013-10-10 18:29:30 +09:00
Yusuke Shinyama	1467fc674c	Added fallback for broken PDFs.	2013-10-09 22:45:54 +09:00
Yusuke Shinyama	06425bba00	Introducing PDFObjectNotFound	2013-10-09 21:39:23 +09:00
eug	925845b172	dumppdf: support for extracting embedded files using the -E option	2013-01-20 13:29:35 +10:00
yusuke.shinyama.dummy	509ab66319	stay with python2 git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@264 1aa58f4a-7d42-0410-adbc-911cccaed67c	2010-10-19 09:57:01 +00:00
yusuke.shinyama.dummy	afe33312c6	outline bug fixed git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@249 1aa58f4a-7d42-0410-adbc-911cccaed67c	2010-10-17 05:14:52 +00:00
yusuke.shinyama.dummy	ca5588a702	bugfix by Humberto Pereira git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@241 1aa58f4a-7d42-0410-adbc-911cccaed67c	2010-08-29 06:59:50 +00:00
yusuke.shinyama.dummy	9052cd1ea7	better TOC extraction git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@207 1aa58f4a-7d42-0410-adbc-911cccaed67c	2010-04-24 01:34:18 +00:00
yusuke.shinyama.dummy	2555b38836	fix typos (patches by sm) git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@183 1aa58f4a-7d42-0410-adbc-911cccaed67c	2010-02-15 14:50:19 +00:00
yusuke.shinyama.dummy	538a605ac0	several bugfixes. git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@179 1aa58f4a-7d42-0410-adbc-911cccaed67c	2010-02-07 03:14:00 +00:00
yusuke.shinyama.dummy	dc6e5c366d	jpeg extraction support added. git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@174 1aa58f4a-7d42-0410-adbc-911cccaed67c	2010-01-30 07:30:01 +00:00
yusuke.shinyama.dummy	98c8367339	warning removal. code cleanup. cmap bug fixed. git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@168 1aa58f4a-7d42-0410-adbc-911cccaed67c	2010-01-01 03:09:26 +00:00
yusuke.shinyama.dummy	faa775897c	another bugfix git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@156 1aa58f4a-7d42-0410-adbc-911cccaed67c	2009-11-07 09:01:11 +00:00
yusuke.shinyama.dummy	f444c88e3d	testing against None with "is", not using "==" git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@153 1aa58f4a-7d42-0410-adbc-911cccaed67c	2009-11-06 15:10:29 +00:00
yusuke.shinyama.dummy	7790808560	to 4-space indentation git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@142 1aa58f4a-7d42-0410-adbc-911cccaed67c	2009-10-24 04:41:59 +00:00
yusuke.shinyama.dummy	3e12268bf6	rename package pdflib -> pdfminer. git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@103 1aa58f4a-7d42-0410-adbc-911cccaed67c	2009-05-16 06:12:01 +00:00
yusuke.shinyama.dummy	f628c0d3fe	git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@101 1aa58f4a-7d42-0410-adbc-911cccaed67c	2009-05-15 14:34:53 +00:00
yusuke.shinyama.dummy	43e5c05307	handle error when an object was not found in dumpxml() git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@92 1aa58f4a-7d42-0410-adbc-911cccaed67c	2009-04-26 15:03:47 +00:00
yusuke.shinyama.dummy	f8510edffc	AsciiHexDecode filter patch incorporated. Thanks to Troy Bollinger. git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@86 1aa58f4a-7d42-0410-adbc-911cccaed67c	2009-04-08 10:55:01 +00:00
yusuke.shinyama.dummy	70e42bff04	encoding bug fixed. git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@74 1aa58f4a-7d42-0410-adbc-911cccaed67c	2009-03-24 16:26:59 +00:00
yusuke.shinyama.dummy	b432a3f4ae	patch from Troy Bollinger. git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@71 1aa58f4a-7d42-0410-adbc-911cccaed67c	2009-02-28 05:44:08 +00:00
yusuke.shinyama.dummy	91770edd46	foo git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@59 1aa58f4a-7d42-0410-adbc-911cccaed67c	2009-01-10 09:25:03 +00:00
yusuke.shinyama.dummy	24bdd33557	various bugfixes git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@56 1aa58f4a-7d42-0410-adbc-911cccaed67c	2009-01-05 04:40:50 +00:00
yusuke.shinyama.dummy	71be16febe	wordspace handling improved. git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@55 1aa58f4a-7d42-0410-adbc-911cccaed67c	2008-12-25 15:09:54 +00:00

1 2

54 Commits (7254530d2782aaf0b710f798ef91d3b673dcb25f)