pdfminer.six/pdfminer
Cathal Garvey 1b47bed306 Many changes to make pdf2txt.py work better in Py3, some in that script, others in module!
Sorry, changes should have been more atomic.

*In pdf2txt.py:*

* Re-wrote main function to use argparse instead of optparse.
* Manually tested in Py2/Py3 to get partial consistency.
* Errors abound including Tags mode, but most modes weren't working at all in Py3 anyway.
* Py2 mode *probably* unchanged, cannot find any bugs yet...
* Kept old main function for posterity, for now.

*In utils:*

* Added a few compatibility functions (some string hax required chardet, new dependency):
    - make_compat_bytes(in_str)-> (py3->bytes | py2->str)
    - make_compat_str(in_str)-> (str)
    - compatible_encode_method(bytesorstring, encoding, erraction)-> (str)

*In pdfdevice:*

* To handle different output filetypes in Py3, injected lots of calls to new utils methods,
  as well as some six.PYX checks and logic. These changes are largely responsible for
  enhanced Py2/Py3 consistency.

*In converter:*

* To handle output filetypes in Py2, injected a few checks and fixes particularly around the
  py2 `str.encode` method and its *assumed* usual use-analogies in Py3.
2015-05-17 21:08:57 +01:00
..
Makefile apply more patches 2010-02-13 15:00:43 +00:00
__init__.py version 20140915 pushed to PyPi as pdfminer_six 2014-09-15 10:33:04 +02:00
arcfour.py fixes for ARC4 2014-09-07 18:38:22 +02:00
ascii85.py Python 3.4 compatibility + tests 2014-09-04 09:36:19 +02:00
ccitt.py Python 3.4 compatibility + tests 2014-09-04 09:36:19 +02:00
cmapdb.py avoid string formating when not logging 2014-09-12 00:29:31 +02:00
converter.py Many changes to make pdf2txt.py work better in Py3, some in that script, others in module! 2015-05-17 21:08:57 +01:00
encodingdb.py Python 3.4 support and tests 2014-09-03 15:26:08 +02:00
fontmetrics.py PEP8: Remove trailing whitespace 2013-11-07 16:14:53 +09:00
glyphlist.py renamed: python2 -> python. 2013-10-17 23:05:27 +09:00
image.py more xrange to range 2014-09-16 23:17:31 +02:00
latin_enc.py renamed: python2 -> python. 2013-10-17 23:05:27 +09:00
layout.py Python 3.4 support and tests 2014-09-03 15:26:08 +02:00
lzw.py Python 3.4 compatibility + tests 2014-09-04 09:36:19 +02:00
pdfcolor.py tests pass under Py 2.7 and 3.4 2014-09-01 14:16:49 +02:00
pdfdevice.py Many changes to make pdf2txt.py work better in Py3, some in that script, others in module! 2015-05-17 21:08:57 +01:00
pdfdocument.py more xrange to range 2014-09-16 23:17:31 +02:00
pdffont.py decode cmap font name 2014-09-11 23:30:02 +02:00
pdfinterp.py fix dict_value usage in strict mode 2014-09-16 22:57:29 +02:00
pdfpage.py keep password api unicode, latin1 or utf-8 is encoded in handler 2014-09-16 22:58:25 +02:00
pdfparser.py avoid string formating when not logging 2014-09-12 00:29:31 +02:00
pdftypes.py avoid string formating when not logging 2014-09-12 00:29:31 +02:00
psparser.py avoid string formating when not logging 2014-09-12 00:29:31 +02:00
rijndael.py more xrange to range 2014-09-16 23:17:31 +02:00
runlength.py Python 3.4 compatibility + tests 2014-09-04 09:36:19 +02:00
utils.py Many changes to make pdf2txt.py work better in Py3, some in that script, others in module! 2015-05-17 21:08:57 +01:00