Commit Graph

704 Commits (1e24bfa0bd1ef332e30ffd57b2328ecacc0ff6c4)

Author SHA1 Message Date
Pieter Marsman 1e24bfa0bd Fix error, python2 cannot handle unicode in a .py file 2019-07-14 15:40:22 +02:00
Pieter Marsman c597e95a9f Use KeyError to signal that the name does not resemble any unicode, this pattern is also used in the rest of pdfminer.six 2019-07-14 15:37:15 +02:00
Pieter Marsman fdb7e54862 Add lowercase adobe glyph name tests 2019-07-14 15:20:25 +02:00
Pieter Marsman 33cc9861ae Add docstring to Type1FontHeaderParser.get_encoding() that describes that the custom CharStrings of the font are mapped to '' 2019-07-14 15:19:17 +02:00
Pieter Marsman f0392f8049 Change implementation of name2unicode such that it follows the Adobe Glyph specs (with allowing lowercase) 2019-07-14 15:16:42 +02:00
Pieter Marsman 5d7ac7e88a Added test for overflow error reported by @jtlz2: https://github.com/pdfminer/pdfminer.six/issues/177#issuecomment-510173228_ 2019-07-10 20:44:23 +02:00
Pieter Marsman ec5218a05f Add some (failing) unittests for name2unicode based on the examples in the Adobe Glyph List Specification 2019-07-10 20:35:42 +02:00
Tata Ganesh 127d587431
Merge pull request #227 from BackEndTea/patch-1
Remove self refference on python3
2019-07-10 23:12:30 +05:30
Tata Ganesh 578464ab00
Merge pull request #259 from pietermarsman/258-add-contribution-guidelines
Add contribution guidelines and refer to it in the readme
2019-07-10 23:10:25 +05:30
Pieter Marsman 5acfdd8f9b Add sentence about including pdf's in issues 2019-07-10 19:38:38 +02:00
Pieter Marsman 2743f2b20a Add reference to contribution guidelines in README.md 2019-07-08 23:05:47 +02:00
Pieter Marsman a03566da21 Add contribution guidelines 2019-07-08 23:03:47 +02:00
Gert de Pagter 594321fcf6
Remove self refference on python3
This *is* the 'six' repo, so no need to mention that again in the readme
2019-02-25 16:38:46 +01:00
ganeshtata b6a5848208 FEAT: Release 20181108 2018-11-08 22:37:11 +05:30
ganeshtata 90d61f2a3a DOCS: changelog for version 20181108 added 2018-11-08 22:08:17 +05:30
Tata Ganesh e03ecab856
Merge pull request #141 from timb07/speedup_layout
Speed up layout of text boxes
2018-11-08 20:28:40 +05:30
Tata Ganesh 1ea9446bd6
Merge pull request #173 from wlbentley/pdf2txt-maketheparser
Move argparse object setup out of pdf2txt.py main to separate function.
2018-08-21 23:40:50 +05:30
Wm Bentley 495c92e050 Move argparse object setup out of main to separate function.
As preparation for implementing Sphinx documentation, create a
separate function that builds and returns the argparse parser.
Move import argparse out of main to the top of the file.
2018-08-12 21:07:52 -07:00
Tata Ganesh 259b29299e
Merge pull request #133 from timb07/speedup
Speed up handling of PDFs with large images
2018-07-15 11:27:35 +05:30
Tata Ganesh d437e5c637
Merge pull request #142 from mawoqiw/master
Fixes needed to be able to compile pdfminer.six with Cython
2018-07-14 18:02:29 +05:30
Martin Wolf edaf2c9e3f move unittest to main() 2018-06-26 00:51:51 +02:00
Martin Wolf 603a927e6c remove pygame from dependencies, it is actually not required to compile with Cython 2018-06-26 00:50:45 +02:00
Martin Wolf eff3f19886 Merge remote-tracking branch 'upstream/master' 2018-06-25 23:32:52 +02:00
Tata Ganesh 9c7bdcc716
Merge pull request #157 from h2ri/master
decode cid: 160 and 173 to spaces
2018-06-25 11:19:27 +05:30
Tata Ganesh 07eafe7b27
Merge pull request #160 from charlesreid1/fix-line-endings
apply dos2unix to files in pdfminer/tools dirs to remove \r\n windows line endings
2018-06-25 11:19:14 +05:30
Charles Reid 7b08cdbff9 apply dos2unix to files in pdfminer/ and tools/ to remove \r\n windows line endings 2018-06-21 12:19:48 -07:00
Goulu 1db260609e
render_string must have 5 params in all PDFDevice classes (#158) 2018-06-21 10:21:26 +02:00
Guglielmetti Philippe 70624a64dd render_string() now takes 3 parameters, not 5 (reverted from commit 95b65536af) 2018-06-21 09:49:45 +02:00
Guglielmetti Philippe 95b65536af render_string() now takes 3 parameters, not 5 2018-06-21 09:28:55 +02:00
Healthi 65eb0cef82 decode cid: 160 and 170 to spaces 2018-06-20 17:17:03 +05:30
Martin Wolf 26f80715ed Merge remote-tracking branch 'upstream/master' 2018-06-20 13:27:18 +02:00
Tata Ganesh ac8bb81c72
DOCS: Update Readme.md 2018-06-17 22:37:32 +05:30
Tata Ganesh 67bc581bd3
Merge pull request #134 from timb07/issue_90
FIX: TypeError caused by bug in _parse_comment; #90 #89 #109
2018-06-14 09:27:34 +05:30
Tata Ganesh 7084d81bd1
Merge pull request #129 from clustree/xml-color
FEAT: Send color to XML conversion
2018-06-10 21:02:34 +05:30
Tata Ganesh 72d6e93045
FIX: Removed python 2.6 build
- python 2.6 is being no longer supported, and some of the PRs are failing the python 2.6 check
2018-06-10 19:52:33 +05:30
Tata Ganesh 653b4f33ee
Merge pull request #139 from gvmori/py2fontfix
FIX: Only check for bytes input to enc() in python3
Note - For python2, no checks have been put in  place
2018-06-10 16:51:23 +05:30
Tata Ganesh 5550b6a7df
Merge pull request #135 from AndydeCleyre/bugfix/big_P_is_for_Password
FIX: -P is not for page numbers, but passwords; reflect that in the help text
2018-06-10 16:05:09 +05:30
Martin Wolf 4bdb3ba8cc Fixes needed to be able to compile pdfminer.six with Cython 2018-04-12 00:05:38 +02:00
Tim Bell 1cbeaebfce Fix Python 2.6 incompatibility 2018-04-11 10:34:15 +10:00
Tim Bell f87bd1f171 Add sortedcontainers to TravisCI config 2018-04-11 10:18:52 +10:00
Tim Bell 0c8cf748fe Fix copy-paste error 2018-04-11 10:15:32 +10:00
Tim Bell 8f8a78bb88 Remove now-unused csort() 2018-04-11 09:37:32 +10:00
Tim Bell 2dda2b12b4 Speedup layout with .sort() and sortedcontainers.SortedListWithKey() 2018-04-11 09:03:32 +10:00
Gregory Mori 335c25c045 only check for bytes input to enc() in python3
In python2, isinstance("", bytes) is true, causing enc() to
suppress any string input. This results in fontnames being lost
when running pdf2txt.py in python2.

As this check was not present in the original python2 version of
pdfminer, restrict it to only check when running in python3.
2018-04-09 12:21:59 -07:00
Andy Kluger ed7d8308d9 -P is *not* for page numbers, but passwords, so reflect that in the help text 2018-04-03 12:26:01 -04:00
Tim Bell 981e3a575e Fix TypeError caused by bug in _parse_comment; #90 #89 #109 2018-04-03 12:47:40 +10:00
Tim Bell 083f11b165 Fix cases where a bytearray doesn't work in place of bytes 2018-04-03 07:27:29 +10:00
Tim Bell 185ddeb2ab Speed up handling of PDFs with large images with more minimal change 2018-04-03 07:21:21 +10:00
Tim Bell fab1c9462c Speed up handling of PDFs with large images 2018-03-29 14:21:31 +11:00
Tata Ganesh eddf861fbd
Merge pull request #125 from yosida95/bytes-type
Fix type of an argument to PDFFont#decode to bytes in py3
2018-03-19 11:00:10 +05:30