Commit Graph

763 Commits (733ddf7e570d9d20c28de7f965c07aec464a936d)

Author SHA1 Message Date
Fakabbir Amin cc40af3d2b Removes @property, Adds docstring 2019-07-15 14:21:21 +05:30
Pieter Marsman 2bb850cdae Fix error, python2 cannot handle unicode in a .py file 2019-07-14 15:43:07 +02:00
Pieter Marsman 1e24bfa0bd Fix error, python2 cannot handle unicode in a .py file 2019-07-14 15:40:22 +02:00
Pieter Marsman c597e95a9f Use KeyError to signal that the name does not resemble any unicode, this pattern is also used in the rest of pdfminer.six 2019-07-14 15:37:15 +02:00
Pieter Marsman fdb7e54862 Add lowercase adobe glyph name tests 2019-07-14 15:20:25 +02:00
Pieter Marsman 33cc9861ae Add docstring to Type1FontHeaderParser.get_encoding() that describes that the custom CharStrings of the font are mapped to '' 2019-07-14 15:19:17 +02:00
Pieter Marsman f0392f8049 Change implementation of name2unicode such that it follows the Adobe Glyph specs (with allowing lowercase) 2019-07-14 15:16:42 +02:00
Fakabbir Amin 8e4a82ad8b Corrects Indentation 2019-07-13 05:00:25 +05:30
Fakabbir Amin c022358c8d Encapsulates character map name 2019-07-13 04:52:24 +05:30
Pieter Marsman 5d7ac7e88a Added test for overflow error reported by @jtlz2: https://github.com/pdfminer/pdfminer.six/issues/177#issuecomment-510173228_ 2019-07-10 20:44:23 +02:00
Pieter Marsman ec5218a05f Add some (failing) unittests for name2unicode based on the examples in the Adobe Glyph List Specification 2019-07-10 20:35:42 +02:00
Tata Ganesh 127d587431
Merge pull request #227 from BackEndTea/patch-1
Remove self refference on python3
2019-07-10 23:12:30 +05:30
Tata Ganesh 578464ab00
Merge pull request #259 from pietermarsman/258-add-contribution-guidelines
Add contribution guidelines and refer to it in the readme
2019-07-10 23:10:25 +05:30
Pieter Marsman 5acfdd8f9b Add sentence about including pdf's in issues 2019-07-10 19:38:38 +02:00
Andrew Baumann 6b312edd6a
Merge pull request #230 from 0xabu/unicode_glyph_bug
name2unicode(): handle hexadecimal literals for unicode glyphs in text extraction

Unicode literals are hex, not decimal (refer: https://github.com/adobe-type-tools/agl-specification). We are still far from conformance with the full spec, but this handles more PDFs seen in the wild.
2019-07-09 12:35:39 -07:00
Pieter Marsman 2743f2b20a Add reference to contribution guidelines in README.md 2019-07-08 23:05:47 +02:00
Pieter Marsman a03566da21 Add contribution guidelines 2019-07-08 23:03:47 +02:00
Andrew Baumann c4c0a36e4f name2unicode(): handle hexadecimal constants for unicode glyphs
fixes #183, #229
2019-02-25 13:08:04 -08:00
John Kesegich 8ab2e287be Handle PDFStream as character map name in PDFCIDFont 2019-02-25 11:42:30 -06:00
Gert de Pagter 594321fcf6
Remove self refference on python3
This *is* the 'six' repo, so no need to mention that again in the readme
2019-02-25 16:38:46 +01:00
Felix Schwarz 5ff84b83fb use conditional requirements to ensure "chardet" listed as requirement on Python 3 (fixes #213)
Previously "chardet" was added only added when setup.py was run with Python 3.
However wheels contain a static list of requirements and a wheel-based
install will never execute setup.py at installation time.

pdfminer.six uses universal wheels for Python 2 and Python 3 so the
requirements will always be wrong on one version (see #213).

The solution is to use conditional requirements as specified in PEP 496
which are evaluated at installation time.
2019-01-18 11:24:51 +01:00
ganeshtata b6a5848208 FEAT: Release 20181108 2018-11-08 22:37:11 +05:30
ganeshtata 90d61f2a3a DOCS: changelog for version 20181108 added 2018-11-08 22:08:17 +05:30
Tata Ganesh e03ecab856
Merge pull request #141 from timb07/speedup_layout
Speed up layout of text boxes
2018-11-08 20:28:40 +05:30
James R. Barlow 2ede124142 Interpet font Descent as a negative number even if specified as positive
The PDF RM specifies that Descent should be negative. Fonts that claim
to have a positive Descent (not that it would make sense) always seem
to be wrong about this claim.
2018-11-03 23:17:48 -07:00
Tata Ganesh 1ea9446bd6
Merge pull request #173 from wlbentley/pdf2txt-maketheparser
Move argparse object setup out of pdf2txt.py main to separate function.
2018-08-21 23:40:50 +05:30
Wm Bentley 495c92e050 Move argparse object setup out of main to separate function.
As preparation for implementing Sphinx documentation, create a
separate function that builds and returns the argparse parser.
Move import argparse out of main to the top of the file.
2018-08-12 21:07:52 -07:00
Tata Ganesh 259b29299e
Merge pull request #133 from timb07/speedup
Speed up handling of PDFs with large images
2018-07-15 11:27:35 +05:30
Tata Ganesh d437e5c637
Merge pull request #142 from mawoqiw/master
Fixes needed to be able to compile pdfminer.six with Cython
2018-07-14 18:02:29 +05:30
Martin Wolf edaf2c9e3f move unittest to main() 2018-06-26 00:51:51 +02:00
Martin Wolf 603a927e6c remove pygame from dependencies, it is actually not required to compile with Cython 2018-06-26 00:50:45 +02:00
Martin Wolf eff3f19886 Merge remote-tracking branch 'upstream/master' 2018-06-25 23:32:52 +02:00
Tata Ganesh 9c7bdcc716
Merge pull request #157 from h2ri/master
decode cid: 160 and 173 to spaces
2018-06-25 11:19:27 +05:30
Tata Ganesh 07eafe7b27
Merge pull request #160 from charlesreid1/fix-line-endings
apply dos2unix to files in pdfminer/tools dirs to remove \r\n windows line endings
2018-06-25 11:19:14 +05:30
Charles Reid 7b08cdbff9 apply dos2unix to files in pdfminer/ and tools/ to remove \r\n windows line endings 2018-06-21 12:19:48 -07:00
Goulu 1db260609e
render_string must have 5 params in all PDFDevice classes (#158) 2018-06-21 10:21:26 +02:00
Guglielmetti Philippe 70624a64dd render_string() now takes 3 parameters, not 5 (reverted from commit 95b65536af) 2018-06-21 09:49:45 +02:00
Guglielmetti Philippe 95b65536af render_string() now takes 3 parameters, not 5 2018-06-21 09:28:55 +02:00
Healthi 65eb0cef82 decode cid: 160 and 170 to spaces 2018-06-20 17:17:03 +05:30
Martin Wolf 26f80715ed Merge remote-tracking branch 'upstream/master' 2018-06-20 13:27:18 +02:00
Tata Ganesh ac8bb81c72
DOCS: Update Readme.md 2018-06-17 22:37:32 +05:30
Tata Ganesh 67bc581bd3
Merge pull request #134 from timb07/issue_90
FIX: TypeError caused by bug in _parse_comment; #90 #89 #109
2018-06-14 09:27:34 +05:30
Tata Ganesh 7084d81bd1
Merge pull request #129 from clustree/xml-color
FEAT: Send color to XML conversion
2018-06-10 21:02:34 +05:30
Tata Ganesh 72d6e93045
FIX: Removed python 2.6 build
- python 2.6 is being no longer supported, and some of the PRs are failing the python 2.6 check
2018-06-10 19:52:33 +05:30
Tata Ganesh 653b4f33ee
Merge pull request #139 from gvmori/py2fontfix
FIX: Only check for bytes input to enc() in python3
Note - For python2, no checks have been put in  place
2018-06-10 16:51:23 +05:30
Tata Ganesh 5550b6a7df
Merge pull request #135 from AndydeCleyre/bugfix/big_P_is_for_Password
FIX: -P is not for page numbers, but passwords; reflect that in the help text
2018-06-10 16:05:09 +05:30
Martin Wolf 4bdb3ba8cc Fixes needed to be able to compile pdfminer.six with Cython 2018-04-12 00:05:38 +02:00
Tim Bell 1cbeaebfce Fix Python 2.6 incompatibility 2018-04-11 10:34:15 +10:00
Tim Bell f87bd1f171 Add sortedcontainers to TravisCI config 2018-04-11 10:18:52 +10:00
Tim Bell 0c8cf748fe Fix copy-paste error 2018-04-11 10:15:32 +10:00