Pieter Marsman
fdb7e54862
Add lowercase adobe glyph name tests
2019-07-14 15:20:25 +02:00
Pieter Marsman
33cc9861ae
Add docstring to Type1FontHeaderParser.get_encoding() that describes that the custom CharStrings of the font are mapped to ''
2019-07-14 15:19:17 +02:00
Pieter Marsman
f0392f8049
Change implementation of name2unicode such that it follows the Adobe Glyph specs (with allowing lowercase)
2019-07-14 15:16:42 +02:00
Fakabbir Amin
8e4a82ad8b
Corrects Indentation
2019-07-13 05:00:25 +05:30
Fakabbir Amin
c022358c8d
Encapsulates character map name
2019-07-13 04:52:24 +05:30
Pieter Marsman
5d7ac7e88a
Added test for overflow error reported by @jtlz2: https://github.com/pdfminer/pdfminer.six/issues/177#issuecomment-510173228_
2019-07-10 20:44:23 +02:00
Pieter Marsman
ec5218a05f
Add some (failing) unittests for name2unicode based on the examples in the Adobe Glyph List Specification
2019-07-10 20:35:42 +02:00
Tata Ganesh
127d587431
Merge pull request #227 from BackEndTea/patch-1
...
Remove self refference on python3
2019-07-10 23:12:30 +05:30
Tata Ganesh
578464ab00
Merge pull request #259 from pietermarsman/258-add-contribution-guidelines
...
Add contribution guidelines and refer to it in the readme
2019-07-10 23:10:25 +05:30
Pieter Marsman
5acfdd8f9b
Add sentence about including pdf's in issues
2019-07-10 19:38:38 +02:00
Andrew Baumann
6b312edd6a
Merge pull request #230 from 0xabu/unicode_glyph_bug
...
name2unicode(): handle hexadecimal literals for unicode glyphs in text extraction
Unicode literals are hex, not decimal (refer: https://github.com/adobe-type-tools/agl-specification ). We are still far from conformance with the full spec, but this handles more PDFs seen in the wild.
2019-07-09 12:35:39 -07:00
Pieter Marsman
2743f2b20a
Add reference to contribution guidelines in README.md
2019-07-08 23:05:47 +02:00
Pieter Marsman
a03566da21
Add contribution guidelines
2019-07-08 23:03:47 +02:00
Andrew Baumann
c4c0a36e4f
name2unicode(): handle hexadecimal constants for unicode glyphs
...
fixes #183 , #229
2019-02-25 13:08:04 -08:00
John Kesegich
8ab2e287be
Handle PDFStream as character map name in PDFCIDFont
2019-02-25 11:42:30 -06:00
Gert de Pagter
594321fcf6
Remove self refference on python3
...
This *is* the 'six' repo, so no need to mention that again in the readme
2019-02-25 16:38:46 +01:00
Felix Schwarz
5ff84b83fb
use conditional requirements to ensure "chardet" listed as requirement on Python 3 ( fixes #213 )
...
Previously "chardet" was added only added when setup.py was run with Python 3.
However wheels contain a static list of requirements and a wheel-based
install will never execute setup.py at installation time.
pdfminer.six uses universal wheels for Python 2 and Python 3 so the
requirements will always be wrong on one version (see #213 ).
The solution is to use conditional requirements as specified in PEP 496
which are evaluated at installation time.
2019-01-18 11:24:51 +01:00
ganeshtata
b6a5848208
FEAT: Release 20181108
2018-11-08 22:37:11 +05:30
ganeshtata
90d61f2a3a
DOCS: changelog for version 20181108 added
2018-11-08 22:08:17 +05:30
Tata Ganesh
e03ecab856
Merge pull request #141 from timb07/speedup_layout
...
Speed up layout of text boxes
2018-11-08 20:28:40 +05:30
James R. Barlow
2ede124142
Interpet font Descent as a negative number even if specified as positive
...
The PDF RM specifies that Descent should be negative. Fonts that claim
to have a positive Descent (not that it would make sense) always seem
to be wrong about this claim.
2018-11-03 23:17:48 -07:00
Tata Ganesh
1ea9446bd6
Merge pull request #173 from wlbentley/pdf2txt-maketheparser
...
Move argparse object setup out of pdf2txt.py main to separate function.
2018-08-21 23:40:50 +05:30
Wm Bentley
495c92e050
Move argparse object setup out of main to separate function.
...
As preparation for implementing Sphinx documentation, create a
separate function that builds and returns the argparse parser.
Move import argparse out of main to the top of the file.
2018-08-12 21:07:52 -07:00
Tata Ganesh
259b29299e
Merge pull request #133 from timb07/speedup
...
Speed up handling of PDFs with large images
2018-07-15 11:27:35 +05:30
Tata Ganesh
d437e5c637
Merge pull request #142 from mawoqiw/master
...
Fixes needed to be able to compile pdfminer.six with Cython
2018-07-14 18:02:29 +05:30
Martin Wolf
edaf2c9e3f
move unittest to main()
2018-06-26 00:51:51 +02:00
Martin Wolf
603a927e6c
remove pygame from dependencies, it is actually not required to compile with Cython
2018-06-26 00:50:45 +02:00
Martin Wolf
eff3f19886
Merge remote-tracking branch 'upstream/master'
2018-06-25 23:32:52 +02:00
Tata Ganesh
9c7bdcc716
Merge pull request #157 from h2ri/master
...
decode cid: 160 and 173 to spaces
2018-06-25 11:19:27 +05:30
Tata Ganesh
07eafe7b27
Merge pull request #160 from charlesreid1/fix-line-endings
...
apply dos2unix to files in pdfminer/tools dirs to remove \r\n windows line endings
2018-06-25 11:19:14 +05:30
Charles Reid
7b08cdbff9
apply dos2unix to files in pdfminer/ and tools/ to remove \r\n windows line endings
2018-06-21 12:19:48 -07:00
Goulu
1db260609e
render_string must have 5 params in all PDFDevice classes ( #158 )
2018-06-21 10:21:26 +02:00
Guglielmetti Philippe
70624a64dd
render_string() now takes 3 parameters, not 5 (reverted from commit 95b65536af
)
2018-06-21 09:49:45 +02:00
Guglielmetti Philippe
95b65536af
render_string() now takes 3 parameters, not 5
2018-06-21 09:28:55 +02:00
Healthi
65eb0cef82
decode cid: 160 and 170 to spaces
2018-06-20 17:17:03 +05:30
Martin Wolf
26f80715ed
Merge remote-tracking branch 'upstream/master'
2018-06-20 13:27:18 +02:00
Tata Ganesh
ac8bb81c72
DOCS: Update Readme.md
2018-06-17 22:37:32 +05:30
Tata Ganesh
67bc581bd3
Merge pull request #134 from timb07/issue_90
...
FIX: TypeError caused by bug in _parse_comment; #90 #89 #109
2018-06-14 09:27:34 +05:30
Tata Ganesh
7084d81bd1
Merge pull request #129 from clustree/xml-color
...
FEAT: Send color to XML conversion
2018-06-10 21:02:34 +05:30
Tata Ganesh
72d6e93045
FIX: Removed python 2.6 build
...
- python 2.6 is being no longer supported, and some of the PRs are failing the python 2.6 check
2018-06-10 19:52:33 +05:30
Tata Ganesh
653b4f33ee
Merge pull request #139 from gvmori/py2fontfix
...
FIX: Only check for bytes input to enc() in python3
Note - For python2, no checks have been put in place
2018-06-10 16:51:23 +05:30
Tata Ganesh
5550b6a7df
Merge pull request #135 from AndydeCleyre/bugfix/big_P_is_for_Password
...
FIX: -P is not for page numbers, but passwords; reflect that in the help text
2018-06-10 16:05:09 +05:30
Martin Wolf
4bdb3ba8cc
Fixes needed to be able to compile pdfminer.six with Cython
2018-04-12 00:05:38 +02:00
Tim Bell
1cbeaebfce
Fix Python 2.6 incompatibility
2018-04-11 10:34:15 +10:00
Tim Bell
f87bd1f171
Add sortedcontainers to TravisCI config
2018-04-11 10:18:52 +10:00
Tim Bell
0c8cf748fe
Fix copy-paste error
2018-04-11 10:15:32 +10:00
Tim Bell
8f8a78bb88
Remove now-unused csort()
2018-04-11 09:37:32 +10:00
Tim Bell
2dda2b12b4
Speedup layout with .sort() and sortedcontainers.SortedListWithKey()
2018-04-11 09:03:32 +10:00
Gregory Mori
335c25c045
only check for bytes input to enc() in python3
...
In python2, isinstance("", bytes) is true, causing enc() to
suppress any string input. This results in fontnames being lost
when running pdf2txt.py in python2.
As this check was not present in the original python2 version of
pdfminer, restrict it to only check when running in python3.
2018-04-09 12:21:59 -07:00
Andy Kluger
ed7d8308d9
-P is *not* for page numbers, but passwords, so reflect that in the help text
2018-04-03 12:26:01 -04:00