Tata Ganesh
48b25939c2
Merge pull request #264 from fakabbir/pdfstream-as-cmap
...
Pdfstream as cmap
2019-07-31 22:20:37 +05:30
Tata Ganesh
42e2c8143b
Merge pull request #263 from pietermarsman/261-glyph-list-specification
...
name2unicode() should follow the Adobe Glyph List Specification
2019-07-26 22:13:34 +05:30
Igor Moura
4eb6f769ea
Merge branch 'develop' of github.com:pdfminer/pdfminer.six into develop
2019-07-24 15:16:12 -03:00
Igor Moura
2f4518231f
Use resolve_all on PdfFont widths and bbox
...
Fixes #268
2019-07-24 15:10:13 -03:00
Tata Ganesh
17364aa88e
Merge pull request #274 from igormp/iteritems
...
Replaced .iteritems() with six.iteritems() for Python 2 and 3 compat
2019-07-24 23:16:25 +05:30
Igor Moura
540df9f676
Replaced .iteritems() with six.iteritems() for Python 3 compat
...
This is a squashed commit; the previous messages can be seen below.
This is the 1st commit message:
Replaced .iteritems() usage for .items()
Fixed some python 2 leftovers, as discussed in #267. Also formatted code according to Black.
This possibly breaks some python 2 compatibility
This is the commit message #2 :
Reverted formatting and more spread six usage
2019-07-24 14:08:30 -03:00
Fakabbir Amin
f1a4dcea88
Adds Test Cases, Neater Code For CMap Assignment
2019-07-24 11:56:06 +05:30
Fakabbir Amin
b4c261b647
Removes Code Comments
2019-07-17 11:43:45 +05:30
Fakabbir Amin
fa400431f5
Adds Test, Removes Unnecessary Assumptions
2019-07-17 11:38:00 +05:30
Pieter Marsman
6f362f53fe
Raise a `KeyError` with a useful message if `unicode2name()` does not match any glyph name. Use this message to log debug statements.
2019-07-16 08:52:24 +02:00
Pieter Marsman
0fb83366b6
Remove intermediate variable `full_stop` because it is just a dot
2019-07-16 08:49:57 +02:00
Fakabbir Amin
cc40af3d2b
Removes @property, Adds docstring
2019-07-15 14:21:21 +05:30
Pieter Marsman
2bb850cdae
Fix error, python2 cannot handle unicode in a .py file
2019-07-14 15:43:07 +02:00
Pieter Marsman
1e24bfa0bd
Fix error, python2 cannot handle unicode in a .py file
2019-07-14 15:40:22 +02:00
Pieter Marsman
c597e95a9f
Use KeyError to signal that the name does not resemble any unicode, this pattern is also used in the rest of pdfminer.six
2019-07-14 15:37:15 +02:00
Pieter Marsman
fdb7e54862
Add lowercase adobe glyph name tests
2019-07-14 15:20:25 +02:00
Pieter Marsman
33cc9861ae
Add docstring to Type1FontHeaderParser.get_encoding() that describes that the custom CharStrings of the font are mapped to ''
2019-07-14 15:19:17 +02:00
Pieter Marsman
f0392f8049
Change implementation of name2unicode such that it follows the Adobe Glyph specs (with allowing lowercase)
2019-07-14 15:16:42 +02:00
Fakabbir Amin
8e4a82ad8b
Corrects Indentation
2019-07-13 05:00:25 +05:30
Fakabbir Amin
c022358c8d
Encapsulates character map name
2019-07-13 04:52:24 +05:30
Pieter Marsman
5d7ac7e88a
Added test for overflow error reported by @jtlz2: https://github.com/pdfminer/pdfminer.six/issues/177#issuecomment-510173228
2019-07-10 20:44:23 +02:00
Pieter Marsman
ec5218a05f
Add some (failing) unittests for name2unicode based on the examples in the Adobe Glyph List Specification
2019-07-10 20:35:42 +02:00
Tata Ganesh
127d587431
Merge pull request #227 from BackEndTea/patch-1
...
Remove self reference on python3
2019-07-10 23:12:30 +05:30
Tata Ganesh
578464ab00
Merge pull request #259 from pietermarsman/258-add-contribution-guidelines
...
Add contribution guidelines and refer to it in the readme
2019-07-10 23:10:25 +05:30
Pieter Marsman
5acfdd8f9b
Add sentence about including PDFs in issues
2019-07-10 19:38:38 +02:00
Andrew Baumann
6b312edd6a
Merge pull request #230 from 0xabu/unicode_glyph_bug
...
name2unicode(): handle hexadecimal literals for unicode glyphs in text extraction
Unicode literals are hex, not decimal (refer: https://github.com/adobe-type-tools/agl-specification ). We are still far from conformance with the full spec, but this handles more PDFs seen in the wild.
2019-07-09 12:35:39 -07:00
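The hex-versus-decimal point in the message above can be illustrated with a minimal sketch. It assumes glyph names of the simple `uniXXXX` form only; the helper name `uni_name_to_char` is hypothetical and this is not the `name2unicode()` implementation from pdfminer.six.

```python
import re

# Hypothetical helper for illustration; not pdfminer.six code.
# Matches "uni" followed by exactly four uppercase hexadecimal digits.
_UNI_NAME = re.compile(r"^uni([0-9A-F]{4})$")


def uni_name_to_char(name):
    """Map a glyph name such as 'uni0041' to its character.

    The four digits are read as hexadecimal, not decimal.
    """
    match = _UNI_NAME.match(name)
    if match is None:
        raise KeyError("not a uniXXXX glyph name: %r" % (name,))
    return chr(int(match.group(1), 16))


print(uni_name_to_char("uni0041"))
```

Reading the digits as decimal would either fail on names containing A-F or silently select a different code point, which is the class of bug the commit describes.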
Pieter Marsman
2743f2b20a
Add reference to contribution guidelines in README.md
2019-07-08 23:05:47 +02:00
Pieter Marsman
a03566da21
Add contribution guidelines
2019-07-08 23:03:47 +02:00
Andrew Baumann
c4c0a36e4f
name2unicode(): handle hexadecimal constants for unicode glyphs
...
fixes #183, #229
2019-02-25 13:08:04 -08:00
John Kesegich
8ab2e287be
Handle PDFStream as character map name in PDFCIDFont
2019-02-25 11:42:30 -06:00
Gert de Pagter
594321fcf6
Remove self reference on python3
...
This *is* the 'six' repo, so no need to mention that again in the readme
2019-02-25 16:38:46 +01:00
Felix Schwarz
5ff84b83fb
use conditional requirements to ensure "chardet" is listed as a requirement on Python 3 (fixes #213)
...
Previously "chardet" was only added when setup.py was run with Python 3.
However wheels contain a static list of requirements and a wheel-based
install will never execute setup.py at installation time.
pdfminer.six uses universal wheels for Python 2 and Python 3 so the
requirements will always be wrong on one version (see #213).
The solution is to use conditional requirements as specified in PEP 496
which are evaluated at installation time.
2019-01-18 11:24:51 +01:00
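A conditional requirement of the kind described above might look roughly like the following. This is a sketch, not the project's actual setup.py: the variable name and the exact marker expression are assumptions. The `name; marker` form is the environment-marker syntax that installers evaluate at install time.

```python
# Illustrative requirement list; the real setup.py may differ.
# The part after ';' is an environment marker, evaluated by the installer
# on the target interpreter rather than when the wheel is built.
install_requires = [
    "six",
    'chardet; python_version > "3.0"',
]

print(install_requires)
```

Because the marker travels inside the wheel metadata, a single universal wheel can declare "chardet" for Python 3 installs and omit it for Python 2 installs.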
ganeshtata
b6a5848208
FEAT: Release 20181108
2018-11-08 22:37:11 +05:30
ganeshtata
90d61f2a3a
DOCS: changelog for version 20181108 added
2018-11-08 22:08:17 +05:30
Tata Ganesh
e03ecab856
Merge pull request #141 from timb07/speedup_layout
...
Speed up layout of text boxes
2018-11-08 20:28:40 +05:30
James R. Barlow
2ede124142
Interpret font Descent as a negative number even if specified as positive
...
The PDF RM specifies that Descent should be negative. Fonts that claim
to have a positive Descent (not that it would make sense) always seem
to be wrong about this claim.
2018-11-03 23:17:48 -07:00
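The normalization described above can be sketched in one line. The function name `normalize_descent` is hypothetical; the commit's actual change may be written differently.

```python
def normalize_descent(descent):
    """Return the font descent as a non-positive number.

    Fonts that report a positive Descent are treated as having
    the same magnitude below the baseline.
    """
    return -abs(descent)


print(normalize_descent(212), normalize_descent(-212))
```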
Tata Ganesh
1ea9446bd6
Merge pull request #173 from wlbentley/pdf2txt-maketheparser
...
Move argparse object setup out of pdf2txt.py main to separate function.
2018-08-21 23:40:50 +05:30
Wm Bentley
495c92e050
Move argparse object setup out of main to separate function.
...
As preparation for implementing Sphinx documentation, create a
separate function that builds and returns the argparse parser.
Move import argparse out of main to the top of the file.
2018-08-12 21:07:52 -07:00
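The refactoring described above follows a common pattern, sketched below under assumptions: the names `build_parser` and `main` and the two options shown are illustrative, not the actual functions or flags in pdf2txt.py.

```python
import argparse  # imported at module level, not inside main()


def build_parser():
    """Build and return the command-line parser without parsing anything."""
    parser = argparse.ArgumentParser(description="Illustrative PDF text extractor.")
    parser.add_argument("files", nargs="+", help="input PDF files")
    parser.add_argument("-o", "--outfile", default="-",
                        help="output file, '-' for stdout")
    return parser


def main(argv=None):
    """Parse arguments with the shared parser and return the namespace."""
    return build_parser().parse_args(argv)


print(main(["a.pdf", "-o", "out.txt"]))
```

Keeping parser construction in its own function lets documentation tooling or tests import and inspect the parser without running the program.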
Tata Ganesh
259b29299e
Merge pull request #133 from timb07/speedup
...
Speed up handling of PDFs with large images
2018-07-15 11:27:35 +05:30
Tata Ganesh
d437e5c637
Merge pull request #142 from mawoqiw/master
...
Fixes needed to be able to compile pdfminer.six with Cython
2018-07-14 18:02:29 +05:30
Martin Wolf
edaf2c9e3f
move unittest to main()
2018-06-26 00:51:51 +02:00
Martin Wolf
603a927e6c
remove pygame from dependencies, it is actually not required to compile with Cython
2018-06-26 00:50:45 +02:00
Martin Wolf
eff3f19886
Merge remote-tracking branch 'upstream/master'
2018-06-25 23:32:52 +02:00
Tata Ganesh
9c7bdcc716
Merge pull request #157 from h2ri/master
...
decode cid: 160 and 173 to spaces
2018-06-25 11:19:27 +05:30
Tata Ganesh
07eafe7b27
Merge pull request #160 from charlesreid1/fix-line-endings
...
apply dos2unix to files in pdfminer/tools dirs to remove \r\n windows line endings
2018-06-25 11:19:14 +05:30
Charles Reid
7b08cdbff9
apply dos2unix to files in pdfminer/ and tools/ to remove \r\n windows line endings
2018-06-21 12:19:48 -07:00
Goulu
1db260609e
render_string must have 5 params in all PDFDevice classes (#158)
2018-06-21 10:21:26 +02:00
Guglielmetti Philippe
70624a64dd
render_string() now takes 3 parameters, not 5 (reverted from commit 95b65536af)
2018-06-21 09:49:45 +02:00
Guglielmetti Philippe
95b65536af
render_string() now takes 3 parameters, not 5
2018-06-21 09:28:55 +02:00
Healthi
65eb0cef82
decode cid: 160 and 170 to spaces
2018-06-20 17:17:03 +05:30