Pieter Marsman
63b2e09ac3
Merge pull request #203 from jbarlow83/negative-descent
...
Interpret font Descent as a negative number even if specified as positive
2019-10-13 20:06:52 +02:00
Tata Ganesh
f218996fe9
Merge pull request #273 from igormp/develop
...
Use resolve_all on PdfFont widths and bbox
2019-10-12 21:24:29 +05:30
Fakabbir Amin
7c03d96d25
Corrects Comment
2019-08-20 17:16:10 +05:30
Fakabbir Amin
abd685fdc6
Corrects Code Comment
2019-08-20 17:13:27 +05:30
Fakabbir Amin
3d549ea48c
Removes code comments
2019-08-20 16:48:40 +05:30
Igor Moura
cf4641d877
Merge branch 'develop' into develop
2019-08-15 08:11:28 -03:00
Fakabbir Amin
fe38695739
Merge branch 'develop' into pdfstream-as-cmap
2019-08-10 10:44:31 +05:30
Fakabbir Amin
5a0d8db052
Adds decoder for OnebyteIdentityH/V instead of using default CMap
2019-08-10 10:07:23 +05:30
Tata Ganesh
42e2c8143b
Merge pull request #263 from pietermarsman/261-glyph-list-specification
...
name2unicode() should follow the Adobe Glyph List Specification
2019-07-26 22:13:34 +05:30
Igor Moura
2f4518231f
Use resolve_all on PdfFont widths and bbox
...
Fixes #268
2019-07-24 15:10:13 -03:00
Igor Moura
540df9f676
Replaced .iteritems() and with six.iteritems() for Python 3 compat
...
This is a squashed commit, the previous messages can be seen bellow
This is the 1st commit message:
Replaced .iteritems() usage for .items()
Fixed some python 2 leftovers, as discussed in #267 . Also formatted code according to Black.\nThis possibly breaks some python 2 compatibility
This is the commit message #2 :
Reverted formatting and more spread six usage
2019-07-24 14:08:30 -03:00
Fakabbir Amin
f1a4dcea88
Adds Test Cases, Neater Code For CMap Assignment
2019-07-24 11:56:06 +05:30
Fakabbir Amin
fa400431f5
Adds Test, Removes Unnecessary Assumptions
2019-07-17 11:38:00 +05:30
Pieter Marsman
6f362f53fe
Raise a `KeyError` with a useful message if `unicode2name()` does not match any glyph name. Use this message to log debug statements.
2019-07-16 08:52:24 +02:00
Fakabbir Amin
cc40af3d2b
Removes @property, Adds docstring
2019-07-15 14:21:21 +05:30
Pieter Marsman
33cc9861ae
Add docstring to Type1FontHeaderParser.get_encoding() that describes that the custom CharStrings of the font are mapped to ''
2019-07-14 15:19:17 +02:00
Fakabbir Amin
8e4a82ad8b
Corrects Indentation
2019-07-13 05:00:25 +05:30
Fakabbir Amin
c022358c8d
Encapsulates character map name
2019-07-13 04:52:24 +05:30
John Kesegich
8ab2e287be
Handle PDFStream as character map name in PDFCIDFont
2019-02-25 11:42:30 -06:00
James R. Barlow
2ede124142
Interpet font Descent as a negative number even if specified as positive
...
The PDF RM specifies that Descent should be negative. Fonts that claim
to have a positive Descent (not that it would make sense) always seem
to be wrong about this claim.
2018-11-03 23:17:48 -07:00
Sebastian Schuberth
fcd3e6ce00
Catch an error unpack might throw instead of checking the length before
2017-10-30 19:31:58 +01:00
Sebastian Schuberth
39428fb4f0
TrueTypeFont: Check for enough data to unpack
...
Fixes https://github.com/euske/pdfminer/issues/96
and https://github.com/euske/pdfminer/issues/144 .
2017-10-16 12:35:04 +02:00
Venelin Stoykov
171cdcc69d
Microoptimization for singlebyte fonts ( #84 )
...
Instead of list comprehension which will call a function to get the integer value of the bytes directly convert it to bytearray which is more optimal structure for storing list of bytes.
2017-08-18 08:10:27 +02:00
Hugh Secker-Walker
488545ddc7
Add string expressions to asserts showing local data ( #67 )
2017-05-29 09:06:09 +02:00
Andrew Baumann
9439a3a31a
Miscellaneous bug fixes ( #47 )
...
* utils.decode_text: fix "TypeError: ord() expected string of length 1, but int found"
fixes https://github.com/goulu/pdfminer/issues/24
* pdfinterp.execute: don't assume that every keyword name can be decoded as utf-8
fixes "'str' does not support the buffer interface", https://github.com/goulu/pdfminer/issues/23
* default settings.STRICT to False, for compatibility with the original pdfminer
* PDFCIDFont: handle font registry/orderings that may be PDFObjRefs
* utils.nunpack: handle 8-byte integers
2017-02-06 14:57:01 +01:00
Antonio Ercole De Luca
0fdebc6739
Removing all the "#!/usr/bin/env python" lines, they do not need for … ( #34 )
...
* Removing all the "#!/usr/bin/env python" lines, they do not need for python3, solving issue number: #19 .
* Restored all the shebangs in the tools and tests folders (because they are real executables) but used "#!/usr/bin/env python" instead of "#!/usr/bin/python" as this blog points out: https://www.peterbe.com/plog/importance-of-env
Removed also the shebang from pdfminer/psparser.py file.
2016-11-08 20:01:11 +01:00
Friedrich Lindenberg
70918095cc
Return an empty list when no `Differences` are found.
2016-09-24 11:57:11 +02:00
Steve Hair
92c71436b9
Improved settings management
2016-01-10 12:17:38 -05:00
Chris Hager
b686dd0139
pdfminer/settings.py for STRICT and added ENFORCE_CHECK_EXTRACTABLE
2015-11-01 22:28:08 +01:00
cybjit
9b0a3ee53e
decode cmap font name
2014-09-11 23:30:02 +02:00
cybjit
f9a67db89b
change xrange to range
2014-09-07 18:36:12 +02:00
unknown
4ab48d1803
Python 3.4 compatibility + tests
2014-09-04 09:36:19 +02:00
Yusuke Shinyama
1ccfaff411
String-Bytes distinction (first attempt).
2014-06-30 19:05:56 +09:00
Yusuke Shinyama
8791355e1d
Cleanup imports. Use relative imports.
2014-06-26 18:12:39 +09:00
Yusuke Shinyama
fe86b4e64e
Changed: StringIO -> io.BytesIO
2014-06-25 19:55:41 +09:00
Yusuke Shinyama
bb866ae148
Changed: new except syntax (2.6 or above).
2014-06-16 18:50:07 +09:00
Yusuke Shinyama
28e96ba3d0
Use print as a function.
2014-06-15 12:14:33 +09:00
Yusuke Shinyama
340387bfc6
Cleanup: isinstance
2014-03-28 17:50:59 +09:00
Yusuke Shinyama
c97ec3048e
Changed / to // for clarity.
2013-11-26 21:35:16 +09:00
Yusuke Shinyama
2b56b2eedf
Merged.
2013-11-07 19:50:41 +09:00
Matthew Duggan
2caa5edc25
PEP8: Whitespace changes to match pep8
2013-11-07 17:35:04 +09:00
Matthew Duggan
c1da8b835c
PEP8: Remove trailing whitespace
2013-11-07 16:14:53 +09:00
Matthew Duggan
10a68c83bd
Remove unused imports identified by pyflakes
2013-11-07 16:09:44 +09:00
Yusuke Shinyama
0ea08890d4
renamed: python2 -> python.
2013-10-17 23:05:27 +09:00
Yusuke Shinyama
de9f9715e3
Added: Adobe-UCS
2013-10-17 21:35:25 +09:00
Rico
6e4f36d9a1
get width based on utf-8 char.
...
fills some gaps and fixes inconsistencies between standard encodings
2011-07-23 16:34:11 +02:00
Yusuke Shinyama
18e782f330
canonicalize package names
2011-03-02 23:43:03 +09:00
Yusuke Shinyama
109aedeb43
cfffont extension with no luck
2011-01-25 00:19:07 +09:00
yusuke.shinyama.dummy
f4ced29713
bugfix by Kevin Brubeck Unhammer
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@278 1aa58f4a-7d42-0410-adbc-911cccaed67c
2010-12-25 08:40:45 +00:00
yusuke.shinyama.dummy
9f78915ea6
show cid for unknown characters
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@275 1aa58f4a-7d42-0410-adbc-911cccaed67c
2010-11-23 10:53:19 +00:00