897 Commits (c2e516d6df3708d8fcc10b46ddcda2a50198539e)

Author SHA1 Message Date
D.A.Bashkirtsev 4df6d4e5ca Changed: comparations for image colorspace literals (#132)
Fixes #131 

Changed: comparations for image colorspace literals
Added: test for extracting images from pdfs
2019-10-15 16:11:54 +02:00
Pieter Marsman 63b2e09ac3
Merge pull request #203 from jbarlow83/negative-descent
Interpret font Descent as a negative number even if specified as positive
2019-10-13 20:06:52 +02:00
Pieter Marsman eae70b999c
Merge pull request #219 from FelixSchwarz/fixed-wheel-requirements
use conditional requirements to ensure "chardet" is always required on Python 3
2019-10-13 19:57:05 +02:00
Tata Ganesh c6ad95ea87
Merge pull request #298 from tongbaojia/fix_color_graphicstate_dev
Fix stoke color and non-stroke color in PDFGraphicState
2019-10-13 19:27:09 +05:30
Tony Tong 106a09c5bb fix stoke color and non-stroke color in PDFGraphicState 2019-10-12 17:35:46 -04:00
Tata Ganesh f218996fe9
Merge pull request #273 from igormp/develop
Use resolve_all on PdfFont widths and bbox
2019-10-12 21:24:29 +05:30
Tata Ganesh f53fbd98b1
Merge pull request #283 from fakabbir/pdfstream-as-cmap
Pdfstream as cmap
2019-10-12 21:22:52 +05:30
Tata Ganesh 41d1efac7e
Merge pull request #297 from pdfminer/revert-287-fix_color_graphicstate
Revert "Fix stoke color and non-stroke color in PDFGraphicState"
2019-10-12 21:19:58 +05:30
Tata Ganesh cc310d3d74
Revert "Fix stoke color and non-stroke color in PDFGraphicState" 2019-10-12 21:19:36 +05:30
Tata Ganesh f7056b482a
Merge pull request #287 from tongbaojia/fix_color_graphicstate
Fix stoke color and non-stroke color in PDFGraphicState
2019-10-12 21:19:31 +05:30
Tata Ganesh d6fd7e76b2
Merge pull request #286 from kaushikacharya/denis_article_name
Adding Denis's article name.
2019-09-08 18:33:56 +05:30
Tony Tong 2156efe64f fix stoke color and non-stroke color in PDFGraphicState 2019-08-22 10:24:51 -04:00
Fakabbir Amin 7c03d96d25 Corrects Comment 2019-08-20 17:16:10 +05:30
Fakabbir Amin abd685fdc6 Corrects Code Comment 2019-08-20 17:13:27 +05:30
Fakabbir Amin 3d549ea48c Removes code comments 2019-08-20 16:48:40 +05:30
Kaushik Acharya 963a227b2e Updated URL for the article 2019-08-19 20:16:34 +05:30
Igor Moura cf4641d877
Merge branch 'develop' into develop 2019-08-15 08:11:28 -03:00
Kaushik Acharya bfbb8b8f0b Adding Denis's article name. 2019-08-15 11:59:29 +05:30
Fakabbir Amin 3f0f05def6 Merge branch 'pdfstream-as-cmap' of https://github.com/fakabbir/pdfminer.six into pdfstream-as-cmap 2019-08-10 11:04:10 +05:30
Fakabbir Amin 3125d3634a Correct old test cases 2019-08-10 11:03:28 +05:30
Fakabbir Amin fe38695739
Merge branch 'develop' into pdfstream-as-cmap 2019-08-10 10:44:31 +05:30
Fakabbir Amin 5b210981c9 Adds Test Case 2019-08-10 10:19:20 +05:30
Fakabbir Amin 5a0d8db052 Adds decoder for OnebyteIdentityH/V instead of using default CMap 2019-08-10 10:07:23 +05:30
Tata Ganesh 48b25939c2
Merge pull request #264 from fakabbir/pdfstream-as-cmap
Pdfstream as cmap
2019-07-31 22:20:37 +05:30
Tata Ganesh 42e2c8143b
Merge pull request #263 from pietermarsman/261-glyph-list-specification
name2unicode() should follow the Adobe Glyph List Specification
2019-07-26 22:13:34 +05:30
Igor Moura 4eb6f769ea Merge branch 'develop' of github.com:pdfminer/pdfminer.six into develop 2019-07-24 15:16:12 -03:00
Igor Moura 2f4518231f Use resolve_all on PdfFont widths and bbox
Fixes #268
2019-07-24 15:10:13 -03:00
Tata Ganesh 17364aa88e
Merge pull request #274 from igormp/iteritems
Replaced .iteritems() with six.iteritems() for Python 2 and 3 compat
2019-07-24 23:16:25 +05:30
Igor Moura 540df9f676 Replaced .iteritems() and with six.iteritems() for Python 3 compat
This is a squashed commit, the previous messages can be seen bellow

This is the 1st commit message:

Replaced .iteritems() usage for .items()

Fixed some python 2 leftovers, as discussed in #267. Also formatted code according to Black.
This possibly breaks some python 2 compatibility

This is the commit message #2:

Reverted formatting and more spread six usage
2019-07-24 14:08:30 -03:00
Fakabbir Amin f1a4dcea88 Adds Test Cases, Neater Code For CMap Assignment 2019-07-24 11:56:06 +05:30
Fakabbir Amin b4c261b647 Removes Code Comments 2019-07-17 11:43:45 +05:30
Fakabbir Amin fa400431f5 Adds Test, Removes Unnecessary Assumptions 2019-07-17 11:38:00 +05:30
Pieter Marsman 6f362f53fe Raise a `KeyError` with a useful message if `unicode2name()` does not match any glyph name. Use this message to log debug statements. 2019-07-16 08:52:24 +02:00
Pieter Marsman 0fb83366b6 Remove intermediate variable `full_stop` because it is just a dot 2019-07-16 08:49:57 +02:00
Fakabbir Amin cc40af3d2b Removes @property, Adds docstring 2019-07-15 14:21:21 +05:30
Pieter Marsman 2bb850cdae Fix error, python2 cannot handle unicode in a .py file 2019-07-14 15:43:07 +02:00
Pieter Marsman 1e24bfa0bd Fix error, python2 cannot handle unicode in a .py file 2019-07-14 15:40:22 +02:00
Pieter Marsman c597e95a9f Use KeyError to signal that the name does not resemble any unicode, this pattern is also used in the rest of pdfminer.six 2019-07-14 15:37:15 +02:00
Pieter Marsman fdb7e54862 Add lowercase adobe glyph name tests 2019-07-14 15:20:25 +02:00
Pieter Marsman 33cc9861ae Add docstring to Type1FontHeaderParser.get_encoding() that describes that the custom CharStrings of the font are mapped to '' 2019-07-14 15:19:17 +02:00
Pieter Marsman f0392f8049 Change implementation of name2unicode such that it follows the Adobe Glyph specs (with allowing lowercase) 2019-07-14 15:16:42 +02:00
Fakabbir Amin 8e4a82ad8b Corrects Indentation 2019-07-13 05:00:25 +05:30
Fakabbir Amin c022358c8d Encapsulates character map name 2019-07-13 04:52:24 +05:30
Pieter Marsman 5d7ac7e88a Added test for overflow error reported by @jtlz2: https://github.com/pdfminer/pdfminer.six/issues/177#issuecomment-510173228_ 2019-07-10 20:44:23 +02:00
Pieter Marsman ec5218a05f Add some (failing) unittests for name2unicode based on the examples in the Adobe Glyph List Specification 2019-07-10 20:35:42 +02:00
Tata Ganesh 127d587431
Merge pull request #227 from BackEndTea/patch-1
Remove self refference on python3
2019-07-10 23:12:30 +05:30
Tata Ganesh 578464ab00
Merge pull request #259 from pietermarsman/258-add-contribution-guidelines
Add contribution guidelines and refer to it in the readme
2019-07-10 23:10:25 +05:30
Pieter Marsman 5acfdd8f9b Add sentence about including pdf's in issues 2019-07-10 19:38:38 +02:00
Andrew Baumann 6b312edd6a
Merge pull request #230 from 0xabu/unicode_glyph_bug
name2unicode(): handle hexadecimal literals for unicode glyphs in text extraction

Unicode literals are hex, not decimal (refer: https://github.com/adobe-type-tools/agl-specification). We are still far from conformance with the full spec, but this handles more PDFs seen in the wild.
2019-07-09 12:35:39 -07:00
Pieter Marsman 2743f2b20a Add reference to contribution guidelines in README.md 2019-07-08 23:05:47 +02:00
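
The message for 6b312edd6a notes that Unicode literals in glyph names are hex, not decimal. Below is a minimal sketch of that distinction, assuming the `uniXXXX` and `uXXXX` name forms described in the Adobe Glyph List Specification. `glyph_name_to_text` is a hypothetical helper written for illustration; it is not pdfminer.six's `name2unicode()`, and it skips parts of the spec such as glyph-list lookups, name suffixes, and surrogate range checks.

```python
import re

# Hypothetical patterns for illustration only (not taken from pdfminer.six).
# "uni" followed by one or more groups of exactly four uppercase hex digits.
_UNI_PATTERN = re.compile(r"uni((?:[0-9A-F]{4})+)")
# "u" followed by four to six uppercase hex digits.
_U_PATTERN = re.compile(r"u([0-9A-F]{4,6})")


def glyph_name_to_text(name):
    """Map a 'uniXXXX' or 'uXXXX' glyph name to text, reading digits as hex.

    Raises KeyError when the name matches neither form, mirroring the
    KeyError convention mentioned in the commits above.
    """
    match = _UNI_PATTERN.fullmatch(name)
    if match:
        digits = match.group(1)
        # Each group of four hex digits is one code point.
        return "".join(
            chr(int(digits[i:i + 4], 16)) for i in range(0, len(digits), 4)
        )

    match = _U_PATTERN.fullmatch(name)
    if match:
        return chr(int(match.group(1), 16))

    raise KeyError("Cannot interpret %r as a unicode glyph name" % name)


if __name__ == "__main__":
    # Hex reading: 0x0041 is 65, which is "A".
    print(glyph_name_to_text("uni0041"))
    # A decimal reading of the same digits gives code point 41 instead.
    print(chr(int("0041", 10)))
```

Reading `0041` as decimal yields code point 41, which is a closing parenthesis rather than the intended letter. That is the kind of mismatch the fix in 6b312edd6a addresses.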