Commit Graph

708 Commits (41d1efac7ef3ac71164eeb8200af39a2bfa09c59)

Author SHA1 Message Date
Tata Ganesh 41d1efac7e
Merge pull request #297 from pdfminer/revert-287-fix_color_graphicstate
Revert "Fix stoke color and non-stroke color in PDFGraphicState"
2019-10-12 21:19:58 +05:30
Tata Ganesh cc310d3d74
Revert "Fix stoke color and non-stroke color in PDFGraphicState" 2019-10-12 21:19:36 +05:30
Tata Ganesh f7056b482a
Merge pull request #287 from tongbaojia/fix_color_graphicstate
Fix stoke color and non-stroke color in PDFGraphicState
2019-10-12 21:19:31 +05:30
Tata Ganesh d6fd7e76b2
Merge pull request #286 from kaushikacharya/denis_article_name
Adding Denis's article name.
2019-09-08 18:33:56 +05:30
Tony Tong 2156efe64f fix stoke color and non-stroke color in PDFGraphicState 2019-08-22 10:24:51 -04:00
Kaushik Acharya 963a227b2e Updated URL for the article 2019-08-19 20:16:34 +05:30
Kaushik Acharya bfbb8b8f0b Adding Denis's article name. 2019-08-15 11:59:29 +05:30
Tata Ganesh 48b25939c2
Merge pull request #264 from fakabbir/pdfstream-as-cmap
Pdfstream as cmap
2019-07-31 22:20:37 +05:30
Fakabbir Amin f1a4dcea88 Adds Test Cases, Neater Code For CMap Assignment 2019-07-24 11:56:06 +05:30
Fakabbir Amin b4c261b647 Removes Code Comments 2019-07-17 11:43:45 +05:30
Fakabbir Amin fa400431f5 Adds Test, Removes Unnecessary Assumptions 2019-07-17 11:38:00 +05:30
Fakabbir Amin cc40af3d2b Removes @property, Adds docstring 2019-07-15 14:21:21 +05:30
Fakabbir Amin 8e4a82ad8b Corrects Indentation 2019-07-13 05:00:25 +05:30
Fakabbir Amin c022358c8d Encapsulates character map name 2019-07-13 04:52:24 +05:30
Andrew Baumann 6b312edd6a
Merge pull request #230 from 0xabu/unicode_glyph_bug
name2unicode(): handle hexadecimal literals for unicode glyphs in text extraction

Unicode literals are hex, not decimal (refer: https://github.com/adobe-type-tools/agl-specification). We are still far from conformance with the full spec, but this handles more PDFs seen in the wild.
2019-07-09 12:35:39 -07:00
Andrew Baumann c4c0a36e4f name2unicode(): handle hexadecimal constants for unicode glyphs
fixes #183, #229
2019-02-25 13:08:04 -08:00
John Kesegich 8ab2e287be Handle PDFStream as character map name in PDFCIDFont 2019-02-25 11:42:30 -06:00
ganeshtata b6a5848208 FEAT: Release 20181108 2018-11-08 22:37:11 +05:30
ganeshtata 90d61f2a3a DOCS: changelog for version 20181108 added 2018-11-08 22:08:17 +05:30
Tata Ganesh e03ecab856
Merge pull request #141 from timb07/speedup_layout
Speed up layout of text boxes
2018-11-08 20:28:40 +05:30
Tata Ganesh 1ea9446bd6
Merge pull request #173 from wlbentley/pdf2txt-maketheparser
Move argparse object setup out of pdf2txt.py main to separate function.
2018-08-21 23:40:50 +05:30
Wm Bentley 495c92e050 Move argparse object setup out of main to separate function.
As preparation for implementing Sphinx documentation, create a
separate function that builds and returns the argparse parser.
Move import argparse out of main to the top of the file.
2018-08-12 21:07:52 -07:00
Tata Ganesh 259b29299e
Merge pull request #133 from timb07/speedup
Speed up handling of PDFs with large images
2018-07-15 11:27:35 +05:30
Tata Ganesh d437e5c637
Merge pull request #142 from mawoqiw/master
Fixes needed to be able to compile pdfminer.six with Cython
2018-07-14 18:02:29 +05:30
Martin Wolf edaf2c9e3f move unittest to main() 2018-06-26 00:51:51 +02:00
Martin Wolf 603a927e6c remove pygame from dependencies, it is actually not required to compile with Cython 2018-06-26 00:50:45 +02:00
Martin Wolf eff3f19886 Merge remote-tracking branch 'upstream/master' 2018-06-25 23:32:52 +02:00
Tata Ganesh 9c7bdcc716
Merge pull request #157 from h2ri/master
decode cid: 160 and 173 to spaces
2018-06-25 11:19:27 +05:30
Tata Ganesh 07eafe7b27
Merge pull request #160 from charlesreid1/fix-line-endings
apply dos2unix to files in pdfminer/tools dirs to remove \r\n windows line endings
2018-06-25 11:19:14 +05:30
Charles Reid 7b08cdbff9 apply dos2unix to files in pdfminer/ and tools/ to remove \r\n windows line endings 2018-06-21 12:19:48 -07:00
Goulu 1db260609e
render_string must have 5 params in all PDFDevice classes (#158) 2018-06-21 10:21:26 +02:00
Guglielmetti Philippe 70624a64dd render_string() now takes 3 parameters, not 5 (reverted from commit 95b65536af) 2018-06-21 09:49:45 +02:00
Guglielmetti Philippe 95b65536af render_string() now takes 3 parameters, not 5 2018-06-21 09:28:55 +02:00
Healthi 65eb0cef82 decode cid: 160 and 170 to spaces 2018-06-20 17:17:03 +05:30
Martin Wolf 26f80715ed Merge remote-tracking branch 'upstream/master' 2018-06-20 13:27:18 +02:00
Tata Ganesh ac8bb81c72
DOCS: Update Readme.md 2018-06-17 22:37:32 +05:30
Tata Ganesh 67bc581bd3
Merge pull request #134 from timb07/issue_90
FIX: TypeError caused by bug in _parse_comment; #90 #89 #109
2018-06-14 09:27:34 +05:30
Tata Ganesh 7084d81bd1
Merge pull request #129 from clustree/xml-color
FEAT: Send color to XML conversion
2018-06-10 21:02:34 +05:30
Tata Ganesh 72d6e93045
FIX: Removed python 2.6 build
- python 2.6 is being no longer supported, and some of the PRs are failing the python 2.6 check
2018-06-10 19:52:33 +05:30
Tata Ganesh 653b4f33ee
Merge pull request #139 from gvmori/py2fontfix
FIX: Only check for bytes input to enc() in python3
Note - For python2, no checks have been put in  place
2018-06-10 16:51:23 +05:30
Tata Ganesh 5550b6a7df
Merge pull request #135 from AndydeCleyre/bugfix/big_P_is_for_Password
FIX: -P is not for page numbers, but passwords; reflect that in the help text
2018-06-10 16:05:09 +05:30
Martin Wolf 4bdb3ba8cc Fixes needed to be able to compile pdfminer.six with Cython 2018-04-12 00:05:38 +02:00
Tim Bell 1cbeaebfce Fix Python 2.6 incompatibility 2018-04-11 10:34:15 +10:00
Tim Bell f87bd1f171 Add sortedcontainers to TravisCI config 2018-04-11 10:18:52 +10:00
Tim Bell 0c8cf748fe Fix copy-paste error 2018-04-11 10:15:32 +10:00
Tim Bell 8f8a78bb88 Remove now-unused csort() 2018-04-11 09:37:32 +10:00
Tim Bell 2dda2b12b4 Speedup layout with .sort() and sortedcontainers.SortedListWithKey() 2018-04-11 09:03:32 +10:00
Gregory Mori 335c25c045 only check for bytes input to enc() in python3
In python2, isinstance("", bytes) is true, causing enc() to
suppress any string input. This results in fontnames being lost
when running pdf2txt.py in python2.

As this check was not present in the original python2 version of
pdfminer, restrict it to only check when running in python3.
2018-04-09 12:21:59 -07:00
Andy Kluger ed7d8308d9 -P is *not* for page numbers, but passwords, so reflect that in the help text 2018-04-03 12:26:01 -04:00
Tim Bell 981e3a575e Fix TypeError caused by bug in _parse_comment; #90 #89 #109 2018-04-03 12:47:40 +10:00