Martin Wolf
26f80715ed
Merge remote-tracking branch 'upstream/master'
2018-06-20 13:27:18 +02:00
Tata Ganesh
ac8bb81c72
DOCS: Update Readme.md
2018-06-17 22:37:32 +05:30
Tata Ganesh
67bc581bd3
Merge pull request #134 from timb07/issue_90
...
FIX: TypeError caused by bug in _parse_comment; #90 #89 #109
2018-06-14 09:27:34 +05:30
Tata Ganesh
7084d81bd1
Merge pull request #129 from clustree/xml-color
...
FEAT: Send color to XML conversion
2018-06-10 21:02:34 +05:30
Tata Ganesh
72d6e93045
FIX: Removed python 2.6 build
...
- Python 2.6 is no longer supported, and some PRs are failing the Python 2.6 check
2018-06-10 19:52:33 +05:30
Tata Ganesh
653b4f33ee
Merge pull request #139 from gvmori/py2fontfix
...
FIX: Only check for bytes input to enc() in python3
Note - For python2, no checks have been put in place
2018-06-10 16:51:23 +05:30
Tata Ganesh
5550b6a7df
Merge pull request #135 from AndydeCleyre/bugfix/big_P_is_for_Password
...
FIX: -P is not for page numbers, but passwords; reflect that in the help text
2018-06-10 16:05:09 +05:30
Martin Wolf
4bdb3ba8cc
Fixes needed to be able to compile pdfminer.six with Cython
2018-04-12 00:05:38 +02:00
Tim Bell
1cbeaebfce
Fix Python 2.6 incompatibility
2018-04-11 10:34:15 +10:00
Tim Bell
f87bd1f171
Add sortedcontainers to TravisCI config
2018-04-11 10:18:52 +10:00
Tim Bell
0c8cf748fe
Fix copy-paste error
2018-04-11 10:15:32 +10:00
Tim Bell
8f8a78bb88
Remove now-unused csort()
2018-04-11 09:37:32 +10:00
Tim Bell
2dda2b12b4
Speedup layout with .sort() and sortedcontainers.SortedListWithKey()
2018-04-11 09:03:32 +10:00
Gregory Mori
335c25c045
only check for bytes input to enc() in python3
...
In python2, isinstance("", bytes) is true, causing enc() to
suppress any string input. This results in fontnames being lost
when running pdf2txt.py in python2.
As this check was not present in the original python2 version of
pdfminer, restrict it to only check when running in python3.
2018-04-09 12:21:59 -07:00
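
The version-dependent check described in commit 335c25c045 can be sketched roughly as follows. This is only an illustration: `enc_sketch` and `is_py3_bytes` are hypothetical names, and the real enc() does more than this stand-in (it is assumed here to simply pass strings through).

```python
import sys


def is_py3_bytes(value):
    """Hypothetical helper: True only for bytes input under Python 3.

    On Python 2, str is bytes, so isinstance("", bytes) is True there and
    the check has to be skipped to avoid suppressing ordinary strings.
    """
    return sys.version_info[0] >= 3 and isinstance(value, bytes)


def enc_sketch(value):
    """Illustrative stand-in for enc(), not the library's implementation."""
    if is_py3_bytes(value):
        # Bytes input is suppressed, but only on Python 3.
        return ""
    return value
```

With this guard, a font name passed as a native string on Python 2 is no longer dropped, which is the behaviour the commit message describes.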
Andy Kluger
ed7d8308d9
-P is *not* for page numbers, but passwords, so reflect that in the help text
2018-04-03 12:26:01 -04:00
Tim Bell
981e3a575e
Fix TypeError caused by bug in _parse_comment; #90 #89 #109
2018-04-03 12:47:40 +10:00
Tim Bell
083f11b165
Fix cases where a bytearray doesn't work in place of bytes
2018-04-03 07:27:29 +10:00
Tim Bell
185ddeb2ab
Speed up handling of PDFs with large images with more minimal change
2018-04-03 07:21:21 +10:00
Tim Bell
fab1c9462c
Speed up handling of PDFs with large images
2018-03-29 14:21:31 +11:00
Tata Ganesh
eddf861fbd
Merge pull request #125 from yosida95/bytes-type
...
Fix type of an argument to PDFFont#decode to bytes in py3
2018-03-19 11:00:10 +05:30
Tata Ganesh
a66f1285ba
Merge pull request #124 from yosida95/except-stmt
...
Use except Exception as e clause for Python 3 compatibility
2018-03-19 10:44:35 +05:30
Tata Ganesh
fa9ccea38f
Merge pull request #121 from KOLANICH/2BoxVector
...
Added a vector between 2 boxes which may be useful for users of the library
2018-03-19 10:44:02 +05:30
Quentin Pradet
0911703eba
pdfcolor: Fix Python 2.6 compatibility
2018-03-06 14:53:11 +04:00
Quentin Pradet
94f3d61bb2
converter: Fix XML syntax
2018-03-06 14:41:52 +04:00
Quentin Pradet
2231f0892e
Send non-stroke color to XML conversion
...
Inspired by https://github.com/euske/pdfminer/pull/158 from @andruo11
and https://github.com/euske/pdfminer/pull/197 from @staccatosound.
2018-03-06 14:11:48 +04:00
Quentin Pradet
b6c63bedc6
Make DeviceGray the default color as it should be
2018-03-06 11:24:07 +04:00
Quentin Pradet
0ce9a29f83
Fix colorspace determinism with OrderedDict
2018-03-06 11:23:32 +04:00
Kohei YOSHIDA
a636cbcfd4
fix type of an argument to PDFFont#decode to bytes in py3
2018-02-20 13:42:09 +09:00
Kohei YOSHIDA
baf3cd0c2c
use except Exception as e clause for Python 3 compatibility
2018-02-19 23:32:36 +09:00
KOLANICH
3bf3c97bbb
Added a vector between 2 boxes which may be useful for users of the library
2018-02-16 14:49:12 +00:00
Tata Ganesh
3e6cc20cb2
Merge pull request #96 from sschuberth/patch-1
...
TrueTypeFont: Check for enough data to unpack
2018-01-31 18:26:54 +05:30
Tata Ganesh
27abd17711
Merge pull request #106 from oculushut/master
...
Minor change to README file
2017-12-20 11:19:34 +05:30
oculushut
46d6e571eb
Update README.md
...
Adds specific location for HTML file containing more information on dumppdf.py command line tool.
2017-12-20 00:23:00 +00:00
oculushut
125bae23cc
Update README.md
...
Adds the specific location of the HTML file with more information on the pdf2txt.py command line tool.
2017-12-20 00:20:46 +00:00
Guglielmetti Philippe
6d3210d206
pdfdiff tool (and .spec files for compilation with pyinstaller)
2017-11-21 10:48:45 +01:00
ganeshtata
1b88575e79
FIX: Null character replaced by blank
...
- The presence of the character '\0' was causing an error with some PDFs.
- It has been fixed by replacing all occurrences of '\0' with ''.
2017-11-08 12:50:50 +05:30
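
A minimal sketch of the replacement described in commit 1b88575e79, assuming the text is available as an ordinary string. The helper name `strip_null_chars` is hypothetical and is not taken from the codebase.

```python
def strip_null_chars(text):
    """Hypothetical helper: remove every '\\0' character from text."""
    # Each occurrence of the null character is replaced with the empty string.
    return text.replace("\0", "")
```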
Sebastian Schuberth
fcd3e6ce00
Catch an error unpack might throw instead of checking the length before
2017-10-30 19:31:58 +01:00
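
The approach named in commit fcd3e6ce00, catching the error from unpack rather than checking the length first, might look like the sketch below. The function name `read_table_entry` and the 16-byte `>4sLLL` layout are assumptions made for illustration, not a copy of the TrueTypeFont code.

```python
import struct


def read_table_entry(data):
    """Hypothetical helper: unpack one 16-byte table directory entry.

    Returns a (name, checksum, offset, length) tuple, or None when the
    buffer does not hold exactly the 16 bytes the format requires.
    """
    try:
        return struct.unpack(">4sLLL", data)
    except struct.error:
        # Truncated or oversized input: report "no entry" instead of raising.
        return None
```

Letting struct.unpack raise keeps the size requirement in one place (the format string) instead of repeating it in a separate length check.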
Sebastian Schuberth
ec8530f6cf
Add a test for the previous fix
2017-10-16 12:35:16 +02:00
Sebastian Schuberth
39428fb4f0
TrueTypeFont: Check for enough data to unpack
...
Fixes https://github.com/euske/pdfminer/issues/96
and https://github.com/euske/pdfminer/issues/144 .
2017-10-16 12:35:04 +02:00
SUZUKI Masaya
d4118cf5e8
Enabled PDFDevice in the with statement ( #88 )
2017-08-18 08:15:04 +02:00
Peter Bittner
e39800f14c
Move package description into package docstring ( #87 )
...
Convert Windows/DOS line endings CR/LF to Unix LF (again!)
Add Python 3.6 to classifiers, update project URL
2017-08-18 08:13:15 +02:00
Venelin Stoykov
5ef5484bbe
Add tox configuration for easy local testing ( #85 )
2017-08-18 08:11:32 +02:00
Venelin Stoykov
171cdcc69d
Microoptimization for singlebyte fonts ( #84 )
...
Instead of a list comprehension that calls a function to get the integer value of each byte, convert the bytes directly to a bytearray, which is a more efficient structure for storing a list of bytes.
2017-08-18 08:10:27 +02:00
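
The difference described in commit 171cdcc69d can be illustrated with the sketch below. The function names are hypothetical, and the slow variant is only an assumed approximation of the earlier code.

```python
def codes_via_comprehension(data):
    """Assumed older style: one function call per byte."""
    # Slicing keeps each element a bytes object on both Python 2 and 3.
    return [ord(data[i:i + 1]) for i in range(len(data))]


def codes_via_bytearray(data):
    """Direct conversion: a bytearray yields integers when iterated."""
    return bytearray(data)
```

Both produce the same integer codes; the bytearray variant avoids the per-byte call.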
Venelin Stoykov
14de393d5e
Cleanup psparser ( #83 )
...
- Do not use bytesindex function. Use native slices instead
- Fix import ordering
2017-08-18 08:10:06 +02:00
Venelin Stoykov
496bfd0778
Remove leftover from removing shebangs ( #81 )
2017-08-18 08:09:00 +02:00
Venelin Stoykov
c2432c32f1
Fix assert message for PDFLayoutAnalyzer.end_page ( #80 )
...
stack is undefined
2017-08-18 08:08:08 +02:00
Philippe Guglielmetti
4c604828e8
v. 20170720
2017-07-20 21:35:49 +02:00
Philippe Guglielmetti
b010db6049
solves https://github.com/pdfminer/pdfminer.six/issues/65
2017-07-20 21:17:06 +02:00
Sergei Maertens
67bf5ab124
Compare byte with byte instead of int ( #78 )
2017-07-20 20:47:14 +02:00
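
The pitfall behind commit 67bf5ab124 is that indexing a bytes object gives an int on Python 3, so comparing it with a one-byte bytes literal is always False. A minimal sketch, with the hypothetical helper `starts_with_slash`:

```python
def starts_with_slash(data):
    """Hypothetical helper: compare a byte with a byte, not with an int."""
    # A one-element slice is a bytes object on Python 2 and 3 alike,
    # whereas data[0] is an int on Python 3.
    return data[0:1] == b"/"
```

Slicing also returns an empty bytes object for empty input instead of raising IndexError.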
Sergei Maertens
3e364354da
Fixes #64 -- be less strict when inspecting a tree type ( #76 )
...
In the PDFStream it's possible that the /Type element is not
present, but /type is. According to the spec, these are different
elements, but in the case in point they had the same meaning.
If PDFMiner is not running in STRICT mode and /Type doesn't resolve,
a fallback to /type is used to determine the tree type.
2017-07-20 20:46:35 +02:00
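
The fallback described in commit 3e364354da can be sketched with a plain dictionary standing in for the stream attributes. The function name `resolve_tree_type` and the `strict` parameter are hypothetical, used here only to mirror the STRICT mode mentioned in the message.

```python
def resolve_tree_type(attrs, strict=False):
    """Hypothetical helper: look up /Type, falling back to /type.

    attrs is assumed to be a dict of already-resolved stream attributes.
    """
    tree_type = attrs.get("Type")
    if tree_type is None and not strict:
        # Non-strict mode tolerates the lowercase key seen in some files.
        tree_type = attrs.get("type")
    return tree_type
```

In strict mode the lowercase key is ignored, so a stream that only carries /type resolves to None.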