Yusuke Shinyama
|
557c2c72e6
|
Removed ObjIdRange for terseness.
|
2013-10-10 18:34:43 +09:00 |
Yusuke Shinyama
|
2221163b94
|
Split pdfparser.py and pdfdocument.py.
|
2013-10-10 18:29:30 +09:00 |
Yusuke Shinyama
|
1467fc674c
|
Added fallback for broken PDFs.
|
2013-10-09 22:45:54 +09:00 |
Yusuke Shinyama
|
eabe72ee63
|
Prevent crash with empty layout box.
|
2013-10-09 22:13:22 +09:00 |
Yusuke Shinyama
|
87143cb36f
|
Fallback when /Pages does not exist.
|
2013-10-09 22:08:16 +09:00 |
Yusuke Shinyama
|
06425bba00
|
Introducing PDFObjectNotFound
|
2013-10-09 21:39:23 +09:00 |
Yusuke Shinyama
|
3c3cba2ecc
|
Moved: import PIL.
|
2013-04-09 18:42:32 +09:00 |
Yusuke Shinyama
|
19e7d70ac1
|
Merge pull request #15 from jcushman/patch-1
2x faster layout analysis: Use set instead of list for Plane's internal collection of objects.
|
2013-04-09 02:39:46 -07:00 |
Yusuke Shinyama
|
4faccff9c9
|
Merge pull request #16 from jcushman/master
2x faster group_textboxes function.
|
2013-04-09 01:58:56 -07:00 |
Yusuke Shinyama
|
d8bc13b3af
|
Merge pull request #13 from gendoc/master
PDFDocument.lookup_name.lookup isn't searching for 'Names' key.
|
2013-04-09 01:55:54 -07:00 |
jcushman
|
f77f196cd3
|
2x faster group_textboxes function.
|
2012-06-22 18:11:45 -03:00 |
jcushman
|
da3f023b2d
|
Use set instead of list for Plane's internal collection of objects.
|
2012-06-22 16:36:33 -03:00 |
Humberto Pereira
|
89c81db295
|
PDFDocument.lookup_names.lookup didn't find 'Names' in some files
|
2012-03-19 16:42:58 -03:00 |
Jim Morrison
|
6413eb7de4
|
Deal with CMYK images by converting them to RGB. PIL does not invert CMYK images as of PIL 1.1.7, so the invert happens in ImageWriter.
|
2012-01-24 16:18:36 -08:00 |
Yusuke Shinyama
|
c7709045e9
|
fixed: invalid bmp file output
|
2011-11-08 00:29:24 +10:00 |
Yusuke Shinyama
|
82ff98c7b3
|
imagewriter now works with text output
|
2011-11-07 01:15:10 +10:00 |
Yusuke Shinyama
|
91174b5665
|
avoid crash when colorspace is null.
|
2011-11-06 20:10:48 +10:00 |
Yusuke Shinyama
|
3d1652963a
|
Merge github.com:euske/pdfminer
|
2011-10-30 15:44:49 +10:00 |
dwilson
|
60dbf6bb69
|
avoids crash in pdf syntax error for missing ids
when an object id is out of range, rather than crashing, only raise a
pdf syntax error if STRICT is enabled and return None otherwise
|
2011-08-31 17:03:10 -04:00 |
Yusuke Shinyama
|
f638784e1d
|
experimental layout analysis improvements
|
2011-08-14 09:44:21 +09:00 |
Yusuke Shinyama
|
cbb8d869c7
|
removed initial cmap/ directory
|
2011-07-31 18:05:07 +10:00 |
Yusuke Shinyama
|
cdef0d7883
|
Merge github.com:euske/pdfminer
|
2011-07-31 17:47:20 +10:00 |
Yusuke Shinyama
|
46bb0107aa
|
fixed: crash due to small layout elements (thanks to hsoft)
|
2011-07-31 17:44:09 +10:00 |
Yusuke Shinyama
|
eec317ae10
|
Merge pull request #6 from rsennrich/master
cleaner widths for Adobe core 14 fonts. (thanks to rsennrich)
|
2011-07-31 00:39:36 -07:00 |
Yusuke Shinyama
|
24cd161fb7
|
CCITTFaxFilter.reversed fix
|
2011-07-31 17:36:02 +10:00 |
Rico
|
6e4f36d9a1
|
get width based on utf-8 char.
fills some gaps and fixes inconsistencies between standard encodings
|
2011-07-23 16:34:11 +02:00 |
Yusuke Shinyama
|
dc8fde0e47
|
added CCITTFaxFilter support and a very crude image extraction.
|
2011-07-18 21:07:00 +10:00 |
Yusuke Shinyama
|
2707ba75df
|
added CCITTFaxFilter support and a very crude image extraction.
|
2011-07-18 21:06:50 +10:00 |
Yusuke Shinyama
|
fda6f7ba5d
|
ccitt.py added.
|
2011-07-18 17:36:37 +10:00 |
Yusuke Shinyama
|
0278076ea8
|
PNG predictor added
|
2011-06-07 00:46:33 +09:00 |
Yusuke Shinyama
|
18a5058af6
|
separated predictor functions.
|
2011-06-07 00:31:03 +09:00 |
Yusuke Shinyama
|
170c97a12b
|
colorspace patch by Lieb Simon
|
2011-06-06 17:10:12 +09:00 |
Yusuke Shinyama
|
2e8180ddee
|
documentation update and version bump
|
2011-05-15 01:37:14 +09:00 |
Yusuke Shinyama
|
c134596e2f
|
code cleanup and testcase stabilization
|
2011-05-15 01:22:19 +09:00 |
Yusuke Shinyama
|
e5d02f8653
|
fixed the infinite recursion bug.
|
2011-05-14 16:32:09 +09:00 |
Yusuke Shinyama
|
0c41b8348e
|
code cleanup
|
2011-05-14 15:51:40 +09:00 |
Yusuke Shinyama
|
038ce4cd0c
|
added LTText.get_text() and .text property is no longer accessible.
|
2011-05-14 15:45:08 +09:00 |
Yusuke Shinyama
|
5004e4b28d
|
layout analysis speedup.
|
2011-05-14 14:17:39 +09:00 |
Yusuke Shinyama
|
095534b294
|
figure object now does not call analyze.
|
2011-05-14 14:17:22 +09:00 |
Yusuke Shinyama
|
b8d516fc52
|
extended Plane class.
|
2011-05-14 14:16:40 +09:00 |
Yusuke Shinyama
|
fcf0d74ecc
|
tweaks for debugging
|
2011-04-21 22:07:52 +09:00 |
Yusuke Shinyama
|
8f9684f6a6
|
code cleanup: layout analysis
|
2011-04-21 22:07:04 +09:00 |
Yusuke Shinyama
|
0e660dd385
|
rename: LTPolygon -> LTCurve
|
2011-04-20 22:05:25 +09:00 |
Yusuke Shinyama
|
dab70855bf
|
LTLine is now strictly horizontal or vertical.
|
2011-04-20 22:01:54 +09:00 |
Jonathan J Hunt
|
ec682539da
|
Optimized memory usage in TextConverter by ignoring all drawing commands.
|
2011-03-07 15:11:31 +10:00 |
Yusuke Shinyama
|
4918d59bc2
|
disable caching support
|
2011-03-03 00:04:43 +09:00 |
Yusuke Shinyama
|
18e782f330
|
canonicalize package names
|
2011-03-02 23:43:03 +09:00 |
Yusuke Shinyama
|
bb26cf9180
|
eliminate empty textboxes
|
2011-03-01 20:47:20 +09:00 |
Yusuke Shinyama
|
dfd621b98c
|
minor bugfix. thanks to Hiroshi Manabe.
|
2011-02-28 19:50:07 +09:00 |
Yusuke Shinyama
|
f22b056454
|
release-20110227
|
2011-02-27 19:53:12 +09:00 |