Yusuke Shinyama
|
dc8fde0e47
|
added CCITTFaxFilter support and a very crude image extraction.
|
2011-07-18 21:07:00 +10:00 |
Yusuke Shinyama
|
170c97a12b
|
colorspace patch by Lieb Simon
|
2011-06-06 17:10:12 +09:00 |
Yusuke Shinyama
|
0c41b8348e
|
code cleanup
|
2011-05-14 15:51:40 +09:00 |
Yusuke Shinyama
|
038ce4cd0c
|
added LTText.get_text() and .text property is no longer accessible.
|
2011-05-14 15:45:08 +09:00 |
Yusuke Shinyama
|
095534b294
|
figure object now does not call analyze.
|
2011-05-14 14:17:22 +09:00 |
Yusuke Shinyama
|
0e660dd385
|
rename: LTPolygon -> LTCurve
|
2011-04-20 22:05:25 +09:00 |
Yusuke Shinyama
|
dab70855bf
|
LTLine is now strictly horizontal or vertical.
|
2011-04-20 22:01:54 +09:00 |
Jonathan J Hunt
|
ec682539da
|
Optimized memory usage in TextConverter by ignoring all drawing commands.
|
2011-03-07 15:11:31 +10:00 |
Yusuke Shinyama
|
7dbb664db3
|
code cleanup and more debugging options
|
2011-02-14 23:42:05 +09:00 |
Yusuke Shinyama
|
b2d13db29a
|
code cleanup
|
2011-02-14 22:51:20 +09:00 |
Yusuke Shinyama
|
4eb6083c09
|
code cleanup
|
2011-01-03 18:11:22 +09:00 |
Yusuke Shinyama
|
3da3adad9b
|
method renamed: finish(self) -> analyze(self, laparams).
|
2010-12-26 16:56:21 +09:00 |
yusuke.shinyama.dummy
|
84ed94aec0
|
another bugfix
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@281 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-12-25 08:41:03 +00:00 |
yusuke.shinyama.dummy
|
9bba7ac08b
|
oops, forgot to fix this
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@280 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-12-25 08:40:58 +00:00 |
yusuke.shinyama.dummy
|
9f78915ea6
|
show cid for unknown characters
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@275 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-23 10:53:19 +00:00 |
yusuke.shinyama.dummy
|
7374b81383
|
htmlconverter improved
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@274 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-14 15:04:28 +00:00 |
yusuke.shinyama.dummy
|
fb4ce96309
|
add font-family
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@273 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-14 10:07:50 +00:00 |
yusuke.shinyama.dummy
|
476ecf7e32
|
add html exect layout mode; default changed.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@272 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-14 10:07:41 +00:00 |
yusuke.shinyama.dummy
|
9584845358
|
layout analysis improved
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@268 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-09 10:40:05 +00:00 |
yusuke.shinyama.dummy
|
edbd3764a7
|
html layout output fix
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@267 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-09 10:39:48 +00:00 |
yusuke.shinyama.dummy
|
509ab66319
|
stay with python2
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@264 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-10-19 09:57:01 +00:00 |
yusuke.shinyama.dummy
|
cc139db8a7
|
bugfix LTChar.is_vertical undefined. verticality is now handled by LTTextBox
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@254 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-10-17 05:15:23 +00:00 |
yusuke.shinyama.dummy
|
3305c07ba2
|
layout analysis improved
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@245 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-10-17 05:13:39 +00:00 |
yusuke.shinyama.dummy
|
cf52476f5e
|
remove redundancy
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@221 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-06-06 05:16:21 +00:00 |
yusuke.shinyama.dummy
|
fe3bdbfce0
|
text rise support added
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@217 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-05-18 14:57:04 +00:00 |
yusuke.shinyama.dummy
|
eb535d4106
|
change PDFPageAggregator -> PDFLayoutAnalyzer
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@213 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-24 13:31:21 +00:00 |
yusuke.shinyama.dummy
|
833f859449
|
move TagExtractor
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@212 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-24 13:31:11 +00:00 |
yusuke.shinyama.dummy
|
97848409e5
|
fix xobject resources bug, thanks to Jose Maria
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@209 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-24 04:32:03 +00:00 |
yusuke.shinyama.dummy
|
e77a6ba997
|
-A (all_texts) option added for layout analysis
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@205 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-10 11:30:03 +00:00 |
yusuke.shinyama.dummy
|
609c6e1f5f
|
rename: LayoutItem -> LTItem, LayoutContainer -> LTContainer
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@203 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-10 11:29:30 +00:00 |
yusuke.shinyama.dummy
|
c81142aa44
|
image handling addition (untested)
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@202 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-10 11:05:02 +00:00 |
yusuke.shinyama.dummy
|
5f822f6dcb
|
improved layout analysis.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@197 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-03-26 11:11:35 +00:00 |
yusuke.shinyama.dummy
|
cd39642abe
|
code cleanup
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@188 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-03-22 04:00:18 +00:00 |
yusuke.shinyama.dummy
|
e01cb43e31
|
add novel layout analysis
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@187 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-03-21 02:21:37 +00:00 |
yusuke.shinyama.dummy
|
ffaaea0bac
|
layout analysis changed drastically.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@186 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-03-20 05:43:34 +00:00 |
yusuke.shinyama.dummy
|
23be96c49e
|
CAUTION! changed the way of internal layout handling.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@184 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-02-27 03:59:25 +00:00 |
yusuke.shinyama.dummy
|
2555b38836
|
fix typos (patches by sm)
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@183 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-02-15 14:50:19 +00:00 |
yusuke.shinyama.dummy
|
0424fd8dc9
|
incorporated some patches by Andre Auzi
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@180 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-02-07 15:11:24 +00:00 |
yusuke.shinyama.dummy
|
538a605ac0
|
several bugfixes.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@179 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-02-07 03:14:00 +00:00 |
yusuke.shinyama.dummy
|
dda60dcafc
|
integrate TODO html.
reorder the code bit.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@177 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-01-31 02:12:51 +00:00 |
yusuke.shinyama.dummy
|
0f8fe3f19e
|
Page rotation bug fixed.
Various minor fixes.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@176 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-01-31 02:09:28 +00:00 |
yusuke.shinyama.dummy
|
dc6e5c366d
|
jpeg extraction support added.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@174 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-01-30 07:30:01 +00:00 |
yusuke.shinyama.dummy
|
a63d0324ed
|
version bump
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@171 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-01-04 12:50:59 +00:00 |
yusuke.shinyama.dummy
|
6590ad42f5
|
experimental polygon extraction.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@166 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2009-12-20 02:38:01 +00:00 |
yusuke.shinyama.dummy
|
e4b089e327
|
include cmap
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@162 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2009-12-19 14:17:00 +00:00 |
yusuke.shinyama.dummy
|
77986b8273
|
fix CMapDB initialization stuff. more code cleanup.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@148 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2009-11-03 13:39:34 +00:00 |
yusuke.shinyama.dummy
|
3dd4f1668b
|
source code tidy up
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@147 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2009-11-03 01:27:30 +00:00 |
yusuke.shinyama.dummy
|
78f7866554
|
sgml to xml
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@146 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2009-10-31 03:04:56 +00:00 |
yusuke.shinyama.dummy
|
7790808560
|
to 4-space indentation
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@142 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2009-10-24 04:41:59 +00:00 |
yusuke.shinyama.dummy
|
3da04c0a04
|
rectangle handling bug fixed
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@132 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2009-09-12 02:37:47 +00:00 |