unknown
|
29c07ea770
|
Python 3.4 support and tests
|
2014-09-03 15:26:08 +02:00 |
Yusuke Shinyama
|
8791355e1d
|
Cleanup imports. Use relative imports.
|
2014-06-26 18:12:39 +09:00 |
Yusuke Shinyama
|
44074b42ea
|
Added: stripcontrol for XMLConverter (-S option)
|
2014-06-22 00:33:00 +09:00 |
Yusuke Shinyama
|
1384a3fe8d
|
Code cleanup: removed some debug flags.
|
2014-06-14 15:43:10 +09:00 |
Yusuke Shinyama
|
8e14ebf4e1
|
Use logging module instead of print.
|
2014-06-14 12:00:49 +09:00 |
Yusuke Shinyama
|
2b56b2eedf
|
Merged.
|
2013-11-07 19:50:41 +09:00 |
Matthew Duggan
|
2caa5edc25
|
PEP8: Whitespace changes to match pep8
|
2013-11-07 17:35:04 +09:00 |
Matthew Duggan
|
c1da8b835c
|
PEP8: Remove trailing whitespace
|
2013-11-07 16:14:53 +09:00 |
Matthew Duggan
|
10a68c83bd
|
Remove unused imports identified by pyflakes
|
2013-11-07 16:09:44 +09:00 |
Yusuke Shinyama
|
02ad086f6a
|
fixed: HTMLConverter.
|
2013-10-25 18:10:40 +09:00 |
Yusuke Shinyama
|
0ea08890d4
|
renamed: python2 -> python.
|
2013-10-17 23:05:27 +09:00 |
Yusuke Shinyama
|
82ff98c7b3
|
imagewriter now works with text output
|
2011-11-07 01:15:10 +10:00 |
Yusuke Shinyama
|
dc8fde0e47
|
added CCITTFaxFilter support and a very crude image extraction.
|
2011-07-18 21:07:00 +10:00 |
Yusuke Shinyama
|
170c97a12b
|
colorspace patch by Lieb Simon
|
2011-06-06 17:10:12 +09:00 |
Yusuke Shinyama
|
0c41b8348e
|
code cleanup
|
2011-05-14 15:51:40 +09:00 |
Yusuke Shinyama
|
038ce4cd0c
|
added LTText.get_text() and .text property is no longer accessible.
|
2011-05-14 15:45:08 +09:00 |
Yusuke Shinyama
|
095534b294
|
figure object now does not call analyze.
|
2011-05-14 14:17:22 +09:00 |
Yusuke Shinyama
|
0e660dd385
|
rename: LTPolygon -> LTCurve
|
2011-04-20 22:05:25 +09:00 |
Yusuke Shinyama
|
dab70855bf
|
LTLine is now strictly horizontal or vertical.
|
2011-04-20 22:01:54 +09:00 |
Jonathan J Hunt
|
ec682539da
|
Optimized memory usage in TextConverter by ignoring all drawing commands.
|
2011-03-07 15:11:31 +10:00 |
Yusuke Shinyama
|
7dbb664db3
|
code cleanup and more debugging options
|
2011-02-14 23:42:05 +09:00 |
Yusuke Shinyama
|
b2d13db29a
|
code cleanup
|
2011-02-14 22:51:20 +09:00 |
Yusuke Shinyama
|
4eb6083c09
|
code cleanup
|
2011-01-03 18:11:22 +09:00 |
Yusuke Shinyama
|
3da3adad9b
|
method renamed: finish(self) -> analyze(self, laparams).
|
2010-12-26 16:56:21 +09:00 |
yusuke.shinyama.dummy
|
84ed94aec0
|
another bugfix
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@281 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-12-25 08:41:03 +00:00 |
yusuke.shinyama.dummy
|
9bba7ac08b
|
oops, forgot to fix this
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@280 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-12-25 08:40:58 +00:00 |
yusuke.shinyama.dummy
|
9f78915ea6
|
show cid for unknown characters
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@275 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-23 10:53:19 +00:00 |
yusuke.shinyama.dummy
|
7374b81383
|
htmlconverter improved
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@274 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-14 15:04:28 +00:00 |
yusuke.shinyama.dummy
|
fb4ce96309
|
add font-family
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@273 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-14 10:07:50 +00:00 |
yusuke.shinyama.dummy
|
476ecf7e32
|
add html exact layout mode; default changed.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@272 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-14 10:07:41 +00:00 |
yusuke.shinyama.dummy
|
9584845358
|
layout analysis improved
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@268 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-09 10:40:05 +00:00 |
yusuke.shinyama.dummy
|
edbd3764a7
|
html layout output fix
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@267 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-09 10:39:48 +00:00 |
yusuke.shinyama.dummy
|
509ab66319
|
stay with python2
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@264 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-10-19 09:57:01 +00:00 |
yusuke.shinyama.dummy
|
cc139db8a7
|
bugfix LTChar.is_vertical undefined. verticality is now handled by LTTextBox
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@254 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-10-17 05:15:23 +00:00 |
yusuke.shinyama.dummy
|
3305c07ba2
|
layout analysis improved
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@245 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-10-17 05:13:39 +00:00 |
yusuke.shinyama.dummy
|
cf52476f5e
|
remove redundancy
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@221 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-06-06 05:16:21 +00:00 |
yusuke.shinyama.dummy
|
fe3bdbfce0
|
text rise support added
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@217 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-05-18 14:57:04 +00:00 |
yusuke.shinyama.dummy
|
eb535d4106
|
change PDFPageAggregator -> PDFLayoutAnalyzer
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@213 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-24 13:31:21 +00:00 |
yusuke.shinyama.dummy
|
833f859449
|
move TagExtractor
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@212 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-24 13:31:11 +00:00 |
yusuke.shinyama.dummy
|
97848409e5
|
fix xobject resources bug, thanks to Jose Maria
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@209 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-24 04:32:03 +00:00 |
yusuke.shinyama.dummy
|
e77a6ba997
|
-A (all_texts) option added for layout analysis
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@205 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-10 11:30:03 +00:00 |
yusuke.shinyama.dummy
|
609c6e1f5f
|
rename: LayoutItem -> LTItem, LayoutContainer -> LTContainer
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@203 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-10 11:29:30 +00:00 |
yusuke.shinyama.dummy
|
c81142aa44
|
image handling addition (untested)
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@202 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-10 11:05:02 +00:00 |
yusuke.shinyama.dummy
|
5f822f6dcb
|
improved layout analysis.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@197 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-03-26 11:11:35 +00:00 |
yusuke.shinyama.dummy
|
cd39642abe
|
code cleanup
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@188 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-03-22 04:00:18 +00:00 |
yusuke.shinyama.dummy
|
e01cb43e31
|
add novel layout analysis
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@187 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-03-21 02:21:37 +00:00 |
yusuke.shinyama.dummy
|
ffaaea0bac
|
layout analysis changed drastically.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@186 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-03-20 05:43:34 +00:00 |
yusuke.shinyama.dummy
|
23be96c49e
|
CAUTION! changed the way of internal layout handling.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@184 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-02-27 03:59:25 +00:00 |
yusuke.shinyama.dummy
|
2555b38836
|
fix typos (patches by sm)
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@183 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-02-15 14:50:19 +00:00 |
yusuke.shinyama.dummy
|
0424fd8dc9
|
incorporated some patches by Andre Auzi
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@180 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-02-07 15:11:24 +00:00 |