Yusuke Shinyama
|
fe86b4e64e
|
Changed: StringIO -> io.BytesIO
|
2014-06-25 19:55:41 +09:00 |
Yusuke Shinyama
|
44074b42ea
|
Added: stripcontrol for XMLConverter (-S option)
|
2014-06-22 00:33:00 +09:00 |
Yusuke Shinyama
|
bb866ae148
|
Changed: new except syntax (2.6 or above).
|
2014-06-16 18:50:07 +09:00 |
Yusuke Shinyama
|
28e96ba3d0
|
Use print as a function.
|
2014-06-15 12:14:33 +09:00 |
Yusuke Shinyama
|
1384a3fe8d
|
Code cleanup: removed some debug flags.
|
2014-06-14 15:43:10 +09:00 |
Yusuke Shinyama
|
17b9b19a26
|
Fixed for newer version: pdf2html.cgi
|
2014-04-02 18:54:50 +09:00 |
Yusuke Shinyama
|
340387bfc6
|
Cleanup: isinstance
|
2014-03-28 17:50:59 +09:00 |
Yusuke Shinyama
|
f9079e4c0a
|
Fixed dumppdf.py issues.
|
2014-03-24 20:55:00 +09:00 |
Yusuke Shinyama
|
bb6f9b6fc9
|
Added: -R option.
|
2013-11-25 18:21:19 +09:00 |
Alex Rothberg
|
af8c4a6b8f
|
- only visit each objid once when dumping all objects
|
2013-11-18 20:41:09 -05:00 |
Yusuke Shinyama
|
2b56b2eedf
|
Merged.
|
2013-11-07 19:50:41 +09:00 |
Matthew Duggan
|
c1da8b835c
|
PEP8: Remove trailing whitespace
|
2013-11-07 16:14:53 +09:00 |
Matthew Duggan
|
10a68c83bd
|
Remove unused imports identified by pyflakes
|
2013-11-07 16:09:44 +09:00 |
Yusuke Shinyama
|
d3730a29ec
|
API change: process_pdf -> PDFPage.get_pages
|
2013-10-22 18:59:16 +09:00 |
Yusuke Shinyama
|
8a70a9f657
|
fixed: encoding problem with vertical characters.
|
2013-10-22 18:44:40 +09:00 |
Yusuke Shinyama
|
32844507ea
|
Fixed some style issues.
|
2013-10-19 08:41:01 +09:00 |
Yusuke Shinyama
|
28cb424f8f
|
Merge pull request #21 from eug48/master
dumppdf: support for extracting embedded files using the -E option
|
2013-10-18 16:23:09 -07:00 |
Yusuke Shinyama
|
6ca9ac5434
|
chmod fix.
|
2013-10-17 23:06:07 +09:00 |
Yusuke Shinyama
|
0ea08890d4
|
renamed: python2 -> python.
|
2013-10-17 23:05:27 +09:00 |
Yusuke Shinyama
|
6ad82e355c
|
Beating the codepage dragon.
|
2013-10-17 22:57:48 +09:00 |
Yusuke Shinyama
|
774827b4ce
|
Code cleanup: conv_cmap.py
|
2013-10-12 13:20:40 +09:00 |
Yusuke Shinyama
|
f85c374cae
|
Separated PDFPage to pdfpage.py.
|
2013-10-10 19:54:55 +09:00 |
Yusuke Shinyama
|
c926874d20
|
API Change: the PDFDocument cstr now takes PDFParser. set_parser() is removed.
|
2013-10-10 18:40:06 +09:00 |
Yusuke Shinyama
|
2221163b94
|
Split pdfparser.py and pdfdocument.py.
|
2013-10-10 18:29:30 +09:00 |
Yusuke Shinyama
|
1467fc674c
|
Added fallback for broken PDFs.
|
2013-10-09 22:45:54 +09:00 |
Yusuke Shinyama
|
06425bba00
|
Introducing PDFObjectNotFound
|
2013-10-09 21:39:23 +09:00 |
eug
|
925845b172
|
dumppdf: support for extracting embedded files using the -E option
|
2013-01-20 13:29:35 +10:00 |
Yusuke Shinyama
|
82ff98c7b3
|
imagewriter now works with text output
|
2011-11-07 01:15:10 +10:00 |
Yusuke Shinyama
|
dc8fde0e47
|
added CCITTFaxFilter support and a very crude image extraction.
|
2011-07-18 21:07:00 +10:00 |
Yusuke Shinyama
|
fcf0d74ecc
|
tweaks for debugging
|
2011-04-21 22:07:52 +09:00 |
Yusuke Shinyama
|
4918d59bc2
|
disable caching support
|
2011-03-03 00:04:43 +09:00 |
Yusuke Shinyama
|
7dbb664db3
|
code cleanup and more debugging options
|
2011-02-14 23:42:05 +09:00 |
Yusuke Shinyama
|
cbd58121e3
|
fix aggressive vertical writing detection (which ruins layout)
|
2011-02-02 23:09:34 +09:00 |
Yusuke Shinyama
|
d3bcc0eef5
|
another minor fix
|
2010-12-26 19:30:46 +09:00 |
Yusuke Shinyama
|
a24c452ba2
|
boxes_flow patch by Daniel Gerber
|
2010-12-26 17:26:39 +09:00 |
Yusuke Shinyama
|
bf44e52cf7
|
merged
|
2010-12-25 17:54:17 +09:00 |
yusuke.shinyama.dummy
|
866f2bbb75
|
webapp fixed
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@283 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-12-25 08:41:35 +00:00 |
yusuke.shinyama.dummy
|
5d98a27d9c
|
test cases updated
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@282 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-12-25 08:41:11 +00:00 |
Yusuke Shinyama
|
432b3829d3
|
test cases updated
|
2010-12-24 22:30:25 +09:00 |
yusuke.shinyama.dummy
|
2bf9c23801
|
check_extractable paramater added
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@276 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-23 10:53:28 +00:00 |
yusuke.shinyama.dummy
|
7374b81383
|
htmlconverter improved
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@274 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-11-14 15:04:28 +00:00 |
yusuke.shinyama.dummy
|
509ab66319
|
stay with python2
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@264 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-10-19 09:57:01 +00:00 |
yusuke.shinyama.dummy
|
afe33312c6
|
outline bug fixed
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@249 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-10-17 05:14:52 +00:00 |
yusuke.shinyama.dummy
|
ca5588a702
|
bugfix by Humberto Pereira
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@241 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-08-29 06:59:50 +00:00 |
yusuke.shinyama.dummy
|
4554705881
|
glyphlist bug (due to my misunderstanding of spec.)
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@237 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-08-26 15:02:46 +00:00 |
yusuke.shinyama.dummy
|
a0dd46bd8e
|
cmap compression patch. thanks to Jakub Wilk
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@228 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-06-13 13:50:24 +00:00 |
yusuke.shinyama.dummy
|
f9c9357547
|
pdf2html.cgi code cleanup
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@218 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-05-29 11:51:15 +00:00 |
yusuke.shinyama.dummy
|
8e92ddca30
|
latin2ascii.py was moved as a utility
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@215 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-05-05 05:51:11 +00:00 |
yusuke.shinyama.dummy
|
eb535d4106
|
change PDFPageAggregator -> PDFLayoutAnalyzer
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@213 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-24 13:31:21 +00:00 |
yusuke.shinyama.dummy
|
32d65b70f8
|
trivial change
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@211 1aa58f4a-7d42-0410-adbc-911cccaed67c
|
2010-04-24 13:31:03 +00:00 |