Attila Szász
938419c476
Align dumppdf tool to modified data structures. ( #73 )
...
* Align dumppdf tool to modified data structures.
TOC page numbers should also work now, counting from 1.
* Update version number.
2017-07-20 20:46:11 +02:00
Antonio Ercole De Luca
0fdebc6739
Removing all the "#!/usr/bin/env python" lines, they do not need for … ( #34 )
...
* Removing all the "#!/usr/bin/env python" lines, they do not need for python3, solving issue number: #19 .
* Restored all the shebangs in the tools and tests folders (because they are real executables) but used "#!/usr/bin/env python" instead of "#!/usr/bin/python" as this blog points out: https://www.peterbe.com/plog/importance-of-env
Removed also the shebang from pdfminer/psparser.py file.
2016-11-08 20:01:11 +01:00
cybjit
2639b15ef4
guess argv encoding in py2 using sys.stdin.encoding
2014-09-16 23:17:26 +02:00
cybjit
14585987c3
keep password api unicode, latin1 or utf-8 is encoded in handler
2014-09-16 22:58:25 +02:00
cybjit
714423883c
setup logging for pdf2txt and fix dumppdf
2014-09-12 00:29:31 +02:00
unknown
28c2a4e6ad
2.7/3.4 encoding corrected
2014-09-04 10:31:33 +02:00
unknown
a6475b61b4
Python 3.4 support added and tested
2014-09-03 13:17:41 +02:00
Yusuke Shinyama
bb866ae148
Changed: new except syntax (2.6 or above).
2014-06-16 18:50:07 +09:00
Yusuke Shinyama
28e96ba3d0
Use print as a function.
2014-06-15 12:14:33 +09:00
Yusuke Shinyama
340387bfc6
Cleanup: isinstance
2014-03-28 17:50:59 +09:00
Yusuke Shinyama
f9079e4c0a
Fixed dumppdf.py issues.
2014-03-24 20:55:00 +09:00
Alex Rothberg
af8c4a6b8f
- only visit each objid once when dumping all objects
2013-11-18 20:41:09 -05:00
Matthew Duggan
c1da8b835c
PEP8: Remove trailing whitespace
2013-11-07 16:14:53 +09:00
Yusuke Shinyama
32844507ea
Fixed some style issues.
2013-10-19 08:41:01 +09:00
Yusuke Shinyama
28cb424f8f
Merge pull request #21 from eug48/master
...
dumppdf: support for extracting embedded files using the -E option
2013-10-18 16:23:09 -07:00
Yusuke Shinyama
6ca9ac5434
chmod fix.
2013-10-17 23:06:07 +09:00
Yusuke Shinyama
0ea08890d4
renamed: python2 -> python.
2013-10-17 23:05:27 +09:00
Yusuke Shinyama
f85c374cae
Separated PDFPage to pdfpage.py.
2013-10-10 19:54:55 +09:00
Yusuke Shinyama
c926874d20
API Change: the PDFDocument cstr now takes PDFParser. set_parser() is removed.
2013-10-10 18:40:06 +09:00
Yusuke Shinyama
2221163b94
Split pdfparser.py and pdfdocument.py.
2013-10-10 18:29:30 +09:00
Yusuke Shinyama
1467fc674c
Added fallback for broken PDFs.
2013-10-09 22:45:54 +09:00
Yusuke Shinyama
06425bba00
Introducing PDFObjectNotFound
2013-10-09 21:39:23 +09:00
eug
925845b172
dumppdf: support for extracting embedded files using the -E option
2013-01-20 13:29:35 +10:00
yusuke.shinyama.dummy
509ab66319
stay with python2
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@264 1aa58f4a-7d42-0410-adbc-911cccaed67c
2010-10-19 09:57:01 +00:00
yusuke.shinyama.dummy
afe33312c6
outline bug fixed
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@249 1aa58f4a-7d42-0410-adbc-911cccaed67c
2010-10-17 05:14:52 +00:00
yusuke.shinyama.dummy
ca5588a702
bugfix by Humberto Pereira
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@241 1aa58f4a-7d42-0410-adbc-911cccaed67c
2010-08-29 06:59:50 +00:00
yusuke.shinyama.dummy
9052cd1ea7
better TOC extraction
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@207 1aa58f4a-7d42-0410-adbc-911cccaed67c
2010-04-24 01:34:18 +00:00
yusuke.shinyama.dummy
2555b38836
fix typos (patches by sm)
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@183 1aa58f4a-7d42-0410-adbc-911cccaed67c
2010-02-15 14:50:19 +00:00
yusuke.shinyama.dummy
538a605ac0
several bugfixes.
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@179 1aa58f4a-7d42-0410-adbc-911cccaed67c
2010-02-07 03:14:00 +00:00
yusuke.shinyama.dummy
dc6e5c366d
jpeg extraction support added.
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@174 1aa58f4a-7d42-0410-adbc-911cccaed67c
2010-01-30 07:30:01 +00:00
yusuke.shinyama.dummy
98c8367339
warning removal.
...
code cleanup.
cmap bug fixed.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@168 1aa58f4a-7d42-0410-adbc-911cccaed67c
2010-01-01 03:09:26 +00:00
yusuke.shinyama.dummy
faa775897c
another bugfix
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@156 1aa58f4a-7d42-0410-adbc-911cccaed67c
2009-11-07 09:01:11 +00:00
yusuke.shinyama.dummy
f444c88e3d
testing against None with "is", not using "=="
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@153 1aa58f4a-7d42-0410-adbc-911cccaed67c
2009-11-06 15:10:29 +00:00
yusuke.shinyama.dummy
7790808560
to 4-space indentation
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@142 1aa58f4a-7d42-0410-adbc-911cccaed67c
2009-10-24 04:41:59 +00:00
yusuke.shinyama.dummy
3e12268bf6
rename package pdflib -> pdfminer.
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@103 1aa58f4a-7d42-0410-adbc-911cccaed67c
2009-05-16 06:12:01 +00:00
yusuke.shinyama.dummy
f628c0d3fe
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@101 1aa58f4a-7d42-0410-adbc-911cccaed67c
2009-05-15 14:34:53 +00:00
yusuke.shinyama.dummy
43e5c05307
handle error when an object was not found in dumpxml()
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@92 1aa58f4a-7d42-0410-adbc-911cccaed67c
2009-04-26 15:03:47 +00:00
yusuke.shinyama.dummy
f8510edffc
AsciiHexDecode filter patch incorporated. Thanks to Troy Bollinger.
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@86 1aa58f4a-7d42-0410-adbc-911cccaed67c
2009-04-08 10:55:01 +00:00
yusuke.shinyama.dummy
70e42bff04
encoding bug fixed.
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@74 1aa58f4a-7d42-0410-adbc-911cccaed67c
2009-03-24 16:26:59 +00:00
yusuke.shinyama.dummy
b432a3f4ae
patch from Troy Bollinger.
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@71 1aa58f4a-7d42-0410-adbc-911cccaed67c
2009-02-28 05:44:08 +00:00
yusuke.shinyama.dummy
91770edd46
foo
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@59 1aa58f4a-7d42-0410-adbc-911cccaed67c
2009-01-10 09:25:03 +00:00
yusuke.shinyama.dummy
24bdd33557
various bugfixes
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@56 1aa58f4a-7d42-0410-adbc-911cccaed67c
2009-01-05 04:40:50 +00:00
yusuke.shinyama.dummy
71be16febe
wordspace handling improved.
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@55 1aa58f4a-7d42-0410-adbc-911cccaed67c
2008-12-25 15:09:54 +00:00
yusuke.shinyama.dummy
9740f26cec
outline (TOC) extraction supported.
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@42 1aa58f4a-7d42-0410-adbc-911cccaed67c
2008-07-09 15:15:32 +00:00
yusuke.shinyama.dummy
07fc1799b3
improved html.
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@38 1aa58f4a-7d42-0410-adbc-911cccaed67c
2008-06-29 10:53:39 +00:00
yusuke.shinyama.dummy
8a77664c6b
changed again...
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@36 1aa58f4a-7d42-0410-adbc-911cccaed67c
2008-06-29 08:49:28 +00:00
yusuke.shinyama.dummy
24fdae38d4
reorganize the directory structure.
...
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@35 1aa58f4a-7d42-0410-adbc-911cccaed67c
2008-06-29 08:45:46 +00:00