release-20110227
parent
e8cd880409
commit
f22b056454
|
@ -9,7 +9,7 @@
|
||||||
|
|
||||||
<div align=right class=lastmod>
|
<div align=right class=lastmod>
|
||||||
<!-- hhmts start -->
|
<!-- hhmts start -->
|
||||||
Last Modified: Mon Feb 14 13:31:54 UTC 2011
|
Last Modified: Sun Feb 27 10:51:18 UTC 2011
|
||||||
<!-- hhmts end -->
|
<!-- hhmts end -->
|
||||||
</div>
|
</div>
|
||||||
|
|
||||||
|
@ -184,7 +184,7 @@ Not all characters in a PDF can be safely converted to Unicode.
|
||||||
$ <strong>pdf2txt.py -o output.html samples/naacl06-shinyama.pdf</strong>
|
$ <strong>pdf2txt.py -o output.html samples/naacl06-shinyama.pdf</strong>
|
||||||
(extract text as an HTML file whose filename is output.html)
|
(extract text as an HTML file whose filename is output.html)
|
||||||
|
|
||||||
$ <strong>pdf2txt.py -c euc-jp -o output.html samples/jo.pdf</strong>
|
$ <strong>pdf2txt.py -V -c euc-jp -o output.html samples/jo.pdf</strong>
|
||||||
(extract a Japanese HTML file in vertical writing, CMap is required)
|
(extract a Japanese HTML file in vertical writing, CMap is required)
|
||||||
|
|
||||||
$ <strong>pdf2txt.py -P mypassword -o output.txt secret.pdf</strong>
|
$ <strong>pdf2txt.py -P mypassword -o output.txt secret.pdf</strong>
|
||||||
|
@ -270,6 +270,9 @@ are M = 1.0, L = 0.3, and W = 0.2, respectively.
|
||||||
<dd> Forces to perform layout analysis for all the text strings,
|
<dd> Forces to perform layout analysis for all the text strings,
|
||||||
including texts contained in figures.
|
including texts contained in figures.
|
||||||
<p>
|
<p>
|
||||||
|
<dt> <code>-V</code>
|
||||||
|
<dd> Allows vertical writing detection.
|
||||||
|
<p>
|
||||||
<dt> <code>-Y <em>layout_mode</em></code>
|
<dt> <code>-Y <em>layout_mode</em></code>
|
||||||
<dd> Specifies how the page layout should be preserved. (Currently only applies to HTML format.)
|
<dd> Specifies how the page layout should be preserved. (Currently only applies to HTML format.)
|
||||||
<ul>
|
<ul>
|
||||||
|
@ -354,6 +357,7 @@ no stream header is displayed for the ease of saving it to a file.
|
||||||
|
|
||||||
<h2><a name="changes">Changes</a></h2>
|
<h2><a name="changes">Changes</a></h2>
|
||||||
<ul>
|
<ul>
|
||||||
|
<li> 2010/02/27: Bugfixes and layout analysis improvements. Thanks to fujimoto.report.
|
||||||
<li> 2010/12/26: A couple of bugfixes and minor improvements. Thanks to Kevin Brubeck Unhammer and Daniel Gerber.
|
<li> 2010/12/26: A couple of bugfixes and minor improvements. Thanks to Kevin Brubeck Unhammer and Daniel Gerber.
|
||||||
<li> 2010/10/17: A couple of bugfixes and minor improvements. Thanks to standardabweichung and Alastair Irving.
|
<li> 2010/10/17: A couple of bugfixes and minor improvements. Thanks to standardabweichung and Alastair Irving.
|
||||||
<li> 2010/09/07: A minor bugfix. Thanks to Alexander Garden.
|
<li> 2010/09/07: A minor bugfix. Thanks to Alexander Garden.
|
||||||
|
|
|
@ -1,4 +1,4 @@
|
||||||
#!/usr/bin/env python2
|
#!/usr/bin/env python2
|
||||||
__version__ = '20101226'
|
__version__ = '20110227'
|
||||||
|
|
||||||
if __name__ == '__main__': print __version__
|
if __name__ == '__main__': print __version__
|
||||||
|
|
Loading…
Reference in New Issue