release 20100424

git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@210 1aa58f4a-7d42-0410-adbc-911cccaed67c
pull/1/head
yusuke.shinyama.dummy 2010-04-24 04:32:21 +00:00
parent 97848409e5
commit a16eba30b7
3 changed files with 14 additions and 11 deletions

View File

@ -19,7 +19,7 @@ Python PDF parser and analyzer
<div align=right class=lastmod>
<!-- hhmts start -->
Last Modified: Sat Apr 24 02:48:00 UTC 2010
Last Modified: Sat Apr 24 04:30:10 UTC 2010
<!-- hhmts end -->
</div>
@ -41,8 +41,7 @@ Last Modified: Sat Apr 24 02:48:00 UTC 2010
<hr noshade>
<h2>What's It?</h2>
<p>
PDFMiner is a suite of programs that help
extracting some information from PDF documents.
PDFMiner is a tool for extracting information from PDF documents.
Unlike other PDF-related tools, it focuses entirely on getting
and analyzing text data. PDFMiner allows to obtain
the exact location of texts in a page, as well as
@ -270,6 +269,10 @@ are M = 1.0, L = 0.3, and W = 0.2, respectively.
<dt> <code>-n</code>
<dd> Suppress layout analysis.
<p>
<dt> <code>-A</code>
<dd> Forces to perform layout analysis for all the text strings,
including texts contained in figures.
<p>
<dt> <code>-s <em>scale</em></code>
<dd> Specifies the output scale. Can be used in HTML format only.
<p>
@ -374,6 +377,7 @@ no stream header is displayed for the ease of saving it to a file.
<hr noshade>
<h2>Changes</h2>
<ul>
<li> 2010/04/24: Bugfixes and tiny improvements on TOC extraction. Thanks to Jose Maria.
<li> 2010/03/26: Bugfixes. Thanks to Brian Berry and Lubos Pintes.
<li> 2010/03/22: Improved layout analysis. Added regression tests.
<li> 2010/03/12: A couple of bugfixes. Thanks to Sean Manefield.

View File

@ -1,4 +1,4 @@
#!/usr/bin/env python
__version__ = '20100327'
__version__ = '20100424'
if __name__ == '__main__': print __version__

View File

@ -6,15 +6,14 @@ setup(
name='pdfminer',
version=__version__,
description='PDF parser and analyzer',
long_description='''PDFMiner is a suite of programs that help
extracting and analyzing text data from PDF documents.
Unlike other PDF-related tools, it allows to obtain
long_description='''PDFMiner is a tool for extracting information from PDF documents.
Unlike other PDF-related tools, it focuses entirely on getting
and analyzing text data. PDFMiner allows to obtain
the exact location of texts in a page, as well as
other extra information such as font information or ruled lines.
It can also infer its text flow and reconstruct the original layout.
PDFMiner includes a PDF converter that can transform PDF files
other information such as fonts or lines.
It includes a PDF converter that can transform PDF files
into other text formats (such as HTML). It has an extensible
PDF parser library that can be used for other purposes instead of text analysis.''',
PDF parser that can be used for other purposes instead of text analysis.''',
license='MIT/X',
author='Yusuke Shinyama',
author_email='yusuke at cs dot nyu dot edu',