release 20100424

git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@210 1aa58f4a-7d42-0410-adbc-911cccaed67c
2010-04-24 04:32:21 +00:00 · 2010-04-24 04:32:21 +00:00 · a16eba30b7
parent 97848409e5
commit a16eba30b7
3 changed files with 14 additions and 11 deletions
--- a/docs/index.html
+++ b/docs/index.html
@ -19,7 +19,7 @@ Python PDF parser and analyzer

 <div align=right class=lastmod>
 <!-- hhmts start -->
-Last Modified: Sat Apr 24 02:48:00 UTC 2010
+Last Modified: Sat Apr 24 04:30:10 UTC 2010
 <!-- hhmts end -->
 </div>

@ -41,8 +41,7 @@ Last Modified: Sat Apr 24 02:48:00 UTC 2010
 <hr noshade>
 <h2>What's It?</h2>
 <p>
-PDFMiner is a suite of programs that help
-extracting some information from PDF documents.
+PDFMiner is a tool for extracting information from PDF documents.
 Unlike other PDF-related tools, it focuses entirely on getting 
 and analyzing text data. PDFMiner allows to obtain
 the exact location of texts in a page, as well as 
@ -270,6 +269,10 @@ are M = 1.0, L = 0.3, and W = 0.2, respectively.
 <dt> <code>-n</code> 
 <dd> Suppress layout analysis.
 <p>
+<dt> <code>-A</code> 
+<dd> Forces to perform layout analysis for all the text strings, 
+including texts contained in figures.
+<p>
 <dt> <code>-s <em>scale</em></code> 
 <dd> Specifies the output scale. Can be used in HTML format only.
 <p>
@ -374,6 +377,7 @@ no stream header is displayed for the ease of saving it to a file.
 <hr noshade>
 <h2>Changes</h2>
 <ul>
+<li> 2010/04/24: Bugfixes and tiny improvements on TOC extraction. Thanks to Jose Maria.
 <li> 2010/03/26: Bugfixes. Thanks to Brian Berry and Lubos Pintes.
 <li> 2010/03/22: Improved layout analysis. Added regression tests.
 <li> 2010/03/12: A couple of bugfixes. Thanks to Sean Manefield.
--- a/pdfminer/init.py
+++ b/pdfminer/init.py
@ -1,4 +1,4 @@
 #!/usr/bin/env python
-__version__ = '20100327'
+__version__ = '20100424'

 if __name__ == '__main__': print __version__
--- a/setup.py
+++ b/setup.py
@ -6,15 +6,14 @@ setup(
    name='pdfminer',
    version=__version__,
    description='PDF parser and analyzer',
-    long_description='''PDFMiner is a suite of programs that help
-extracting and analyzing text data from PDF documents.
-Unlike other PDF-related tools, it allows to obtain
+    long_description='''PDFMiner is a tool for extracting information from PDF documents.
+Unlike other PDF-related tools, it focuses entirely on getting 
+and analyzing text data. PDFMiner allows to obtain
 the exact location of texts in a page, as well as 
-other extra information such as font information or ruled lines.
-It can also infer its text flow and reconstruct the original layout.
-PDFMiner includes a PDF converter that can transform PDF files
+other information such as fonts or lines.
+It includes a PDF converter that can transform PDF files
 into other text formats (such as HTML). It has an extensible
-PDF parser library that can be used for other purposes instead of text analysis.''',
+PDF parser that can be used for other purposes instead of text analysis.''',
    license='MIT/X',
    author='Yusuke Shinyama',
    author_email='yusuke at cs dot nyu dot edu',