documentation bit
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@201 1aa58f4a-7d42-0410-adbc-911cccaed67c
pull/1/head
parent e2e9adfaf3
commit eafdc8830d
@@ -19,7 +19,7 @@ Python PDF parser and analyzer
 
 <div align=right class=lastmod>
 <!-- hhmts start -->
-Last Modified: Mon Apr 5 23:15:31 UTC 2010
+Last Modified: Wed Apr 7 10:20:18 UTC 2010
 <!-- hhmts end -->
 </div>
 
@@ -42,11 +42,11 @@ Last Modified: Mon Apr 5 23:15:31 UTC 2010
 <h2>What's It?</h2>
 <p>
 PDFMiner is a suite of programs that help
-extracting some meaningful information out of PDF documents.
+extracting some information from PDF documents.
 Unlike other PDF-related tools, it focuses entirely on getting
-and analyzing text data from PDFs. PDFMiner allows to obtain
+and analyzing text data. PDFMiner allows to obtain
 the exact location of texts in a page, as well as
-other information such as fonts or ruled lines.
+other information such as fonts or lines.
 It includes a PDF converter that can transform PDF files
 into other text formats (such as HTML). It has an extensible
 PDF parser that can be used for other purposes instead of text analysis.
@@ -54,6 +54,7 @@ PDF parser that can be used for other purposes instead of text analysis.
 <strong>Features:</strong>
 <ul>
 <li> Written entirely in Python. (for version 2.4 or newer)
+<li> Parse, analyze, and convert PDF documents.
 <li> PDF-1.7 specification support. (well, almost)
 <li> CJK languages and vertical writing scripts support.
 <li> Various font types (Type1, TrueType, Type3, and CID) support.