documentation bit
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@201 1aa58f4a-7d42-0410-adbc-911cccaed67cpull/1/head
parent
e2e9adfaf3
commit
eafdc8830d
|
@ -19,7 +19,7 @@ Python PDF parser and analyzer
|
||||||
|
|
||||||
<div align=right class=lastmod>
|
<div align=right class=lastmod>
|
||||||
<!-- hhmts start -->
|
<!-- hhmts start -->
|
||||||
Last Modified: Mon Apr 5 23:15:31 UTC 2010
|
Last Modified: Wed Apr 7 10:20:18 UTC 2010
|
||||||
<!-- hhmts end -->
|
<!-- hhmts end -->
|
||||||
</div>
|
</div>
|
||||||
|
|
||||||
|
@ -42,11 +42,11 @@ Last Modified: Mon Apr 5 23:15:31 UTC 2010
|
||||||
<h2>What's It?</h2>
|
<h2>What's It?</h2>
|
||||||
<p>
|
<p>
|
||||||
PDFMiner is a suite of programs that help
|
PDFMiner is a suite of programs that help
|
||||||
extracting some meaningful information out of PDF documents.
|
extracting some information from PDF documents.
|
||||||
Unlike other PDF-related tools, it focuses entirely on getting
|
Unlike other PDF-related tools, it focuses entirely on getting
|
||||||
and analyzing text data from PDFs. PDFMiner allows to obtain
|
and analyzing text data. PDFMiner allows to obtain
|
||||||
the exact location of texts in a page, as well as
|
the exact location of texts in a page, as well as
|
||||||
other information such as fonts or ruled lines.
|
other information such as fonts or lines.
|
||||||
It includes a PDF converter that can transform PDF files
|
It includes a PDF converter that can transform PDF files
|
||||||
into other text formats (such as HTML). It has an extensible
|
into other text formats (such as HTML). It has an extensible
|
||||||
PDF parser that can be used for other purposes instead of text analysis.
|
PDF parser that can be used for other purposes instead of text analysis.
|
||||||
|
@ -54,6 +54,7 @@ PDF parser that can be used for other purposes instead of text analysis.
|
||||||
<strong>Features:</strong>
|
<strong>Features:</strong>
|
||||||
<ul>
|
<ul>
|
||||||
<li> Written entirely in Python. (for version 2.4 or newer)
|
<li> Written entirely in Python. (for version 2.4 or newer)
|
||||||
|
<li> Parse, analyze, and convert PDF documents.
|
||||||
<li> PDF-1.7 specification support. (well, almost)
|
<li> PDF-1.7 specification support. (well, almost)
|
||||||
<li> CJK languages and vertical writing scripts support.
|
<li> CJK languages and vertical writing scripts support.
|
||||||
<li> Various font types (Type1, TrueType, Type3, and CID) support.
|
<li> Various font types (Type1, TrueType, Type3, and CID) support.
|
||||||
|
|
Loading…
Reference in New Issue