documentation bit

git-svn-id: https://pdfminer.googlecode.com/svn/trunk/pdfminer@201 1aa58f4a-7d42-0410-adbc-911cccaed67c
yusuke.shinyama.dummy 2010-04-10 11:04:53 +00:00
parent e2e9adfaf3
commit eafdc8830d
1 changed file with 5 additions and 4 deletions

@@ -19,7 +19,7 @@ Python PDF parser and analyzer
 <div align=right class=lastmod>
 <!-- hhmts start -->
-Last Modified: Mon Apr 5 23:15:31 UTC 2010
+Last Modified: Wed Apr 7 10:20:18 UTC 2010
 <!-- hhmts end -->
 </div>
@@ -42,11 +42,11 @@ Last Modified: Mon Apr 5 23:15:31 UTC 2010
 <h2>What's It?</h2>
 <p>
 PDFMiner is a suite of programs that help
-extracting some meaningful information out of PDF documents.
+extracting some information from PDF documents.
 Unlike other PDF-related tools, it focuses entirely on getting
-and analyzing text data from PDFs. PDFMiner allows to obtain
+and analyzing text data. PDFMiner allows to obtain
 the exact location of texts in a page, as well as
-other information such as fonts or ruled lines.
+other information such as fonts or lines.
 It includes a PDF converter that can transform PDF files
 into other text formats (such as HTML). It has an extensible
 PDF parser that can be used for other purposes instead of text analysis.
@@ -54,6 +54,7 @@ PDF parser that can be used for other purposes instead of text analysis.
 <strong>Features:</strong>
 <ul>
 <li> Written entirely in Python. (for version 2.4 or newer)
+<li> Parse, analyze, and convert PDF documents.
 <li> PDF-1.7 specification support. (well, almost)
 <li> CJK languages and vertical writing scripts support.
 <li> Various font types (Type1, TrueType, Type3, and CID) support.