diff --git a/Makefile b/Makefile index c09b472..01de158 100644 --- a/Makefile +++ b/Makefile @@ -10,7 +10,7 @@ VERSION=`$(PYTHON) $(PACKAGE)/__init__.py` DISTNAME=$(PACKAGE)-dist-$(VERSION) DISTFILE=$(DISTNAME).tar.gz -CONV_CMAP=$(PYTHON) -m tools.conv_cmap +CONV_CMAP=$(PYTHON) pdfminer/cmap.py all: @@ -27,8 +27,7 @@ test: cd samples && make test cdbcmap: CMap - -mkdir CDBCMap - $(CONV_CMAP) CMap/* + $(CONV_CMAP) CMap # Maintainance: commit: clean diff --git a/README.html b/README.html index bce814f..72e5bba 100644 --- a/README.html +++ b/README.html @@ -18,7 +18,7 @@ Python PDF parser and analyzer
-Last Modified: Sat Apr 4 09:14:28 JST 2009 +Last Modified: Sat May 16 19:58:11 JST 2009
@@ -26,23 +26,22 @@ Last Modified: Sat Apr 4 09:14:28 JST 2009

What's It?

-PDFMiner is a suite of programs that aims to help -analyzing text data from PDF documents. -It includes a PDF parser, a PDF renderer -(though only rendering text is supported for now), -and a couple of nice tools to extract texts. +PDFMiner is a suite of programs that help +extracting and analyzing text data of PDF documents. Unlike other PDF-related tools, it allows to obtain the exact location of texts in a page, as well as -other layout information such as font size or font name, -which could be useful for analyzing the document. +other extra information such as font information or ruled lines. +It includes a PDF converter that can transform PDF files +into other text formats (such as HTML). It has an extensible +PDF parser that can be used for other purpoes instead of text analysis.

Features: