From 70e42bff043fd0db4f7b769f9985ddf7574b9005 Mon Sep 17 00:00:00 2001
From: "yusuke.shinyama.dummy"
Date: Tue, 24 Mar 2009 16:26:59 +0000
Subject: [PATCH] encoding bug fixed.

git-svn-id: https://pdfminer.googlecode.com/svn/trunk/pdfminer@74 1aa58f4a-7d42-0410-adbc-911cccaed67c
---
 README.html       |   7 +-
 pdflib/cmap.py    |  20 +++--
 pdflib/pdffont.py | 221 ++++++++++++++++++++++++++++++++++++----------
 tools/dumppdf.py  |   2 +-
 4 files changed, 197 insertions(+), 53 deletions(-)

diff --git a/README.html b/README.html
index 0e6418b..709476d 100644
--- a/README.html
+++ b/README.html
@@ -14,7 +14,7 @@ Python PDF parser and analyzer
-Last Modified: Mon Feb 2 00:01:01 JST 2009
+Last Modified: Wed Mar 25 01:26:33 JST 2009
@@ -175,6 +175,10 @@ By default, it extracts texts from all the pages.
-c codec
Specifies the output codec for non-ASCII texts.

+-w
+Split each word into a different chunk in the output.
+This makes the word spacing correctly handled.
+
-t type
Specifies the output format. The following formats are currently supported.