From e2e9adfaf359cc67757129471e98730984046ba6 Mon Sep 17 00:00:00 2001
From: "yusuke.shinyama.dummy"
Date: Tue, 6 Apr 2010 10:51:16 +0000
Subject: [PATCH] wording

git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@200 1aa58f4a-7d42-0410-adbc-911cccaed67c
---
 docs/index.html | 11 ++++-------
 1 file changed, 4 insertions(+), 7 deletions(-)

diff --git a/docs/index.html b/docs/index.html
index 645f703..7d9e138 100644
--- a/docs/index.html
+++ b/docs/index.html
@@ -19,7 +19,7 @@ Python PDF parser and analyzer
-Last Modified: Sun Mar 28 07:21:28 UTC 2010
+Last Modified: Mon Apr 5 23:15:31 UTC 2010
@@ -46,7 +46,7 @@ extracting some meaningful information out of PDF documents.
 Unlike other PDF-related tools, it focuses entirely on getting
 and analyzing text data from PDFs. PDFMiner allows to obtain
 the exact location of texts in a page, as well as
-other extra information such as font information or ruled lines.
+other information such as fonts or ruled lines.
 It includes a PDF converter that can transform PDF files
 into other text formats (such as HTML). It has an extensible
 PDF parser that can be used for other purposes instead of text analysis.
@@ -131,11 +131,8 @@ W o r l d

For CJK languages

-In order to handle CJK languages,
-an additional data called CMap is required.
-CMap files are not installed by default.

-Here is the additional step you need to take:
+In order to process CJK languages, you need an additional step to take
+during installation:

 # make cmap
 python tools/conv_cmap.py pdfminer/cmap Adobe-CNS1 cmaprsrc/cid2code_Adobe_CNS1.txt cp950 big5