43 lines
1.3 KiB
Plaintext
43 lines
1.3 KiB
Plaintext
|
|
Installation:
|
|
|
|
1. Get http://www.unixuser.org/~euske/pub/CMap.tar.bz2
|
|
2. $ tar jxf CMap.tar.bz2
|
|
3. $ make cdbcmap
|
|
|
|
|
|
Dump the contents:
|
|
|
|
$ ./dumppdf.py foo.pdf
|
|
|
|
Extract the text:
|
|
|
|
$ ./pdf2txt.py foo.pdf > foo.xml
|
|
|
|
|
|
Terms and conditions:
|
|
|
|
Copyright (c) 2004-2008 Yusuke Shinyama <yusuke at cs dot nyu dot edu>
|
|
|
|
Permission is hereby granted, free of charge, to any person
|
|
obtaining a copy of this software and associated documentation
|
|
files (the "Software"), to deal in the Software without
|
|
restriction, including without limitation the rights to use,
|
|
copy, modify, merge, publish, distribute, sublicense, and/or
|
|
sell copies of the Software, and to permit persons to whom the
|
|
Software is furnished to do so, subject to the following
|
|
conditions:
|
|
|
|
The above copyright notice and this permission notice shall be
|
|
included in all copies or substantial portions of the Software.
|
|
|
|
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY
|
|
KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE
|
|
WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR
|
|
PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR
|
|
COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
|
|
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR
|
|
OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE
|
|
SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
|
|
|