pdfminer.six/samples
yusuke.shinyama.dummy aa7e7d3e35 add a README file to show credits of the sample files.
git-svn-id: https://pdfminerr.googlecode.com/svn/trunk/pdfminer@223 1aa58f4a-7d42-0410-adbc-911cccaed67c
2010-06-06 05:16:37 +00:00
..
Makefile improved layout analysis. 2010-03-26 11:11:35 +00:00
README add a README file to show credits of the sample files. 2010-06-06 05:16:37 +00:00
dmca.html.ref improved layout analysis. 2010-03-26 11:11:35 +00:00
dmca.pdf basic encryption support added. 2008-04-26 06:47:56 +00:00
dmca.txt.ref improved layout analysis. 2010-03-26 11:11:35 +00:00
dmca.xml.ref test reference results changed 2010-04-10 11:29:40 +00:00
f1040nr.html.ref improved layout analysis. 2010-03-26 11:11:35 +00:00
f1040nr.pdf basic encryption support added. 2008-04-26 06:47:56 +00:00
f1040nr.txt.ref improved layout analysis. 2010-03-26 11:11:35 +00:00
f1040nr.xml.ref test reference results changed 2010-04-10 11:29:40 +00:00
i1040nr.html.ref improved layout analysis. 2010-03-26 11:11:35 +00:00
i1040nr.pdf basic encryption support added. 2008-04-26 06:47:56 +00:00
i1040nr.txt.ref improved layout analysis. 2010-03-26 11:11:35 +00:00
i1040nr.xml.ref test reference results changed 2010-04-10 11:29:40 +00:00
jo.html.ref improved layout analysis. 2010-03-26 11:11:35 +00:00
jo.pdf add samples, fixed silly bugs. 2007-12-31 05:02:15 +00:00
jo.txt.ref writing mode detection 2010-03-25 11:38:47 +00:00
jo.xml.ref test reference results changed 2010-04-10 11:29:40 +00:00
kampo.html.ref improved layout analysis. 2010-03-26 11:11:35 +00:00
kampo.pdf basic encryption support added. 2008-04-26 06:47:56 +00:00
kampo.txt.ref improved layout analysis. 2010-03-26 11:11:35 +00:00
kampo.xml.ref test reference results changed 2010-04-10 11:29:40 +00:00
naacl06-shinyama.html.ref improved layout analysis. 2010-03-26 11:11:35 +00:00
naacl06-shinyama.pdf add samples, fixed silly bugs. 2007-12-31 05:02:15 +00:00
naacl06-shinyama.txt.ref improved layout analysis. 2010-03-26 11:11:35 +00:00
naacl06-shinyama.xml.ref test reference results changed 2010-04-10 11:29:40 +00:00
nlp2004slides.html.ref improved layout analysis. 2010-03-26 11:11:35 +00:00
nlp2004slides.pdf basic encryption support added. 2008-04-26 06:47:56 +00:00
nlp2004slides.txt.ref consistent test results 2010-03-22 06:04:54 +00:00
nlp2004slides.xml.ref test reference results changed 2010-04-10 11:29:40 +00:00
simple1.html.ref improved layout analysis. 2010-03-26 11:11:35 +00:00
simple1.pdf testcase added 2009-10-24 02:50:07 +00:00
simple1.txt.ref add regression tests. 2010-03-22 04:34:52 +00:00
simple1.xml.ref test reference results changed 2010-04-10 11:29:40 +00:00
simple2.html.ref add regression tests. 2010-03-22 04:34:52 +00:00
simple2.pdf various cleanup for release. 2008-04-27 11:47:38 +00:00
simple2.txt.ref add regression tests. 2010-03-22 04:34:52 +00:00
simple2.xml.ref add regression tests. 2010-03-22 04:34:52 +00:00

README

This directory contains sample PDF files.

Here are the credits of the original files:

dmca.pdf: 
U.S. Copyright Office
The Digital Millenium Copyright Act
http://www.copyright.gov/legislation/dmca.pdf

f1040nr.pdf:
U.S. Department of the Treasury Internal Revenue Service
Form 1040-NR, U.S. Nonresident Alien Income Tax Return
http://www.irs.gov/pub/irs-pdf/f1040nr.pdf

i1040nr.pdf:
U.S. Department of the Treasury Internal Revenue Service
Instructions for Form 1040-NR, U.S. Nonresident Alien Income Tax Return
http://www.irs.gov/pub/irs-pdf/i1040nr.pdf

jo.pdf:
Kenji Miyazawa (1896-1933, copyright expired)
Preface of "Haru to Shura"
(File generated by LaTeX and dvi2pdfm)

kampo.pdf:
National Priting Bureau of Japan
Official Gazette, Vol. 4817
http://kanpou.npb.go.jp/

nlp2004slides.pdf:
Yusuke Shinyama and Satoshi Sekine
"Named Entity Discovery from Comparable News Corpora"

naacl06-shinyama.pdf:
Yusuke Shinyama and Satoshi Sekine
"Preemptive Information Extraction using Unrestircted Relation Discovery"

simple1.pdf:
(Originally taken from PDF Specification, 
Appendix G. "Simple Text String Example" and modified)

simple2.pdf:
(Originally taken from PDF Specification, 
Appendix G. "Simple Graphics Example" and modified)