* Fix #795
* Documentation updates (FAQ and others)
* New how-to for extracting coordinates
* Indent fix in documentation
* Revert "Fix #795"
This reverts commit cac62171fc.
* Move description of iterating LTPage to the docstring of LTPage
* Remove the how-to for extracting coordinates from this PR
* Add CHANGELOG.md
* Remove FAQ from this branch
* Only add one line to CHANGELOG.md
Co-authored-by: Kunal Gehlot <kunal.g@360hvpl.com>
* Fix typos in converting_pdf_to_text.rst
* Keep "pdfminer.six" on a single line; when it is split across a newline, the renderer treats it as two separate words and displays them apart.
* Trim redundant spaces
Co-authored-by: Pieter Marsman <pietermarsman@gmail.com>
* Fix typos and possible mistakes.
* Revert two edits based on discussion in #579
Revert the two changes based on our discussion.
I read the documentation and glanced at the default code. The confusion was perhaps caused by the figure that shows the Char Margin (M) and the Word Margin (W): M is clearly smaller than W in absolute terms but, as mentioned, both are relative numbers.
Maybe it would be useful to point that out in the figure, but I am not sure how best to do it.
Another option is to use a name like `min_char_margin_threshold` or similar, in the hope that it is easier to understand. Just some thoughts!
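To make the "relative numbers" point concrete, here is a minimal sketch. It assumes both margins are multiplied by the character width to get an absolute distance, and it assumes default values of 2.0 for the char margin and 0.1 for the word margin. The helper `absolute_margins` is hypothetical and is not part of pdfminer.six.

```python
def absolute_margins(char_width, char_margin=2.0, word_margin=0.1):
    """Convert relative margins to absolute distances.

    Hypothetical helper for illustration only. The defaults and the
    "multiply by character width" rule are assumptions, not a copy of
    the library's code.
    """
    return char_margin * char_width, word_margin * char_width


# For a character 6.0 units wide, the same relative settings give
# very different absolute distances.
m_abs, w_abs = absolute_margins(char_width=6.0)
print(f"char margin: {m_abs:.1f}, word margin: {w_abs:.1f}")
```

Under these assumptions a figure can draw M and W at any absolute size, because the actual thresholds scale with the glyph width.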
* Trigger Travis again
Co-authored-by: Pieter Marsman <pietermarsman@gmail.com>
* Make the structure of the documentation clearer: tutorials, how-to, topics and reference
* Add how-to for images
* Restructure tutorials section, and add install section
* Always use up-to-date version
* Fix indentation warning in docstring
* Add option to dumppdf.py and pdf2txt.py to show version
Fixes #162