pdfminer.six

Commit Graph

Author	SHA1	Message	Date
Fakabbir Amin	3d549ea48c	Removes code comments	2019-08-20 16:48:40 +05:30
Kaushik Acharya	963a227b2e	Updated URL for the article	2019-08-19 20:16:34 +05:30
Igor Moura	cf4641d877	Merge branch 'develop' into develop	2019-08-15 08:11:28 -03:00
Kaushik Acharya	bfbb8b8f0b	Adding Denis's article name.	2019-08-15 11:59:29 +05:30
Fakabbir Amin	3f0f05def6	Merge branch 'pdfstream-as-cmap' of https://github.com/fakabbir/pdfminer.six into pdfstream-as-cmap	2019-08-10 11:04:10 +05:30
Fakabbir Amin	3125d3634a	Correct old test cases	2019-08-10 11:03:28 +05:30
Fakabbir Amin	fe38695739	Merge branch 'develop' into pdfstream-as-cmap	2019-08-10 10:44:31 +05:30
Fakabbir Amin	5b210981c9	Adds Test Case	2019-08-10 10:19:20 +05:30
Fakabbir Amin	5a0d8db052	Adds decoder for OnebyteIdentityH/V instead of using default CMap	2019-08-10 10:07:23 +05:30
Tata Ganesh	48b25939c2	Merge pull request #264 from fakabbir/pdfstream-as-cmap Pdfstream as cmap	2019-07-31 22:20:37 +05:30
Tata Ganesh	42e2c8143b	Merge pull request #263 from pietermarsman/261-glyph-list-specification name2unicode() should follow the Adobe Glyph List Specification	2019-07-26 22:13:34 +05:30
Igor Moura	4eb6f769ea	Merge branch 'develop' of github.com:pdfminer/pdfminer.six into develop	2019-07-24 15:16:12 -03:00
Igor Moura	2f4518231f	Use resolve_all on PdfFont widths and bbox Fixes #268	2019-07-24 15:10:13 -03:00
Tata Ganesh	17364aa88e	Merge pull request #274 from igormp/iteritems Replaced .iteritems() with six.iteritems() for Python 2 and 3 compat	2019-07-24 23:16:25 +05:30
Igor Moura	540df9f676	Replaced .iteritems() and with six.iteritems() for Python 3 compat This is a squashed commit, the previous messages can be seen bellow This is the 1st commit message: Replaced .iteritems() usage for .items() Fixed some python 2 leftovers, as discussed in #267. Also formatted code according to Black.\nThis possibly breaks some python 2 compatibility This is the commit message #2: Reverted formatting and more spread six usage	2019-07-24 14:08:30 -03:00
Fakabbir Amin	f1a4dcea88	Adds Test Cases, Neater Code For CMap Assignment	2019-07-24 11:56:06 +05:30
Fakabbir Amin	b4c261b647	Removes Code Comments	2019-07-17 11:43:45 +05:30
Fakabbir Amin	fa400431f5	Adds Test, Removes Unnecessary Assumptions	2019-07-17 11:38:00 +05:30
Pieter Marsman	6f362f53fe	Raise a `KeyError` with a useful message if `unicode2name()` does not match any glyph name. Use this message to log debug statements.	2019-07-16 08:52:24 +02:00
Pieter Marsman	0fb83366b6	Remove intermediate variable `full_stop` because it is just a dot	2019-07-16 08:49:57 +02:00
Fakabbir Amin	cc40af3d2b	Removes @property, Adds docstring	2019-07-15 14:21:21 +05:30
Pieter Marsman	2bb850cdae	Fix error, python2 cannot handle unicode in a .py file	2019-07-14 15:43:07 +02:00
Pieter Marsman	1e24bfa0bd	Fix error, python2 cannot handle unicode in a .py file	2019-07-14 15:40:22 +02:00
Pieter Marsman	c597e95a9f	Use KeyError to signal that the name does not resemble any unicode, this pattern is also used in the rest of pdfminer.six	2019-07-14 15:37:15 +02:00
Pieter Marsman	fdb7e54862	Add lowercase adobe glyph name tests	2019-07-14 15:20:25 +02:00
Pieter Marsman	33cc9861ae	Add docstring to Type1FontHeaderParser.get_encoding() that describes that the custom CharStrings of the font are mapped to ''	2019-07-14 15:19:17 +02:00
Pieter Marsman	f0392f8049	Change implementation of name2unicode such that it follows the Adobe Glyph specs (with allowing lowercase)	2019-07-14 15:16:42 +02:00
Fakabbir Amin	8e4a82ad8b	Corrects Indentation	2019-07-13 05:00:25 +05:30
Fakabbir Amin	c022358c8d	Encapsulates character map name	2019-07-13 04:52:24 +05:30
Pieter Marsman	5d7ac7e88a	Added test for overflow error reported by @jtlz2: https://github.com/pdfminer/pdfminer.six/issues/177#issuecomment-510173228_	2019-07-10 20:44:23 +02:00
Pieter Marsman	ec5218a05f	Add some (failing) unittests for name2unicode based on the examples in the Adobe Glyph List Specification	2019-07-10 20:35:42 +02:00
Tata Ganesh	127d587431	Merge pull request #227 from BackEndTea/patch-1 Remove self refference on python3	2019-07-10 23:12:30 +05:30
Tata Ganesh	578464ab00	Merge pull request #259 from pietermarsman/258-add-contribution-guidelines Add contribution guidelines and refer to it in the readme	2019-07-10 23:10:25 +05:30
Pieter Marsman	5acfdd8f9b	Add sentence about including pdf's in issues	2019-07-10 19:38:38 +02:00
Andrew Baumann	6b312edd6a	Merge pull request #230 from 0xabu/unicode_glyph_bug name2unicode(): handle hexadecimal literals for unicode glyphs in text extraction Unicode literals are hex, not decimal (refer: https://github.com/adobe-type-tools/agl-specification). We are still far from conformance with the full spec, but this handles more PDFs seen in the wild.	2019-07-09 12:35:39 -07:00
Pieter Marsman	2743f2b20a	Add reference to contribution guidelines in README.md	2019-07-08 23:05:47 +02:00
Pieter Marsman	a03566da21	Add contribution guidelines	2019-07-08 23:03:47 +02:00
Andrew Baumann	c4c0a36e4f	name2unicode(): handle hexadecimal constants for unicode glyphs fixes #183, #229	2019-02-25 13:08:04 -08:00
John Kesegich	8ab2e287be	Handle PDFStream as character map name in PDFCIDFont	2019-02-25 11:42:30 -06:00
Gert de Pagter	594321fcf6	Remove self refference on python3 This is the 'six' repo, so no need to mention that again in the readme	2019-02-25 16:38:46 +01:00
Felix Schwarz	5ff84b83fb	use conditional requirements to ensure "chardet" listed as requirement on Python 3 (fixes #213 ) Previously "chardet" was added only added when setup.py was run with Python 3. However wheels contain a static list of requirements and a wheel-based install will never execute setup.py at installation time. pdfminer.six uses universal wheels for Python 2 and Python 3 so the requirements will always be wrong on one version (see #213). The solution is to use conditional requirements as specified in PEP 496 which are evaluated at installation time.	2019-01-18 11:24:51 +01:00
ganeshtata	b6a5848208	FEAT: Release 20181108	2018-11-08 22:37:11 +05:30
ganeshtata	90d61f2a3a	DOCS: changelog for version 20181108 added	2018-11-08 22:08:17 +05:30
Tata Ganesh	e03ecab856	Merge pull request #141 from timb07/speedup_layout Speed up layout of text boxes	2018-11-08 20:28:40 +05:30
James R. Barlow	2ede124142	Interpet font Descent as a negative number even if specified as positive The PDF RM specifies that Descent should be negative. Fonts that claim to have a positive Descent (not that it would make sense) always seem to be wrong about this claim.	2018-11-03 23:17:48 -07:00
Tata Ganesh	1ea9446bd6	Merge pull request #173 from wlbentley/pdf2txt-maketheparser Move argparse object setup out of pdf2txt.py main to separate function.	2018-08-21 23:40:50 +05:30
Wm Bentley	495c92e050	Move argparse object setup out of main to separate function. As preparation for implementing Sphinx documentation, create a separate function that builds and returns the argparse parser. Move import argparse out of main to the top of the file.	2018-08-12 21:07:52 -07:00
Tata Ganesh	259b29299e	Merge pull request #133 from timb07/speedup Speed up handling of PDFs with large images	2018-07-15 11:27:35 +05:30
Tata Ganesh	d437e5c637	Merge pull request #142 from mawoqiw/master Fixes needed to be able to compile pdfminer.six with Cython	2018-07-14 18:02:29 +05:30
Martin Wolf	edaf2c9e3f	move unittest to main()	2018-06-26 00:51:51 +02:00

1 2 3 4 5 ...

833 Commits (67e2d795918cbd22c0af5c91b5fecec322548cfb) All Branches Search

833 Commits (67e2d795918cbd22c0af5c91b5fecec322548cfb)

All Branches