- Config: make Config.cfg_path a public attribute
- Document: add support for pathlib.Path when loading a PDF
- pyxpdf_data: add 35 base PostScript fonts from Ghostscript
Bugs Fixed
- Fix #9: segfault when using pyxpdf.xpdf.Document.text()
- Fix #8: add checks for the file in Config.add_font_file()
- Config: add Config.add_font_file(), a function for adding missing fonts
- Introduce pyxpdf.xpdf.PDFImage to represent a PDF image.
- PDFImageOutput: pyxpdf.xpdf.PDFImageOutput.get returns pyxpdf.xpdf.PDFImage instead of a Pillow Image
Bugs Fixed
- Fix all direct memory leaks
- Config: fix the Config.text_encoding setter; encodings with lowercase characters could not be set.
- Fix weird bytes encoding problem in Python debug builds
- Python 2.7 support dropped
- 2 optional dependencies (Pillow, pyxpdf_data) introduced
New Features
- Introduce the (optional) package pyxpdf_data, which adds more encoding support.
- API: add specialised classes for PDF outputs, PDFOutputDevice.
  - TextOutput - for text extraction
  - RawImageOutput - render a PDF page as an image
  - PDFImageOutput - extract images from a PDF
- Config: add new global settings:
  - Config.anti_alias
  - Config.enable_freetype
  - Config.vector_anti_alias
Bugs Fixed
- pdftotext: extracted text contains clipped text even when explicitly discarding it.
- Config: fix loading of an external xpdfrc with Config.load_file()
- FIX: the default Config.text_encoding value, i.e. UTF-8, does not persist after Config.reset() and changes to Latin1.
- pdftotext: remove all parameters that change global pyxpdf.xpdf.Config properties.
Initial stable release.