pyxpdf changelog

0.2.3 (2020-08-31)

Config: make Config.cfg_path public attribute
Document: add support for ~pathlib.Path for loading pdf
pyxpdf_data: add 35 base Postscript fonts from ghostscript

Bugs Fixed

Fix #9: segfault using ~pyxpdf.xpdf.Document.text()
Fix #8: add checks for file in Config.add_font_file()

0.2.2 (2020-07-03)

Config: add function to add missing fonts Config.add_font_file()
Introduce ~pyxpdf.xpdf.PDFImage to represent a PDF Image.
PDFImageOutput: ~pyxpdf.xpdf.PDFImageOutput.get returns ~pyxpdf.xpdf.PDFImage instead of Pillow Image

0.2.1 (2020-06-12)

Bugs Fixed

fix all direct memory leaks
Config: fix Config.text_encoding setter, encodings with lowercase characters were not able to set.
fix weird bytes encoding problem in python debug builds

0.2.0 (2020-06-11)

Python 2.7 support dropped
2 optional dependencies (Pillow, pyxpdf_data) introduced

New Features

Introduce (optional) package pyxpdf_data which add more encoding support.
API: add specialised classes for pdf outputs, PDFOuputDevice.
- TextOutput - For Text extraction
- RawImageOutput - Render PDF Page as Image
- PDFImageOutput - Extract images from PDF
Config: add new global settings:
- Config.anti_alias
- Config.enable_freetype
- Config.vector_anti_alias

Bugs Fixed

pdftotext: extracted text contains clipped text even when explictly discarding it.
Config: fix loading of external xdfrc with Config.load_file()

0.1.1 (2020-05-10)

FIX: default Config.text_encoding value i.e UTF-8 does not persist Config.reset() and changes to Latin1.
pdftotext: remove all parameters that change global ~pyxpdf.xpdf.Config properties.

0.1 (2020-04-20)

Initial stable release.