PDFextract

PDFextract is a convenient CLI-wrapper for pdftk which enables the user to easily extract multiple pages (consecutively or discontinuous) from a given source PDF file. PDFextract saves the extracted artifacts as individual target PDF files (one for each page range) or combines them into a single target PDF.

Dependencies

pdftk must be installed on your system, otherwise PDFextract will fail to execute.

Installation

Clone the repository, then install the script by executing

$ python setup.py install

Examples

Extract pages from a single, continuous page range (pages 3 to 5) from source.pdf and save the output to target.pdf.
```
pdfextract source.pdf target.pdf 3-5
```
Extract pages from discontinuous page ranges (pages 3 to 5 and 7 to 12) from source.pdf and save the output to target.pdf. This will automatically yield several target PDFs, each suffixed with the respective page range.
```
pdfextract source.pdf target.pdf 3-5,7-12
```
Extract pages from discontinuous page ranges (pages 3 to 5 and 7 to 12) from source.pdf and save the output to a single target.pdf.
```
pdfextract source.pdf target.pdf 3-5,7-12 --join
```

License

This script is released under the MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
pdfextract		pdfextract
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
pdfextract-runner.py		pdfextract-runner.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pdfextract

pdfextract

LICENSE

LICENSE

MANIFEST.in

MANIFEST.in

README.md

README.md

pdfextract-runner.py

pdfextract-runner.py

setup.py

setup.py

Repository files navigation

PDFextract

Dependencies

Installation

Examples

License

About

Releases

Packages

Languages

License

mguenther/pdfextract

Folders and files

Latest commit

History

Repository files navigation

PDFextract

Dependencies

Installation

Examples

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages