PDF Toolkit (Local Windows CLI or Obsidian Plugin)

Small, local, lightweight PDF utility CLI for Windows. Everything runs offline once dependencies are installed. An Obsidian plugin front end is available at https://github.com/duck-lint/pdf-toolkit-obsidian-plugin

Features:

Render PDF pages to PNGs (PyMuPDF)
Split a PDF into multiple PDFs
Rotate PDF pages or rotate PNGs (Pillow)
Split spread scans into single-page images and crop page bounds (Pillow)
Safe defaults: --dry-run previews a command without writing files, and overwriting existing files requires --overwrite
JSON manifest written for each command

Install (Windows)

Create and activate a virtual environment (optional but recommended):

python -m venv .venv
.venv\Scripts\Activate.ps1

Install dependencies:

pip install -r requirements.txt

Install this package in editable mode so python -m pdf-toolkit works:

pip install -e .

If you prefer not to install it, you can temporarily set PYTHONPATH:

$env:PYTHONPATH = "src"

Usage

See all commands:

python -m pdf-toolkit --help

Render PDF to PNG

python -m pdf-toolkit render --pdf "in.pdf" --out_dir "out\pages" --dpi 300 --format png --prefix "book1"

Dry-run (no files written):

python -m pdf-toolkit render --pdf "in.pdf" --out_dir "out\pages" --pages "1-10,15" --dry-run

Output naming is predictable: book1_p0001.png, book1_p0002.png, etc.
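
The naming pattern above can be sketched in a few lines of Python. This is only an illustration: the helper name page_filename is hypothetical, and the four-digit zero padding is inferred from the example names rather than taken from the toolkit's source.

```python
def page_filename(prefix: str, page_number: int, ext: str = "png") -> str:
    """Build a name like book1_p0001.png from a 1-based page number.

    Assumption: the page number is zero-padded to four digits, as the
    example file names suggest.
    """
    if page_number < 1:
        raise ValueError("page numbers are 1-based")
    return f"{prefix}_p{page_number:04d}.{ext}"


if __name__ == "__main__":
    for n in (1, 2, 15):
        print(page_filename("book1", n))
```

Zero-padded names sort correctly in a plain alphabetical listing, which is convenient when a later step globs the folder with "*.png".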

Split PDF

Explicit ranges:

python -m pdf-toolkit split --pdf "in.pdf" --out_dir "out\splits" --ranges "1-120,121-240" --prefix "book"

Automatic chunking:

python -m pdf-toolkit split --pdf "in.pdf" --out_dir "out\splits" --pages_per_file 120 --prefix "book"

Outputs: book_part01.pdf, book_part02.pdf, etc.
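
To show how --pages_per_file could map onto output files, here is a minimal sketch of the chunking arithmetic. The function names chunk_ranges and part_filename are hypothetical, and the two-digit part numbering is inferred from the example outputs; the toolkit's actual implementation may differ.

```python
from __future__ import annotations


def chunk_ranges(total_pages: int, pages_per_file: int) -> list[tuple[int, int]]:
    """Return inclusive, 1-based (start, end) page ranges of at most
    pages_per_file pages each. The last range may be shorter."""
    if total_pages < 0 or pages_per_file < 1:
        raise ValueError("total_pages must be >= 0 and pages_per_file >= 1")
    return [
        (start, min(start + pages_per_file - 1, total_pages))
        for start in range(1, total_pages + 1, pages_per_file)
    ]


def part_filename(prefix: str, part_index: int) -> str:
    """Assumed naming: 1-based part index, zero-padded to two digits."""
    return f"{prefix}_part{part_index:02d}.pdf"


if __name__ == "__main__":
    for i, (start, end) in enumerate(chunk_ranges(250, 120), start=1):
        print(part_filename("book", i), f"pages {start}-{end}")
```

For a 250-page input and 120 pages per file, this yields the ranges 1-120, 121-240, and 241-250.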

Rotate PDF pages

python -m pdf-toolkit rotate pdf --pdf "in.pdf" --out_pdf "in_rotated.pdf" --degrees 90 --pages "all"

In-place (overwrites input):

python -m pdf-toolkit rotate pdf --pdf "in.pdf" --out_pdf "in.pdf" --degrees 180 --pages "1-5" --inplace --overwrite

Rotate PNGs in a folder

python -m pdf-toolkit rotate images --in_dir "out\pages" --glob "*.png" --degrees 90 --out_dir "out\pages_rot"

In-place (overwrites files):

python -m pdf-toolkit rotate images --in_dir "out\pages" --glob "*.png" --degrees 90 --out_dir "out\pages" --inplace --overwrite

Page-images (split spreads + crop)

Auto mode (split if wide enough, otherwise crop-only):

python -m pdf-toolkit page-images --in_dir "out\pages" --out_dir "out\pages_single" --glob "*.png" --mode auto --debug
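
As a rough illustration of what "wide enough" might mean, the sketch below compares an image's width-to-height ratio against a threshold. This is an assumption, not the toolkit's confirmed logic: the helper choose_mode is hypothetical, and treating split_ratio (1.25 in the YAML examples) as a width/height threshold is a guess at its meaning.

```python
def choose_mode(width: int, height: int, split_ratio: float = 1.25) -> str:
    """Return "split" for images that look like two-page spreads,
    otherwise "crop".

    Assumption: an image counts as a spread when width / height is at
    least split_ratio.
    """
    if width <= 0 or height <= 0:
        raise ValueError("image dimensions must be positive")
    return "split" if width / height >= split_ratio else "crop"


if __name__ == "__main__":
    print(choose_mode(3400, 2200))  # landscape scan, likely a spread
    print(choose_mode(1700, 2200))  # portrait scan, likely a single page
```

Running with --debug, as in the command above, is the better way to see what the tool actually decided for each image.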

Always split:

python -m pdf-toolkit page-images --in_dir "out\pages" --out_dir "out\pages_single" --mode split --overwrite

Never split (crop-only):

python -m pdf-toolkit page-images --in_dir "out\pages" --out_dir "out\pages_single" --mode crop

Page-images YAML config

Dump the default YAML config:

python -m pdf-toolkit page-images --dump-default-config

Use a config file:

python -m pdf-toolkit page-images --in_dir "out\pages" --out_dir "out\pages_single" --config "configs\page_images.default.yaml"

Precedence is deterministic: built-in defaults < YAML config < explicitly provided CLI flags.

This means optional CLI defaults do not overwrite YAML values unless the flag is explicitly passed.
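
The precedence rule can be sketched as a three-layer merge. This is a minimal illustration under stated assumptions: resolve_options is a hypothetical name, the default values are copied from the YAML examples in this README purely for illustration, and "flag not passed" is modeled as None (a common argparse pattern), which may not be how the toolkit tracks it.

```python
from __future__ import annotations

from typing import Any, Mapping

# Illustrative values only, borrowed from the YAML examples in this README.
BUILTIN_DEFAULTS: dict[str, Any] = {
    "mode": "auto",
    "split_ratio": 1.25,
    "crop_threshold": 180,
    "pad_px": 20,
}


def resolve_options(
    defaults: Mapping[str, Any],
    yaml_config: Mapping[str, Any],
    cli_flags: Mapping[str, Any],
) -> dict[str, Any]:
    """Merge options so that defaults < YAML config < explicit CLI flags.

    Assumption: a CLI flag the user did not pass is represented as None,
    so it never overrides a YAML value.
    """
    merged = dict(defaults)
    merged.update(yaml_config)
    merged.update({k: v for k, v in cli_flags.items() if v is not None})
    return merged


if __name__ == "__main__":
    yaml_config = {"pad_px": 40}
    cli_flags = {"mode": "split", "pad_px": None}  # only --mode was passed
    print(resolve_options(BUILTIN_DEFAULTS, yaml_config, cli_flags))
```

Here pad_px comes from the YAML file (40), mode comes from the command line (split), and everything else falls back to the defaults.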

Supported YAML shapes:

Root form:

mode: auto
split_ratio: 1.25
crop_threshold: 180
pad_px: 20

Wrapped form:

page_images:
  mode: auto
  split_ratio: 1.25
  gutter_search_frac: 0.35
  crop_threshold: 180
  min_area_frac: 0.25
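
Both shapes can be normalized to the same flat mapping once the YAML is parsed. The sketch below works on an already-parsed dict so it needs no YAML library; the helper name unwrap_config is hypothetical and the toolkit's own loader may behave differently.

```python
from __future__ import annotations

from typing import Any


def unwrap_config(data: dict[str, Any]) -> dict[str, Any]:
    """Accept either the root form or the wrapped form and return the
    flat option mapping.

    Assumption: the wrapped form is detected by a top-level
    "page_images" key whose value is a mapping.
    """
    section = data.get("page_images")
    if isinstance(section, dict):
        return dict(section)
    return dict(data)


if __name__ == "__main__":
    root_form = {"mode": "auto", "split_ratio": 1.25}
    wrapped_form = {"page_images": {"mode": "auto", "split_ratio": 1.25}}
    print(unwrap_config(root_form) == unwrap_config(wrapped_form))
```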

Recommended pipeline: render -> page-images

Page selection format

Pages are 1-based for user input:

all
1-10
1-10,15,20-25
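
A page selection string in this format can be parsed roughly as follows. This is a sketch, not the toolkit's parser: parse_pages is a hypothetical name, and converting to 0-based indices is an assumption based on PyMuPDF indexing pages from zero.

```python
from __future__ import annotations


def parse_pages(spec: str, total_pages: int) -> list[int]:
    """Convert a 1-based selection such as "1-10,15" into 0-based
    page indices, in the order given. Duplicates are not removed."""
    spec = spec.strip().lower()
    if spec == "all":
        return list(range(total_pages))

    indices: list[int] = []
    for part in spec.split(","):
        part = part.strip()
        if not part:
            raise ValueError("empty entry in page selection")
        start_text, sep, end_text = part.partition("-")
        start = int(start_text)
        end = int(end_text) if sep else start
        if not 1 <= start <= end <= total_pages:
            raise ValueError(f"page range out of bounds: {part!r}")
        # Inclusive 1-based range start..end becomes 0-based start-1..end-1.
        indices.extend(range(start - 1, end))
    return indices


if __name__ == "__main__":
    print(parse_pages("1-3,5", total_pages=10))
```

With a 10-page document, "1-3,5" selects the indices 0, 1, 2, and 4.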

Manifest output

Each command writes a JSON manifest describing:

Inputs, outputs, options
Actions taken (written, skipped, dry-run)
Timestamps

page-images action outputs list the written files, plus split/crop metadata (gutter_x, bboxes, spread detection notes). Example list of written files for one split spread:

["out/pages_single/book_p0001_L.png", "out/pages_single/book_p0001_R.png"]

By default the manifest is written to:

Render: out_dir\manifest.json
Split: out_dir\manifest.json
Rotate PDF: out_pdf folder\manifest.json
Rotate images: out_dir\manifest.json
Page-images: out_dir\manifest.json

--dry-run skips writing the manifest (it is treated like an output file).

Testing (lightweight)

Run the minimal unit tests:

python -m unittest discover -s tests -p "test_*.py"

Create a clean zip

Prefer creating release zips from git history:

git archive --format=zip --output pdf-toolkit.zip HEAD

If you zip manually, delete __pycache__/ folders and *.pyc files first.
