Skip to content

Releases: ptmrio/autorename-pdf

AutoRename-PDF v3.0.1

01 Apr 13:03

Choose a tag to compare

AutoRename-PDF v3.0.1

PaddleOCR 3.4.0 Compatibility

  • Fixed: NameError: name 'en' is not defined during setup — PowerShell string quoting issue
  • Fixed: ValueError: Unknown argument: show_log — removed deprecated parameter
  • Fixed: Wrong recognition model loaded — now uses explicit model mapping per language/script family
  • Fixed: PermissionError on model files installed by admin — setup.ps1 now sets ACLs via locale-independent SID
  • Fixed: Excessive RAM usage — switched from PP-OCRv5_server to PP-OCRv5_mobile models

Config Changes

  • Simplified: paddleocr.languages (list) → paddleocr.language (string) — old config auto-migrates
  • Language codes now map to script-family models: en → English-only, de/fr/es/etc. → Latin (covers all 59 Latin-script languages including English), ch → Chinese+Japanese
  • See config.yaml.example for full documentation

Improvements

  • Real-time stderr streaming during PaddleOCR model downloads — no more silent waits
  • Setup.ps1: UAC elevation message, better error handling for model preload
  • GUI: Language setting is now a simple text field with hint

Testing

  • 354 tests passing (unit + live integration across OpenAI, Anthropic, Ollama)

v3.0.0

25 Mar 13:18

Choose a tag to compare

v3.0.0

Major Release: GUI, Multi-Provider AI, and Three-Tier Extraction

AutoRename-PDF is now a full desktop application with a Tauri-based GUI, support for 5 AI providers, and a three-tier extraction pipeline.

What's New

  • Desktop GUI — Native Windows app built with Tauri 2, featuring drag-and-drop, dark theme, and real-time rename previews
  • Multi-provider AI — Choose between OpenAI, Anthropic, Google Gemini, xAI (Grok), or local Ollama models
  • Three-tier extraction — Text extraction (pdfplumber), optional OCR (PaddleOCR), and optional vision mode (send page images to LLM) for scanned/image-heavy PDFs
  • Structured output — All providers use instructor + Pydantic for reliable, typed AI responses
  • CLI JSON mode--output json for scripting and automation
  • Environment variable support — Use ${VAR_NAME} syntax in config.yaml, with .env file auto-loading
  • Comprehensive test suite — 344 tests, 95% coverage, including live integration tests across providers

Breaking Changes

  • Config schema v2 — Old v1 configs (openai.api_key) must be migrated to the new flat ai.* structure. See config.yaml.example.
  • use_gpu renamed to device — PaddleOCR config now uses device: "auto" | "cpu" | "gpu" instead of use_gpu: true/false. Old configs are auto-migrated with a deprecation warning.
  • Python 3.11+ required

Bug Fixes

  • Fixed Anthropic vision mode sending OpenAI-format image_url blocks instead of native image blocks
  • Updated default and test models to current generation (GPT-5.4, Claude Haiku 4.5)

Distribution

  • Portable ZIP: GUI + CLI + setup script
  • Context menu integration via setup.ps1
  • Optional PaddleOCR install (~500MB) for free local OCR

v2.1.2

15 Sep 11:13

Choose a tag to compare

What's Changed

  • Add funding section to README with business value messaging by @Copilot in #10
  • Fixed setup.ps1

New Contributors

  • @Copilot made their first contribution in #10

Full Changelog: v2.1.1...v2.1.2

v2.1.1

12 Sep 16:41

Choose a tag to compare

Full Changelog: v2.1.0...v2.1.1

Fixed --dpi bug (halucinated parameter)

v2.1.0

10 Sep 16:48

Choose a tag to compare

What's Changed

  • Refactored to yaml for easier use (.env has some problems with cloud sync and yaml is more end user friendly)
  • NOTE: Use an online tool to convert json to yaml, eg: https://jsonformatter.org/json-to-yaml
  • added Private GPT ability by @FrankBie in #5
  • IMPORTANT; We will not allow any Pull requests from now on, please use Issues

New Contributors

Full Changelog: v2.0.1...v2.1.0

v2.0.1

17 Sep 10:04

Choose a tag to compare

Full Changelog: v2.0.0...v2.0.1

Much more internalization

How to update:

  • download the .zip file and extract all files over your previous installation
  • open the .env file
  • copy all new .env variables from .env.example to your .env file and adapt as needed
  • if you added/changed your OCR_LANGUAGES, re-run the setup.ps1 in administrator mode.

v2.0.0

06 Sep 10:53

Choose a tag to compare

Full Changelog: v1.1.2...v2.0.0

No more poppler, switched to ocrmypdf and pymupdf.

Way easier installation.

Please consider funding 🙏

For Installation see README

v1.1.2

18 Apr 17:51

Choose a tag to compare

Full Changelog: v1.1.1...v1.1.2

Easier install with requirements.txt

17 Apr 19:05

Choose a tag to compare

Harmonize company names

17 Apr 14:25

Choose a tag to compare

You can now harmonize company names by providing a harmonized-company-names.json files.