Releases: ptmrio/autorename-pdf
Releases · ptmrio/autorename-pdf
AutoRename-PDF v3.0.1
AutoRename-PDF v3.0.1
PaddleOCR 3.4.0 Compatibility
- Fixed:
NameError: name 'en' is not definedduring setup — PowerShell string quoting issue - Fixed:
ValueError: Unknown argument: show_log— removed deprecated parameter - Fixed: Wrong recognition model loaded — now uses explicit model mapping per language/script family
- Fixed:
PermissionErroron model files installed by admin — setup.ps1 now sets ACLs via locale-independent SID - Fixed: Excessive RAM usage — switched from PP-OCRv5_server to PP-OCRv5_mobile models
Config Changes
- Simplified:
paddleocr.languages(list) →paddleocr.language(string) — old config auto-migrates - Language codes now map to script-family models:
en→ English-only,de/fr/es/etc. → Latin (covers all 59 Latin-script languages including English),ch→ Chinese+Japanese - See
config.yaml.examplefor full documentation
Improvements
- Real-time stderr streaming during PaddleOCR model downloads — no more silent waits
- Setup.ps1: UAC elevation message, better error handling for model preload
- GUI: Language setting is now a simple text field with hint
Testing
- 354 tests passing (unit + live integration across OpenAI, Anthropic, Ollama)
v3.0.0
v3.0.0
Major Release: GUI, Multi-Provider AI, and Three-Tier Extraction
AutoRename-PDF is now a full desktop application with a Tauri-based GUI, support for 5 AI providers, and a three-tier extraction pipeline.
What's New
- Desktop GUI — Native Windows app built with Tauri 2, featuring drag-and-drop, dark theme, and real-time rename previews
- Multi-provider AI — Choose between OpenAI, Anthropic, Google Gemini, xAI (Grok), or local Ollama models
- Three-tier extraction — Text extraction (pdfplumber), optional OCR (PaddleOCR), and optional vision mode (send page images to LLM) for scanned/image-heavy PDFs
- Structured output — All providers use
instructor+ Pydantic for reliable, typed AI responses - CLI JSON mode —
--output jsonfor scripting and automation - Environment variable support — Use
${VAR_NAME}syntax in config.yaml, with.envfile auto-loading - Comprehensive test suite — 344 tests, 95% coverage, including live integration tests across providers
Breaking Changes
- Config schema v2 — Old v1 configs (
openai.api_key) must be migrated to the new flatai.*structure. Seeconfig.yaml.example. use_gpurenamed todevice— PaddleOCR config now usesdevice: "auto" | "cpu" | "gpu"instead ofuse_gpu: true/false. Old configs are auto-migrated with a deprecation warning.- Python 3.11+ required
Bug Fixes
- Fixed Anthropic vision mode sending OpenAI-format
image_urlblocks instead of nativeimageblocks - Updated default and test models to current generation (GPT-5.4, Claude Haiku 4.5)
Distribution
- Portable ZIP: GUI + CLI + setup script
- Context menu integration via
setup.ps1 - Optional PaddleOCR install (~500MB) for free local OCR
v2.1.2
v2.1.1
Full Changelog: v2.1.0...v2.1.1
Fixed --dpi bug (halucinated parameter)
v2.1.0
What's Changed
- Refactored to yaml for easier use (.env has some problems with cloud sync and yaml is more end user friendly)
- NOTE: Use an online tool to convert json to yaml, eg: https://jsonformatter.org/json-to-yaml
- added Private GPT ability by @FrankBie in #5
- IMPORTANT; We will not allow any Pull requests from now on, please use Issues
New Contributors
Full Changelog: v2.0.1...v2.1.0
v2.0.1
Full Changelog: v2.0.0...v2.0.1
Much more internalization
How to update:
- download the .zip file and extract all files over your previous installation
- open the
.envfile - copy all new .env variables from
.env.exampleto your.envfile and adapt as needed - if you added/changed your
OCR_LANGUAGES, re-run the setup.ps1 in administrator mode.
v2.0.0
v1.1.2
Full Changelog: v1.1.1...v1.1.2
Easier install with requirements.txt
Full Changelog: v1.1.0...v1.1.1
Harmonize company names
You can now harmonize company names by providing a harmonized-company-names.json files.