All notable changes to the OCD (Organized Content Directory) project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
0.1.0 - 2025-08-03
- Initial Release 🎉
- Offline SLM Models: Specialized Small Language Models for file organization
- FileClassifierSLM: Intelligent file type and category detection
- SimilarityDetectorSLM: Duplicate and similar file identification
- Resource management with lazy loading and auto-unloading
- Memory optimization with quantization support (INT8/INT4)
- Smart File Organization: AI-powered file categorization and structuring
- 8+ file categories: Documents, Images, Code, Videos, Audio, Data, Archives, Other
- Multiple organization strategies: by_type, by_date, smart
- Pattern recognition for naming conventions
- Duplicate detection and handling
- Nested file extraction from subfolders
- LangChain Agent Architecture: Autonomous file organization agents
- OrganizationAgent: Natural language file organization
- NamingAgent: Intelligent file renaming
- CleanupAgent: Smart cleanup operations
- Safe file operation tools with validation and rollback
- CLI Interface: Modern command-line interface with rich output
ocd analyze: Directory analysis with offline AIocd organize: Intelligent file organization- Multiple modes: local-only, hybrid processing
- Natural language task support
- Dry-run mode for safe preview
- Safety & Security Features:
- Comprehensive dry-run mode
- File operation validation and rollback
- Safe operation logging and audit trail
- Privacy-first local processing option
- Cross-platform path handling
- Installation System: Zero-friction setup and dependency management
- Single-command installation:
python install.py - Automatic virtual environment creation
- Cross-platform dependency handling
- Optional development dependencies
- Single-command installation:
- Provider System: Extensible AI provider architecture
- Local SLM provider with specialized models
- Fallback system for reliability
- Resource monitoring and management
- Error handling and recovery
- Documentation: Comprehensive user and developer documentation
- User Guide with real-world examples
- Architecture documentation
- LangChain agent usage guide
- Contributing guidelines
- API reference
- Performance: Processes 45+ files in under 3 seconds
- Memory Usage: ~450MB for SLM models with automatic management
- Accuracy: 100% correct file categorization in testing
- Privacy: Complete local processing capability
- Cross-Platform: Tested on macOS, ready for Windows/Linux
- Python 3.9+
- PyTorch 2.0+ for SLM models
- Transformers 4.30+ for AI models
- Sentence-transformers 2.2+ for similarity detection
- LangChain 0.1+ for agent system
- Rich 13.0+ for CLI interface
- Additional dependencies automatically installed
- Manual testing with 45+ files of various types
- Successful organization across all file categories
- Duplicate detection and handling verified
- Cross-platform path handling tested
- Resource management and cleanup verified