GPTParse v0.3.0
New Features
- Added OCR mode for direct text extraction from PDFs and images
- Supports PDF, PNG, JPG/JPEG files
- Fast local processing without requiring AI services
- Optional abort-on-error flag for better error handling
- Enhanced CLI interface with four distinct processing modes:
- Vision mode (AI-powered)
- Fast mode (local processing)
- Hybrid mode (combined approach)
- OCR mode (direct text extraction)
Improvements
- Added support for processing image files (PNG, JPG/JPEG) in OCR mode
- Enhanced error handling and reporting
- Improved documentation with comprehensive examples for all modes
Technical Details
- Introduced new DoclingHandler for OCR processing
- Updated CLI interface to support OCR commands and options
- Added abort-on-error functionality for OCR processing
Usage
# New OCR mode examples
gptparse ocr document.pdf --output_file output.md
gptparse ocr scan.png --output_file output.md
gptparse ocr document.pdf --output_file output.md --abort-on-error
For full documentation and examples, please see the README.md.