GPTParse v0.3.0

New Features

Added OCR mode for direct text extraction from PDFs and images
- Supports PDF, PNG, JPG/JPEG files
- Fast local processing without requiring AI services
- Optional abort-on-error flag for better error handling
Enhanced CLI interface with four distinct processing modes:
- Vision mode (AI-powered)
- Fast mode (local processing)
- Hybrid mode (combined approach)
- OCR mode (direct text extraction)

Improvements

Added support for processing image files (PNG, JPG/JPEG) in OCR mode
Enhanced error handling and reporting
Improved documentation with comprehensive examples for all modes

Technical Details

Introduced new DoclingHandler for OCR processing
Updated CLI interface to support OCR commands and options
Added abort-on-error functionality for OCR processing

Usage

# New OCR mode examples
gptparse ocr document.pdf --output_file output.md
gptparse ocr scan.png --output_file output.md
gptparse ocr document.pdf --output_file output.md --abort-on-error

For full documentation and examples, please see the README.md.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v0.3.0

GPTParse v0.3.0

New Features

Improvements

Technical Details

Usage

Uh oh!