v0.3.0

@sanjay920 released this 12 Nov 01:50

GPTParse v0.3.0

New Features

  • Added OCR mode for direct text extraction from PDFs and images
    • Supports PDF, PNG, JPG/JPEG files
    • Fast local processing without requiring AI services
    • Optional abort-on-error flag for better error handling
  • Enhanced CLI with four distinct processing modes:
    • Vision mode (AI-powered)
    • Fast mode (local processing)
    • Hybrid mode (combined approach)
    • OCR mode (direct text extraction)

Improvements

  • Added support for processing image files (PNG, JPG/JPEG) in OCR mode
  • Enhanced error handling and reporting
  • Improved documentation with comprehensive examples for all modes

Technical Details

  • Introduced new DoclingHandler for OCR processing
  • Updated the CLI to support OCR commands and options
  • Added abort-on-error functionality for OCR processing

Usage

# New OCR mode examples
gptparse ocr document.pdf --output_file output.md
gptparse ocr scan.png --output_file output.md
gptparse ocr document.pdf --output_file output.md --abort-on-error
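
To run OCR mode over a whole folder, the commands above can be wrapped in a small shell helper. This is only a sketch: the function names `ocr_output_path` and `ocr_batch` and the `GPTPARSE_BIN` variable are hypothetical and not part of GPTParse. It uses only the `ocr` subcommand and the `--output_file` and `--abort-on-error` options shown above, and it assumes that `gptparse` exits with a non-zero status when processing fails.

```shell
#!/usr/bin/env bash

# Hypothetical helper: derive the Markdown output path for an input file,
# e.g. scans/page1.png with output directory out -> out/page1.md
ocr_output_path() {
  local input="$1" out_dir="$2"
  local base
  base="$(basename "$input")"
  printf '%s/%s.md\n' "$out_dir" "${base%.*}"
}

# Hypothetical helper: run OCR mode on every PDF, PNG, JPG and JPEG file
# in a directory, writing one Markdown file per input.
# GPTPARSE_BIN is an assumption of this sketch; it lets the command be
# swapped out (for example with `echo` for a dry run).
ocr_batch() {
  local in_dir="$1" out_dir="$2"
  local bin="${GPTPARSE_BIN:-gptparse}"
  local f
  mkdir -p "$out_dir"
  for f in "$in_dir"/*.pdf "$in_dir"/*.png "$in_dir"/*.jpg "$in_dir"/*.jpeg; do
    [ -e "$f" ] || continue  # skip glob patterns that matched nothing
    # Stop the batch at the first failure (assumes a non-zero exit status on error).
    "$bin" ocr "$f" --output_file "$(ocr_output_path "$f" "$out_dir")" --abort-on-error || return 1
  done
}
```

After sourcing the script, `ocr_batch ./scans ./out` would process each supported file in `./scans`, and `GPTPARSE_BIN=echo ocr_batch ./scans ./out` prints the commands instead of running them.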

For full documentation and examples, please see the README.md.