Skip to content

explosion/prodigy-pdf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

46 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

📄 Prodigy-PDF

This repository contains a Prodigy plugin for recipes that annotating PDF files. At the moment it features a recipe that can turn the PDF into an image, which can then be annotated using the familiar image_manual interface. Here's a preview of the interface:

You can install this plugin via pip.

pip install "prodigy-pdf @ git+https://github.com/explosion/prodigy-pdf"

If you want to use the OCR recipes, you'll also want to ensure that tesseract is installed.

# for mac 
brew install tesseract

# for ubuntu
sudo apt install tesseract-ocr

To learn more about this plugin, you can check the Prodigy docs.

Issues?

Are you have trouble with this plugin? Let us know on our support forum and we'll get back to you!