Releases: junhoyeo/BetterOCR

v1.2.0 (🔍 Pororo OCR)

02 Nov 01:34

BetterOCR

🔍 BetterOCR combines results from multiple OCR engines with an 🧠 LLM to correct & reconstruct the output.

Before / After (✨ latest at v1.2.0): [two screenshots comparing output before and after this release]

Notes

Pororo is used only if the specified language options (lang) include 🇺🇸 English (en) or 🇰🇷 Korean (ko). The additional dependencies listed in [tool.poetry.group.pororo.dependencies] must also be installed; if they are not, Pororo is automatically excluded from the enabled engines.
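
The gating rule in the note above can be sketched roughly as follows. This is a minimal illustration, not BetterOCR's actual code: the function names, the default engine list, and the `required_modules` parameter are assumptions, and the real dependency names live in the project's pyproject.toml.

```python
import importlib.util
from typing import Iterable, List

# Languages for which Pororo is considered, per the release note.
PORORO_LANGS = {"en", "ko"}


def pororo_enabled(lang: Iterable[str], required_modules: Iterable[str]) -> bool:
    """Return True only if a supported language is requested AND every
    optional dependency can be imported (hypothetical helper)."""
    if not PORORO_LANGS.intersection(lang):
        return False
    # find_spec returns None when a top-level module is not installed.
    return all(importlib.util.find_spec(name) is not None for name in required_modules)


def enabled_engines(lang: Iterable[str], required_modules: Iterable[str]) -> List[str]:
    """Build the engine list, silently skipping Pororo when it does not qualify."""
    engines = ["easyocr", "tesseract"]  # assumed always-on engines
    if pororo_enabled(list(lang), list(required_modules)):
        engines.append("pororo")
    return engines
```

Checking importability up front lets the optional engine drop out without raising an error, which matches the "automatically excluded" behaviour described in the note.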

What's Changed

New Contributors

  • @imgbot made their first contribution in #7

Full Changelog: v1.1.2...v1.2.0

v1.1.2 (🛠️ Bug Fix)

29 Oct 06:52

What's Changed

  • Fix Incorrect Inclusion of API_KEY in chat.completions.create Call by @snacsnoc in #3
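
As a rough illustration of the class of bug named in this item: with the openai Python client, the API key is normally supplied when the client is constructed, while chat.completions.create takes only request parameters such as the model and messages. The sketch below assumes the options arrive as one dict containing an `API_KEY` entry; the helper name and that dict layout are hypothetical and are not taken from the actual patch.

```python
from typing import Any, Dict, Tuple

# Keys that configure the client rather than an individual request (assumed name).
CLIENT_ONLY_KEYS = {"API_KEY"}


def split_openai_options(options: Dict[str, Any]) -> Tuple[Dict[str, Any], Dict[str, Any]]:
    """Separate client-level settings from per-request arguments."""
    client_kwargs: Dict[str, Any] = {}
    request_kwargs: Dict[str, Any] = {}
    for key, value in options.items():
        if key in CLIENT_ONLY_KEYS:
            client_kwargs["api_key"] = value  # belongs to the client constructor
        else:
            request_kwargs[key] = value  # safe to forward to the request call
    return client_kwargs, request_kwargs


# Intended use (not executed here, requires the openai package):
#   client_kwargs, request_kwargs = split_openai_options(options)
#   client = OpenAI(**client_kwargs)
#   client.chat.completions.create(messages=messages, **request_kwargs)
```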

New Contributors

Full Changelog: v1.1.1...v1.1.2

v1.1.1 (Bug Fixes for 📦 Box Detection)

29 Oct 05:59

What's Changed

  • Improved prompt in Box Detection
  • Fix bug inside detect_boxes's fallback logic (when LLM output format is invalid)
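
A fallback of the kind mentioned for detect_boxes might look like the sketch below. It assumes the LLM is asked for a JSON list of objects with `box` and `text` keys and that raw OCR boxes are available to fall back on; both the expected format and the helper name `parse_boxes` are assumptions, not the library's implementation.

```python
import json
from typing import Any, Dict, List


def parse_boxes(llm_output: str, fallback: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Return the boxes parsed from the LLM output, or `fallback` when the
    output is not valid JSON or does not have the expected shape."""
    try:
        parsed = json.loads(llm_output)
    except json.JSONDecodeError:
        return fallback  # output was not JSON at all
    if not isinstance(parsed, list):
        return fallback  # JSON, but not a list of boxes
    for item in parsed:
        if not (isinstance(item, dict) and "box" in item and "text" in item):
            return fallback  # an entry is missing a required key
    return parsed
```

Validating the whole structure before returning keeps a partially malformed response from leaking through to callers.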

Full Changelog: v1.1.0...v1.1.1

v1.1.0 (📦 Box Detection)

29 Oct 04:59

What's Changed

Original / Detected: [two images comparing the original input with the detected boxes]

Full Changelog: https://github.com/junhoyeo/BetterOCR/commits/v1.1.0

v1.0.1 (Initial Release 📖)

28 Oct 08:06
a47f842

BetterOCR

🔍 Better text detection by combining multiple OCR engines with 🧠 LLM.

OCR still sucks! ... Especially when you're from the other side of the world (and face a significant lack of training data in your language) — or just not thrilled with noisy results.

BetterOCR combines results from multiple OCR engines with an LLM to correct & reconstruct the output.

  • 🔍 OCR Engines: Currently supports EasyOCR and Tesseract.
  • 🧠 LLM: Supports Chat models from OpenAI.
  • 📒 Custom Context: Allows users to provide an optional context to use specific keywords such as proper nouns and product names. This assists in spelling correction and noise identification, ensuring accuracy even with rare or unconventional words.
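
To make the idea in the list above concrete, here is a minimal sketch of how per-engine OCR results and an optional custom context could be merged into one prompt for the LLM. The function name, the prompt wording, and the bracketed section labels are all assumptions for illustration; BetterOCR's real prompt is not shown in these notes.

```python
from typing import Dict, Optional


def build_prompt(results: Dict[str, str], context: Optional[str] = None) -> str:
    """Assemble one prompt from each engine's raw text plus optional context."""
    lines = ["Combine and correct the OCR results below. Return only the corrected text."]
    for engine, text in results.items():
        lines.append(f"[{engine}]")  # label each engine's output
        lines.append(text)
    if context:
        # Proper nouns or product names that help with spelling correction.
        lines.append("[context]")
        lines.append(context)
    return "\n".join(lines)


prompt = build_prompt(
    {"easyocr": "BetterOCR v1.O.1", "tesseract": "Better0CR v1.0.1"},
    context="BetterOCR",
)
```

The resulting string would then be sent as a chat message to the model; that call is left out here because it depends on the client setup.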

Head over to 💯 Examples to view performance by language (🇺🇸, 🇰🇷, 🇮🇳).

Coming Soon: improved interface, async support, box detection, and more.