Image-to-Text Dilineation Extraction Dashboard

This program is a note-taking tool designed to extract text from images, select a desired portion of the text, and produce a CSV with the extracted portion and a categorical summarization of the text. This tool is particularly useful for digitizing handwritten notes and organizing them using platforms such as Obsidian.

This tool was made to show the capabilities of combing pre-trained models available through different providers (Google and OpenAI), and expand the possible use cases. You can easily create your own interactive dashboard and automate tedious processes such as uploading a ton of images. Potential upgrades are detecting highlighted words, separating handwritten and printed notes, detecting underlined or bold words, and much more.

Features

Extract text from images using Google Cloud Vision API
Select desired portion of the text by specifying start and end symbols
Produce a CSV file containing extracted text and its categorical summarization
Upload multiple images at once

Dependencies

Python 3.7 or later
Google Cloud Vision API
OpenAI API
Dash
Dash Bootstrap Components
Dash Core Components
Dash HTML Components
Dash Table
Pandas

Installation

Install the required Python packages:

pip install google-cloud-vision openai dash dash-bootstrap-components pandas

Make sure to have a valid API key for both Google Cloud Vision and OpenAI, and replace 'your-api-key' in the code with your OpenAI API key.
Run the application:

python app.py

Open a web browser and navigate to http://127.0.0.1:8050/ to access the Image Text Extraction Dashboard.

Usage

Drag and drop or select images containing text.
Enter the start and end symbols (optional) to extract a specific portion of the text.
Click the "Extract Text from Images" button.
The extracted text and its categorical summarization will be displayed in a table.
Click the "Download CSV" link to download the CSV file containing the extracted text and categories.

Note: The extraction of text and categories might take a while depending on the number and complexity of the images. Please be patient.

Troubleshooting

Make sure you have a valid API key for both Google Cloud Vision and OpenAI.
Ensure that you have the required Python packages installed and up to date.
If you're having issues with the extraction process, double-check the image quality and text legibility.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
main.py		main.py
noteswithdelineator.jpg		noteswithdelineator.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image-to-Text Dilineation Extraction Dashboard

Features

Dependencies

Installation

Usage

Troubleshooting

About

Releases

Packages

Languages

branisk/ImageToText-Dilineation-Extraction

Folders and files

Latest commit

History

Repository files navigation

Image-to-Text Dilineation Extraction Dashboard

Features

Dependencies

Installation

Usage

Troubleshooting

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages