A simple web interface that extracts text from images and PDF files using the AWS Textract service

deepeshdm/Textract-Web-App

🗃 Extract Text from Images & PDFs using AWS Textract Service


To Run (Locally)

  1. Clone the project repository to your local system.
git clone https://github.com/deepeshdm/Textract-Web-App.git
  2. Install the dependencies listed in requirements.txt.
pip install -r requirements.txt
  3. In your AWS account, create an IAM user and attach the policies that give this user full access to the AWS S3 and Textract services.

  4. Generate access keys for this IAM user in your AWS account and put them in a .env file inside the repo.

AWS_ACCESS_KEY_ID=xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
AWS_SECRET_ACCESS_KEY=xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
AWS_REGION=ap-south-1
  5. Start the Streamlit server on localhost.
streamlit run app.py
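
Before starting the server, it can help to confirm that the .env file is complete and that the credentials reach Textract. The following is a minimal sketch, not the code in app.py: the helper names (load_env_file, missing_keys, extract_lines, detect_text) are hypothetical, the .env parser handles only plain KEY=VALUE lines, and it assumes boto3 is installed. Only the three variable names come from the steps above.

```python
from pathlib import Path

# The three variables listed in the .env step above.
REQUIRED_KEYS = ("AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY", "AWS_REGION")


def load_env_file(path=".env"):
    """Parse simple KEY=VALUE lines from a .env file into a dict (hypothetical helper)."""
    values = {}
    env_path = Path(path)
    if not env_path.exists():
        return values
    for raw_line in env_path.read_text().splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and anything that is not KEY=VALUE.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values):
    """Return the required variables that are absent or empty."""
    return [key for key in REQUIRED_KEYS if not values.get(key)]


def extract_lines(response):
    """Collect the text of LINE blocks from a DetectDocumentText response dict."""
    return [
        block["Text"]
        for block in response.get("Blocks", [])
        if block.get("BlockType") == "LINE"
    ]


def detect_text(image_bytes, values):
    """Send image bytes to Textract and return the detected lines (needs boto3 and valid keys)."""
    import boto3  # third-party; imported here so the helpers above work without it

    client = boto3.client(
        "textract",
        aws_access_key_id=values["AWS_ACCESS_KEY_ID"],
        aws_secret_access_key=values["AWS_SECRET_ACCESS_KEY"],
        region_name=values["AWS_REGION"],
    )
    response = client.detect_document_text(Document={"Bytes": image_bytes})
    return extract_lines(response)


if __name__ == "__main__":
    env = load_env_file()
    absent = missing_keys(env)
    if absent:
        print("Missing in .env:", ", ".join(absent))
    else:
        print("All required AWS variables are set.")
```

Running the script from the repo root reports any variable missing from .env. Calling detect_text with the bytes of a small PNG or JPEG is one way to check that the IAM user's keys and Textract permissions work before using the web interface.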
