School/College Stationery List OCR and Parsing
Updated Apr 3, 2017 - C++
Python tool for converting PDF files to text. Simplify your document processing tasks.
Minimize the time required for audit report analysis with a containerized file conversion and scraping system.
Apply keyword procedures in a given Racket namespace using X-expressions.
AI-powered chatbot designed to simplify the job search process
An implementation of basic IR techniques from scratch.
ClearCouncil: Automated tools for collecting, organizing, and embedding publicly available local state county council documents (minutes, agendas) into LLMs. Python, JS, and wget scripts included for easy data retrieval and integration.
Program that helps remove watermarks from a PDF document.
Convert scans of handwritten notes to PDF.
This set of robots provides support for automatically obtaining information from invoices using the docDigitizer API and keeping track of the processed invoices in an Airtable repository.
A document preprocessor that works in conjunction with tools like groff/troff & refer.
Pdf2xNet is a .NET library for seamless integration with Xpdf tools, enabling easy conversion of PDF documents to text, images, and HTML formats within your .NET applications.
Merge two relatively incomplete documents into one complete document via a Python script.
Spire.Doc for C++ is a professional Word C++ library designed for developers to create, read, write, convert, merge, split, and compare Word documents on any C++ platform with fast, high-quality performance.
TU Dublin Computer Science MSc. Final Project Group 3 - Accessibilator
Generative intent detection with Magick
Use data from MongoDB in LangChain, Llama and OpenAI
FileGazer - deep file analysis and categorisation
Text line detection for Urdu OCR (UTRNet)