ChatPDF

ChatPDF is an interactive application that allows users to upload PDF documents and engage in a question-answering session about the content of the document. It uses advanced natural language processing techniques to provide accurate responses and highlight relevant sections in the PDF.

Features

PDF document upload
Question-answering based on the document content
PDF preview with highlighted excerpts
Navigation through PDF pages

Technologies Used

Streamlit: For the web application interface
LangChain: For document processing and question-answering
FAISS: For efficient similarity search and retrieval
Groq: For the language model
Ollama: For text embeddings
PyMuPDF: For PDF processing

Setup and Installation

Clone the repository:

git clone https://github.com/imanoop7/Highliting-PDF-on-UI-using-Streamlit-for-RAG
cd chatpdf

Create a virtual environment and activate it:

python -m venv .venv
source .venv/bin/activate  # On Windows, use `.venv\Scripts\activate`

Install the required packages:
```
pip install -r requirements.txt
```
Set up environment variables: Create a .env file in the root directory and add your Groq API key:
```
GROQ_API_KEY=your_groq_api_key_here
```
Ensure Ollama is installed and running with the required model:
```
ollama pull nomic-embed-text
```

Running the Application

To run the application, use the following command:

streamlit run main.py

Then, open your web browser and navigate to the URL provided by Streamlit (usually http://localhost:8501).

Usage

Upload a PDF file using the file uploader.
Wait for the system to process the document and set up the QA system.
Once the system is ready, you can start asking questions about the document content.
The application will provide answers and highlight relevant sections in the PDF preview.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
main.py		main.py
main_with_ollama.py		main_with_ollama.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ChatPDF

Features

Technologies Used

Setup and Installation

Running the Application

Usage

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

imanoop7/Highliting-PDF-on-UI-using-Streamlit-for-RAG

Folders and files

Latest commit

History

Repository files navigation

ChatPDF

Features

Technologies Used

Setup and Installation

Running the Application

Usage

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages