-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dynamic Context Window Size for Ollama Chat #6582
Conversation
## Problem Statement Previously, the Ollama chat implementation used a fixed context window size of 32768 tokens. This caused two main issues: 1. Performance degradation due to unnecessarily large context windows for small conversations 2. Potential business logic failures when using smaller fixed sizes (e.g., 2048 tokens) ## Solution Implemented a dynamic context window size calculation that: 1. Uses a base context size of 8192 tokens 2. Applies a 1.2x buffer ratio to the total token count 3. Adds multiples of 8192 tokens based on the buffered token count 4. Implements a smart context size update strategy
@MarcusYuan @KevinHuSh |
@asiroliu "I don’t understand what you’re trying to say." |
@MarcusYuan |
@asiroliu What I did was dynamically modify Ollama's "num_ctx": ctx_size. This only solves the issue where a fixed (hardcoded) value would cause truncation when too many input tokens are processed. I don’t see how this relates to the video you mentioned above. |
@MarcusYuan Steps to reproduce:
git clone -b main https://github.com/MarcusYuan/ragflow.git 6582
cd 6582
docker build --progress=plain --build-arg LIGHTEN=1 --build-arg NEED_MIRROR=1 -f Dockerfile -t infiniflow/ragflow:6582 .
|
Dynamic Context Window Size for Ollama Chat
Problem Statement
Previously, the Ollama chat implementation used a fixed context window size of 32768 tokens. This caused two main issues:
Solution
Implemented a dynamic context window size calculation that:
Implementation Details
Token Counting Logic
Dynamic Context Calculation
Integration in Chat Method
Benefits
Future Considerations