
[FEATURE_REQUEST] Using the llama.cpp server save/load slot API for group chats #2044

sasha0552 opened this issue Apr 8, 2024

Labels: 🦄 Feature Request [ISSUE] Suggestion for new feature, update or change

sasha0552 (Contributor) commented Apr 8, 2024

Have you searched for similar requests?

Yes

Is your feature request related to a problem? If so, please describe.

No response

Describe the solution you'd like

Use the new llama.cpp server API to save/load slots for group chats (and possibly also for regular chats and/or the message histories created by "Start new chat"). This would avoid reprocessing the whole context on every message from a different character. A rough sketch of what the client-side calls could look like is shown below.
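A minimal sketch, assuming the slot API from ggerganov/llama.cpp#6341 (`POST /slots/{id_slot}?action=save` / `?action=restore` with a filename in the request body) and a server started with `--slot-save-path`. The base URL, slot id, file-naming scheme, and completion parameters are illustrative assumptions, not SillyTavern code:

```ts
// Sketch only: save/restore a llama.cpp server slot around a group-chat turn.
// Assumes the slot API from ggerganov/llama.cpp#6341:
//   POST /slots/{id_slot}?action=save     body: { "filename": "..." }
//   POST /slots/{id_slot}?action=restore  body: { "filename": "..." }
// and a server launched with --slot-save-path.

const BASE_URL = "http://127.0.0.1:8080"; // assumed llama.cpp server address

async function slotAction(slotId: number, action: "save" | "restore", filename: string) {
  const res = await fetch(`${BASE_URL}/slots/${slotId}?action=${action}`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ filename }),
  });
  if (!res.ok) {
    throw new Error(`slot ${action} failed: ${res.status} ${await res.text()}`);
  }
  return res.json();
}

// Hypothetical usage in a group chat: one cache file per character, so the server
// can restore that character's processed context instead of re-evaluating it.
async function generateForCharacter(characterId: string, prompt: string) {
  const slotId = 0; // assumed single-slot setup
  const cacheFile = `groupchat-${characterId}.bin`;

  // Restore this character's cached state if it exists; on the first turn the
  // restore will fail and the prompt is simply processed from scratch.
  await slotAction(slotId, "restore", cacheFile).catch(() => undefined);

  const res = await fetch(`${BASE_URL}/completion`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ prompt, id_slot: slotId, cache_prompt: true }),
  });
  const data = await res.json();

  // Save the updated state so the next turn for this character is cheap.
  await slotAction(slotId, "save", cacheFile);
  return data;
}
```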

Describe alternatives you've considered

Continuing to use group chats as they work now, with the context re-processed on each message from a different character.

Additional context

I can create a PR that implements this feature.

Related: ggerganov/llama.cpp#6341

Priority

Low (Nice-to-have)

Are you willing to test this on staging/unstable branch if this is implemented?

Yes

sasha0552 added the 🦄 Feature Request [ISSUE] Suggestion for new feature, update or change label Apr 8, 2024