Research Mode [Part 2]: Improve Prompts, Edit Chat Messages. Set LLM Seed for Reproducibility #954

debanjum · 2024-11-01T21:56:31Z

Improve chat actors and their prompts for research mode.
Add documentation to enable the code tool when self-hosting Khoj
Edit Chat Messages
- Store Turn Id in each chat message.
- Expose API to delete chat message.
- Expose delete chat message button to turn delete chat message from web app
Set LLM Generation Seed for Reproducible Debugging and Testing
- Setting seed for LLM generation is supported by Llama.cpp and OpenAI models.
  This can (somewhat) restrain LLM output
- Getting fixed responses for fixed inputs helps test, debug longer reasoning chains like used in advanced reasoning

Capability exists but idea needs to be investigated further

- Allow server to start if loading embedding model fails with an error. This allows fixing the embedding model config via admin panel. Previously server failed to start if embedding model was configured incorrectly. This prevented fixing the model config via admin panel. - Convert boolean string in config json to actual booleans when passed via admin panel as json before passing to model, query configs - Only create default model if no search model configured by admin. Return first created search model if its been configured by admin.

Anthropic models do not support seed. But offline, gemini and openai models do. Use these to debug and test Khoj via KHOJ_LLM_SEED env var

Each chat turn is a user query, khoj response message pair

- Match the online query generator prompt to match the formatting of extract questions - Separate iteration results by newline - Improve webpage and online tool descriptions

…easoning-and-other-misc-fixes

…ies that were not with selected tool when going through tool selection iterations

…easoning-and-other-misc-fixes

…ent creation

….com:khoj-ai/khoj into improve-debug-reasoning-and-other-misc-fixes

…prompt

debanjum and others added 30 commits October 30, 2024 14:00

Make cursor in chat input take on selected agent color

2ac840e

Defer turning cursor color to selected agents color for later

358a6ce

Capability exists but idea needs to be investigated further

Support setting seed for reproducible LLM response generation

b3a6301

Anthropic models do not support seed. But offline, gemini and openai models do. Use these to debug and test Khoj via KHOJ_LLM_SEED env var

Handle add/delete file filter operation on non-existent conversation

f64f5b3

Store turn id with each chat message. Expose API to delete chat turn

ba15686

Each chat turn is a user query, khoj response message pair

Add ability to delete messages from the web app

ca5a683

Resolve train of thought component needs unique key id error on web app

cb90abc

Fix deleting new messages generated after conversation load

e8e6ead

Put train of thought ui before Khoj response on web app

e17dc9f

Only show trash can when turnId is present

a137606

Do not exit if/else loop in research loop when notes not found

559601d

Only add /research prefix in research mode if not already in user query

5b15176

Json dump contents in prompt tracer to make structure discernable

89597ae

Improve research planner prompt to reduce looping

52163fe

Improve online chat actor prompt for research and normal mode

302bd51

- Match the online query generator prompt to match the formatting of extract questions - Separate iteration results by newline - Improve webpage and online tool descriptions

Merge branch 'master' of github.com:khoj-ai/khoj into improve-debug-r…

1924180

…easoning-and-other-misc-fixes

Remove conversation command always in query, filter out inferred quer…

21858ac

…ies that were not with selected tool when going through tool selection iterations

Use bottom anchor for the commandbar popover

149cbe1

Only include inferred-queries in chat history when present

3ea94ac

Set usage limits on the research mode

0145b2a

Add experimental notice to research mode tooltip

33d36ee

Handle case where infer_webpage_url returns no valid urls

1fc280d

Add template for a code sandbox to the docker-compose configuration

ffa7f95

Add documentation for python code execution capability

23a49b6

Standardize rate limits to 1/6 ratio

b3dad1f

Merge branch 'master' of github.com:khoj-ai/khoj into improve-debug-r…

2b35790

…easoning-and-other-misc-fixes

Simplify logic to get default search model. Remove unused import

ac21b10

Use standard per minute rate limits across user types

9c7b36d

Clarify description of the code evaluation environment: not for docum…

b79a9ec

…ent creation

sabaimran and others added 6 commits November 1, 2024 16:48

create defiltered query after conversation command is extracted

327fcb8

Limit the number of urls the webscraper can extract for scraping

a213b59

Merge branch 'improve-debug-reasoning-and-other-misc-fixes' of github…

e6eb87b

….com:khoj-ai/khoj into improve-debug-reasoning-and-other-misc-fixes

Clean API chat router. Move FeedbackData response type to router helper

1a83bbc

Expect query before tool in response to give think space in research …

ab321dc

…prompt

Add prompt tracing, agent personality to infer webpage urls chat actor

14e4530

debanjum force-pushed the improve-debug-reasoning-and-other-misc-fixes branch from fbc753c to 14e4530 Compare November 2, 2024 01:12

debanjum merged commit cff8e02 into master Nov 2, 2024
8 of 9 checks passed

debanjum deleted the improve-debug-reasoning-and-other-misc-fixes branch November 2, 2024 01:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Research Mode [Part 2]: Improve Prompts, Edit Chat Messages. Set LLM Seed for Reproducibility #954

Research Mode [Part 2]: Improve Prompts, Edit Chat Messages. Set LLM Seed for Reproducibility #954

debanjum commented Nov 1, 2024 •

edited

Loading

Research Mode [Part 2]: Improve Prompts, Edit Chat Messages. Set LLM Seed for Reproducibility #954

Research Mode [Part 2]: Improve Prompts, Edit Chat Messages. Set LLM Seed for Reproducibility #954

Conversation

debanjum commented Nov 1, 2024 • edited Loading

debanjum commented Nov 1, 2024 •

edited

Loading