
Count system prompt tokens #850

Merged

merged 13 commits into from Mar 22, 2024

Conversation

@mishig25 (Collaborator) commented Feb 20, 2024

On updating model settings:

Screenshot 2024-03-18 at 3 16 28 PM



On updating assistants settings:

image

@mishig25 mishig25 changed the title Count sysmte prompt tokens Count system prompt tokens Feb 20, 2024
@gary149 (Collaborator) commented Feb 20, 2024

There are some errors with some models, I think 👀

@mishig25 (Collaborator, Author):

> There are some errors with some models, I think 👀

The reason is that https://huggingface.co/meta-llama/Llama-2-70b-chat-hf is a gated model, so when the frontend tokenizer code tries to fetch the tokenizer files, the request fails.

For now, the behaviour is: if the model is not gated, the token count is shown; if it is gated, nothing appears (see the sketch at the end of this comment).

We have two possibilities:

  1. Keep the current behaviour, where no tokens are shown for gated models.
  2. Move the token-counting code to the backend/server so that tokens can be shown for all models.

wdyt?
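
For context, a minimal sketch of the failure mode described above, assuming the frontend counts tokens with transformers.js (`@xenova/transformers`); the function name is illustrative, not the PR's actual code:

```ts
import { AutoTokenizer } from "@xenova/transformers";

// Try to count prompt tokens client-side. For a gated repo such as
// meta-llama/Llama-2-70b-chat-hf, the unauthenticated fetch of the
// tokenizer files fails, so no count can be shown.
async function countPromptTokens(modelId: string, prompt: string): Promise<number | undefined> {
	try {
		const tokenizer = await AutoTokenizer.from_pretrained(modelId);
		return tokenizer.encode(prompt).length;
	} catch {
		return undefined; // gated or missing tokenizer: hide the counter
	}
}
```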

@gary149 (Collaborator) commented Feb 21, 2024

I think the current behavior is good. But it shouldn't even try to tokenize if the tokenizer is not available (here it throws 100s of console errors).

image

@nsarrazin added the enhancement (New feature or request), front (related to the front-end of the app), and back (related to the Svelte backend or the DB) labels on Feb 22, 2024
@mishig25 mishig25 marked this pull request as ready for review March 18, 2024 15:12
@mishig25 mishig25 marked this pull request as draft March 18, 2024 15:17
@mishig25 (Collaborator, Author):

> I think the current behavior is good. But it shouldn't even try to tokenize if the tokenizer is not available (here it throws 100s of console errors).

Handled it. Only one error gets thrown in this case now.
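
One plausible way to get that single-error behaviour (a hedged sketch, not necessarily the PR's implementation) is to memoize the tokenizer load per model, so a failed fetch is reported once rather than on every keystroke:

```ts
import { AutoTokenizer, PreTrainedTokenizer } from "@xenova/transformers";

// Cache one load attempt per model id; a rejected load resolves to null,
// so later calls reuse the result instead of re-fetching and re-erroring.
const tokenizerCache = new Map<string, Promise<PreTrainedTokenizer | null>>();

function getTokenizer(modelId: string): Promise<PreTrainedTokenizer | null> {
	let cached = tokenizerCache.get(modelId);
	if (!cached) {
		cached = AutoTokenizer.from_pretrained(modelId).catch((err) => {
			console.error(err); // surfaced exactly once per model
			return null;
		});
		tokenizerCache.set(modelId, cached);
	}
	return cached;
}
```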

@mishig25 mishig25 marked this pull request as ready for review March 18, 2024 16:41
@mishig25 mishig25 marked this pull request as draft March 18, 2024 16:56
@mishig25 mishig25 marked this pull request as ready for review March 18, 2024 17:02
@gary149 (Collaborator) commented Mar 19, 2024

Just don't do it on llama-2? (As it's gated, it will never work.) Maybe it could be in the config.

@nsarrazin (Collaborator) left a comment:

When it works, it seems to work well, but I sometimes get console errors when switching between models in the settings without reloading the page. If I reload the page, tokenization works again. Not sure what is causing it.

@gary149 (Collaborator) commented Mar 19, 2024

> When it works, it seems to work well, but I sometimes get console errors when switching between models in the settings without reloading the page. If I reload the page, tokenization works again. Not sure what is causing it.

Yes, trying to reproduce that too 👀

A collaborator commented on the diff:

This won't work on local models with chat-ui. I would probably set an explicit key in the config for models that are compatible.

This reverts commit 61c0a22.
@mishig25 (Collaborator, Author):

Updates:

Added a model.tokenizer config. You can supply either:

a string, which is passed to the transformers.js AutoTokenizer

OR

```ts
{
    tokenizerUrl: string;
    tokenizerConfigUrl: string;
}
```

which is used to construct a transformers.js PreTrainedTokenizer.

See the .env.template file of this PR. Important: when testing locally, make sure you have copied model.tokenizer from the .env.template of this PR.
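
A hedged sketch of how such a config could be resolved (illustrative names, assuming `@xenova/transformers`; not the exact PR code):

```ts
import { AutoTokenizer, PreTrainedTokenizer } from "@xenova/transformers";

type TokenizerConfig = string | { tokenizerUrl: string; tokenizerConfigUrl: string };

async function loadTokenizer(config: TokenizerConfig): Promise<PreTrainedTokenizer> {
	if (typeof config === "string") {
		// a Hub model id, e.g. "mistralai/Mistral-7B-Instruct-v0.1"
		return AutoTokenizer.from_pretrained(config);
	}
	// explicit URLs for tokenizer.json and tokenizer_config.json
	const [tokenizerJSON, tokenizerConfig] = await Promise.all([
		fetch(config.tokenizerUrl).then((res) => res.json()),
		fetch(config.tokenizerConfigUrl).then((res) => res.json()),
	]);
	return new PreTrainedTokenizer(tokenizerJSON, tokenizerConfig);
}
```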

Fixed the reactivity issues mentioned above (here and here).

Editing model system prompts

Screen.Recording.2024-03-19.at.12.21.00.PM.mov

Creating an assistant

Screen.Recording.2024-03-19.at.12.08.24.PM.mov

Editing an assistant

Screen.Recording.2024-03-19.at.12.20.18.PM.mov

```svelte
classNames="absolute bottom-2 right-2"
prompt={systemPrompt}
modelTokenizer={model.tokenizer}
max_new_tokens={model?.parameters?.max_new_tokens}
```
A collaborator commented on the diff:

use truncate value instead?
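
Presumably this suggests binding the model's truncate parameter (the prompt-truncation limit) rather than max_new_tokens; a hypothetical one-line change:

```svelte
<!-- hypothetical: compare the counted tokens against the truncate limit -->
truncate={model?.parameters?.truncate}
```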

@mishig25 (Collaborator, Author):

handled in f47c6bb

@nsarrazin (Collaborator) left a comment:

Works quite well! 🔥

@gary149 gary149 self-requested a review March 22, 2024 12:39
@gary149 gary149 merged commit 50d8483 into main Mar 22, 2024
3 checks passed
@gary149 gary149 deleted the show_number_of_tokens branch March 22, 2024 12:51
Labels

back (This issue is related to the Svelte backend or the DB), enhancement (New feature or request), front (This issue is related to the front-end of the app)

Projects

None yet

Development

Successfully merging this pull request may close these issues: none yet.

3 participants