optional max_tokens #4401

Open
wants to merge 1 commit into main

Conversation

@Algorithm5838 (Contributor) commented Mar 27, 2024

Use a checkbox to optionally enable max_tokens instead of keeping it disabled. This feature is useful for OpenAI models, as well as models from OpenRouter and other platforms.
I've set the default to 2048 for smaller-context (4k) models; however, 4096 is the preferred setting for newer models from OpenAI and Anthropic. Although these models support much larger contexts, their output is capped at 4096 tokens.
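
Roughly, the change looks like this (a simplified sketch; the actual field names in the diff may differ):

```ts
// Sketch of the setting shape; field names are illustrative, not the real diff.
interface ModelConfig {
  model: string;
  temperature: number;
  enableMaxTokens: boolean; // the new checkbox; off by default keeps current behavior
  max_tokens: number;       // 2048 by default; 4096 suits newer OpenAI/Anthropic models
}

// Include max_tokens in the request only when the user explicitly enabled it,
// so providers/models that behave better without the field are left untouched.
function buildRequestPayload(config: ModelConfig, messages: unknown[]) {
  const payload: Record<string, unknown> = {
    model: config.model,
    temperature: config.temperature,
    messages,
  };
  if (config.enableMaxTokens) {
    payload.max_tokens = config.max_tokens;
  }
  return payload;
}
```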

vercel bot commented Mar 27, 2024

@Algorithm5838 is attempting to deploy a commit to the NextChat Team on Vercel.

A member of the Team first needs to authorize it.

Your build has completed!

Preview deployment

@H0llyW00dzZ (Contributor) commented:


@Algorithm5838 Just letting you know, there is a bug related to the attach-messages feature caused by the max_tokens setting in this chat.ts file.
The logic needs to be refactored, because the way messages are attached is not consistent and depends on the max_tokens value.
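
If it helps, the coupling is roughly this (a simplified illustration, not the exact chat.ts code; the names are approximations):

```ts
// Simplified view of the problem: the number of attached history messages is
// bounded by a threshold derived from max_tokens, so changing max_tokens (or
// making it optional) silently changes how much history gets sent.
function countAttachedMessages(
  historyTokenCounts: number[], // token count per message, newest last
  maxTokens: number,            // meant as an output cap, but reused as an input budget here
): number {
  const maxTokenThreshold = Math.max(maxTokens, 1024); // placeholder heuristic
  let tokenCount = 0;
  let attached = 0;
  for (let i = historyTokenCounts.length - 1; i >= 0; i -= 1) {
    if (tokenCount + historyTokenCounts[i] > maxTokenThreshold) break;
    tokenCount += historyTokenCounts[i];
    attached += 1;
  }
  return attached;
}
```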

@Algorithm5838 (Contributor, Author) commented Mar 28, 2024

You are correct. I encountered it before and solved it by commenting out this part:

          i >= contextStartIndex;// && tokenCount < maxTokenThreshold;

The issue with the logic is that it assumes max_tokens covers input + output, when it is actually the output limit only.
The right way is to budget the context (input) tokens against each model's context window instead.
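
A sketch of what that refactor could look like (illustrative only; the context-window table and identifiers are assumptions, not the actual chat.ts code):

```ts
// Budget the attached history against the model's *context window* (input),
// reserving max_tokens for the reply, instead of trimming history by max_tokens.
const CONTEXT_WINDOWS: Record<string, number> = {
  "gpt-3.5-turbo": 4096,           // illustrative values
  "gpt-4-turbo-preview": 128000,
  "claude-3-opus-20240229": 200000,
};

function selectHistory(
  messages: { content: string; tokens: number }[],
  model: string,
  maxOutputTokens: number, // the max_tokens value sent with the request
) {
  const contextWindow = CONTEXT_WINDOWS[model] ?? 4096;
  // Reserve room for the reply; whatever is left is the input budget.
  const inputBudget = Math.max(contextWindow - maxOutputTokens, 0);

  const selected: typeof messages = [];
  let used = 0;
  for (let i = messages.length - 1; i >= 0; i -= 1) {
    if (used + messages[i].tokens > inputBudget) break;
    used += messages[i].tokens;
    selected.unshift(messages[i]); // keep chronological order
  }
  return selected;
}
```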

@H0llyW00dzZ (Contributor) commented:


I figured that out a few weeks ago when trying to implement Anthropic support with my friends.
