Enhancement: Custom Token Rates for Endpoints (including Custom) #1633

danny-avila · 2024-01-25T13:03:35Z

What features would you like to see added?

Define own token rates via librechat.yaml

More details

Configuration from YAML file to affect and add to these values

// api/models/tx.js
/**
 * Mapping of model token sizes to their respective multipliers for prompt and completion.
 * @type {Object.<string, {prompt: number, completion: number}>}
 */
const tokenValues = {
  '8k': { prompt: 30, completion: 60 },
  '32k': { prompt: 60, completion: 120 },
  '4k': { prompt: 1.5, completion: 2 },
  '16k': { prompt: 3, completion: 4 },
  'gpt-3.5-turbo-1106': { prompt: 1, completion: 2 },
  'gpt-4-1106': { prompt: 10, completion: 30 },
};

// api/utils/tokens.js
const openAIModels = {
  'gpt-4': 8187, // -5 from max
  'gpt-4-0613': 8187, // -5 from max
  'gpt-4-32k': 32758, // -10 from max
  'gpt-4-32k-0314': 32758, // -10 from max
  'gpt-4-32k-0613': 32758, // -10 from max
  'gpt-3.5-turbo': 4092, // -5 from max
  'gpt-3.5-turbo-0613': 4092, // -5 from max
  'gpt-3.5-turbo-0301': 4092, // -5 from max
  'gpt-3.5-turbo-16k': 16375, // -10 from max
  'gpt-3.5-turbo-16k-0613': 16375, // -10 from max
  'gpt-3.5-turbo-1106': 16375, // -10 from max
  'gpt-4-1106': 127990, // -10 from max
  'mistral-': 31990, // -10 from max
};

// Order is important here: by model series and context size (gpt-4 then gpt-3, ascending)
const maxTokensMap = {
  [EModelEndpoint.openAI]: openAIModels,
  [EModelEndpoint.custom]: openAIModels,
  [EModelEndpoint.google]: {
    /* etc. */

Which components are impacted by your request?

No response

Pictures

No response

Code of Conduct

I agree to follow this project's Code of Conduct

The text was updated successfully, but these errors were encountered:

danny-avila added the enhancement New feature or request label Jan 25, 2024

luka-papez mentioned this issue Apr 2, 2024

[Snyk] Upgrade meilisearch from 0.33.0 to 0.38.0 luka-papez/LibreChat#8

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhancement: Custom Token Rates for Endpoints (including Custom) #1633

Enhancement: Custom Token Rates for Endpoints (including Custom) #1633

danny-avila commented Jan 25, 2024 •

edited

Enhancement: Custom Token Rates for Endpoints (including Custom) #1633

Enhancement: Custom Token Rates for Endpoints (including Custom) #1633

Comments

danny-avila commented Jan 25, 2024 • edited

What features would you like to see added?

More details

Which components are impacted by your request?

Pictures

Code of Conduct

danny-avila commented Jan 25, 2024 •

edited