
What do the x-ratelimit-remaining-tokens and x-ratelimit-remaining-requests headers mean? #138918

Answered by rajbos
honeming asked this question in Models

The rate limits refer to the overview in the documentation:

You have 12k tokens (in+out) per request and can make up to 150 requests a day (at the Free / Copilot Individual level). That amounts to 1.8 million tokens a day (12,000 × 150), which is a lot lower than the number you have, so I am not sure how that is calculated.

Perhaps they have temporarily higher limits during the beta? I have the lower numbers by the way. I've used a PAT here:

key                              value
x-ratelimit-remaining-tokens     1997968
x-ratelimit-remaining-requests   19998
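
To compare these headers with the documented figures, here is a minimal Python sketch. It assumes the header values are plain integers, as they appear above; the function name `parse_rate_limit_headers` and the two constants are hypothetical names chosen for illustration, and the 12k / 150 figures are simply the ones quoted earlier in this answer.

```python
from typing import Mapping, Optional

# Figures quoted earlier in this answer (Free / Copilot Individual level).
TOKENS_PER_REQUEST = 12_000
REQUESTS_PER_DAY = 150


def parse_rate_limit_headers(headers: Mapping[str, str]) -> dict:
    """Return the remaining token and request counts as ints (None if absent)."""
    # HTTP header names are case-insensitive, so normalise before the lookup.
    lowered = {name.lower(): value for name, value in headers.items()}

    def as_int(name: str) -> Optional[int]:
        raw = lowered.get(name)
        return int(raw) if raw is not None else None

    return {
        "remaining_tokens": as_int("x-ratelimit-remaining-tokens"),
        "remaining_requests": as_int("x-ratelimit-remaining-requests"),
    }


# Daily budget implied by the documented per-request and per-day figures.
documented_daily_tokens = TOKENS_PER_REQUEST * REQUESTS_PER_DAY

# The header values observed above, passed in as a plain dict.
observed = parse_rate_limit_headers({
    "x-ratelimit-remaining-tokens": "1997968",
    "x-ratelimit-remaining-requests": "19998",
})

print(documented_daily_tokens)
print(observed)
```

With the values above, the observed remaining tokens exceed the 1.8 million computed from the documentation, which is the mismatch this answer is unsure about.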

I noticed this info in the response as well:

"usage": {
  "completion_tokens": 7,
  "prompt_tokens": 24,
  "total_tokens": 31
}
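
As a rough sketch of how that `usage` block could be read per response, the Python below parses the fragment and checks that the prompt and completion counts add up to the total. The helper name `tokens_used` is hypothetical, and whether `x-ratelimit-remaining-tokens` drops by exactly `total_tokens` per call is an assumption this thread does not confirm.

```python
import json

# The usage fragment from the response above, wrapped in braces to make valid JSON.
RESPONSE_BODY = """
{
  "usage": {
    "completion_tokens": 7,
    "prompt_tokens": 24,
    "total_tokens": 31
  }
}
"""


def tokens_used(body: dict) -> int:
    """Return total_tokens after checking it equals prompt + completion tokens."""
    usage = body["usage"]
    expected = usage["prompt_tokens"] + usage["completion_tokens"]
    if usage["total_tokens"] != expected:
        raise ValueError("total_tokens does not match prompt + completion")
    return usage["total_tokens"]


used = tokens_used(json.loads(RESPONSE_BODY))
print(used)
```

Summing this value across calls gives a client-side estimate of consumption that can be compared with the change in the header between requests.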

Answer selected by honeming