You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
With huggignface runtime, we want to bring support for usage metrics like tokens (completion tokens + prompt tokens) Ref: usage field in OpenAI response https://platform.openai.com/docs/api-reference/making-requests This can be used by the client for calculating throughput(tokens/sec) etc
Validate: In streaming mode, can we get TTFT (Time to first token)?
The text was updated successfully, but these errors were encountered:
With huggignface runtime, we want to bring support for usage metrics like tokens (completion tokens + prompt tokens) Ref: usage field in OpenAI response https://platform.openai.com/docs/api-reference/making-requests This can be used by the client for calculating throughput(tokens/sec) etc
Validate: In streaming mode, can we get TTFT (Time to first token)?
The text was updated successfully, but these errors were encountered: