
Update Groq Models #613

Open · wants to merge 3 commits into main
Conversation

@lumpidu (Contributor) commented Feb 3, 2025

The llama-3.1-70b-versatile model has been removed from the Groq API. With deepseek-r1's rising popularity, testing distilled versions on Groq's infrastructure could be worthwhile.

  • add deepseek-r1-distill-llama-70B
  • replace llama-3.1-70b-versatile with llama-3.3-70b-versatile

Note that Groq doesn't support the maximum context length of 128K on the free tier. I get an API error as soon as a request exceeds ~6,000 tokens:

 Request too large for model deepseek-r1-distill-llama-70b in organization XYZ service tier on_demand
 on tokens per minute (TPM): Limit 6000, Requested 16955, please reduce your message size and try again.
 Visit https://console.groq.com/docs/rate-limits for more information.
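Until the free-tier limit is lifted, callers have to keep each request under the tokens-per-minute budget themselves. A minimal sketch of one way to do that, trimming the oldest non-system messages before sending; the 4-characters-per-token heuristic and the 6,000-token budget are assumptions taken from the error above, not Groq's actual tokenizer:

```python
# Hedged sketch: keep a chat payload under an assumed free-tier TPM budget.
# The chars-per-token ratio is a rough heuristic, not Groq's tokenizer.

def estimate_tokens(text: str) -> int:
    """Rough token estimate (~4 characters per token)."""
    return max(1, len(text) // 4)

def trim_messages(messages: list[dict], budget: int = 6000) -> list[dict]:
    """Drop the oldest non-system messages until the estimate fits the budget."""
    kept = list(messages)
    while kept and sum(estimate_tokens(m["content"]) for m in kept) > budget:
        for i, m in enumerate(kept):
            if m["role"] != "system":
                del kept[i]  # drop the oldest non-system message
                break
        else:
            break  # only system messages left; nothing more to drop
    return kept
```

The trimmed list can then be posted to Groq's OpenAI-compatible chat-completions endpoint as usual; a smarter variant might summarize dropped turns instead of discarding them.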

lumpidu and others added 3 commits February 3, 2025 18:13
- add deepseek-r1-distill-llama-70B
- replace llama-3.1-70b-versatile with llama-3.3-70b-versatile
@krschacht (Contributor) commented:
@lumpidu I think the issue is that you still had an assistant that was referencing the llama model you removed. I updated the assistants file.

Can you confirm this now works for you?
