Handling rate limits for API requests #14
Unanswered · ScottBlinman asked this question in Q&A
Replies: 2 comments
-
Wow, just noticed this! I am using Horizon to manage this. There is a new Horizon queue for OpenAI that I only allow 3 jobs at a time. Same with Ollama. Hope it is working well!
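For anyone wanting to reproduce this, a dedicated Horizon supervisor capped at a small number of worker processes is one way to express "only 3 jobs at a time" per provider. This is a minimal sketch only; the supervisor and queue names (`openai`, `ollama`) are assumptions and not taken from this project's actual config:

```php
// config/horizon.php (sketch, assuming dedicated `openai` and `ollama` queues)
'environments' => [
    'production' => [
        // Cap concurrent OpenAI jobs at 3 worker processes
        'openai-supervisor' => [
            'connection'   => 'redis',
            'queue'        => ['openai'],
            'balance'      => 'simple',
            'maxProcesses' => 3,
        ],
        // Same cap for Ollama jobs
        'ollama-supervisor' => [
            'connection'   => 'redis',
            'queue'        => ['ollama'],
            'balance'      => 'simple',
            'maxProcesses' => 3,
        ],
    ],
],
```

Jobs dispatched with `->onQueue('openai')` would then never run more than three at once, which indirectly throttles outbound API calls.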
-
I have managed to get this system working on an Ubuntu server. Any ideas on how to handle the rate limits for OpenAI?
Is there a way we can use batching for the API requests that are sent?
Keen to discuss further.
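One possible direction for the rate-limit side of the question (a sketch only, not something confirmed to be in this project): since OpenAI's limits are expressed per minute rather than per concurrent job, Laravel's `Redis::throttle` lock can cap how many requests the queue makes in a window and release jobs back onto the queue when the limit is hit. The job class name, throttle key, and numbers below are placeholders:

```php
<?php

namespace App\Jobs;

use Illuminate\Bus\Queueable;
use Illuminate\Contracts\Queue\ShouldQueue;
use Illuminate\Queue\InteractsWithQueue;
use Illuminate\Support\Facades\Redis;

// Hypothetical job name for illustration only
class CallOpenAi implements ShouldQueue
{
    use Queueable, InteractsWithQueue;

    public function handle(): void
    {
        // Allow at most 50 OpenAI requests per 60 seconds across all workers (placeholder limits).
        Redis::throttle('openai-api')
            ->allow(50)
            ->every(60)
            ->block(5)
            ->then(function () {
                // ... perform the actual OpenAI request here ...
            }, function () {
                // Lock not obtained: put the job back on the queue and retry in 10 seconds.
                $this->release(10);
            });
    }
}
```

A process cap in Horizon and a throttle like this address different limits: the former bounds concurrency, the latter requests per minute.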