You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
为什么需要这个功能?(Why do you need it?)
用户体验简化:客户端只需填写"auto"或自定义值,无需了解所有模型细节。
资源优化:根据不同策略选择最合适的模型,提升效率、控制成本。
灵活扩展:后端可以根据负载、价格、响应延迟等灵活切换模型。
需求描述(Is your feature request related to a problem?)
问题点
当前客户端传来的model字段可能是"auto",也可能是自定义的,比如"fastest"、"cheapest"等。后端需要根据实际策略(比如负载、价格、响应速度等)自动选择最终要用的模型。
解决方案思路(How could it be?)
输入
客户端传递的model字段可能是:“auto”、“fastest”、“cheapest”、"best_quality"等,或者某个真实模型名。
处理流程
判断model字段内容
Why do you need it?
Is your feature request related to a problem? Please describe in details
客户端请求的时候,model字段是"auto",或者其他自定义的值。
插件解析配置(随机,延迟,价格,吞吐,等)策略,去选择合适的模型
How could it be?
A clear and concise description of what you want to happen. You can explain more about input of the feature, and output of it.
Other related information
Add any other context or screenshots about the feature request here.
The text was updated successfully, but these errors were encountered: