Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Request: Support Anthropic on Vertex models. #574

Closed
simonbutt opened this issue Sep 30, 2024 · 4 comments
Closed

Feature Request: Support Anthropic on Vertex models. #574

simonbutt opened this issue Sep 30, 2024 · 4 comments

Comments

@simonbutt
Copy link

Currently Anthropic models on AWS Bedrock is supported but not Google Vertex.

"As a user of Inspect eval framework and also GCP, I'd like to use Anthropic models through Vertex AI in the Google Cloud project and region of my choice, rather than having to create a new billing process to use through the Anthropic API"

Details on all anthropic models:
https://docs.anthropic.com/en/docs/about-claude/models

@jjallaire-aisi
Copy link
Collaborator

I don't have access to Vertex to test, however I just made a change that should enable you to put together a PR for this. The Anthropic client already supports Bedrock, and I just made it possible to use an anthropic/bedrock/ model name prefix to opt into using this client: ca04ef5

Anthropic also has a built-in Vertex client, so you could add a vertex flag (and service name check) that binds to Vertex rather than Bedrock or the default Anthropic endpoint.

@lizaitang
Copy link

Currently Anthropic models on AWS Bedrock is supported but not Google Vertex.

"As a user of Inspect eval framework and also GCP, I'd like to use Anthropic models through Vertex AI in the Google Cloud project and region of my choice, rather than having to create a new billing process to use through the Anthropic API"

Details on all anthropic models: https://docs.anthropic.com/en/docs/about-claude/models

Hi, Simon, have you solved this issue?

@jjallaire
Copy link
Collaborator

I have attempted to implement this here: #1122

There are docs on how to make this work in the PR notes. Unfortunately when I try to make this work on my own account I get 429 errors (hard to know how to overcome these, billing and other config looks fine).

If others could install this branch and try it out I'd be appreciative.

@jjallaire
Copy link
Collaborator

I have merged this as it appears to all work correctly (save for the fact that all of my requests are rate limited and attempting to request an increase in rate limits for Anthropic models on Vertex results in an immediate "can't increase your quota above 0 at this time"). It may just be difficult to get Anthropic API resources provisioned through Vertex right now (I'm sure it depends on the customer though).

If people want to try this out on their own accounts here are the basic docs: #1122

You might also want to add --log-level=http so you can see the HTTP rate limits in the console.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants