Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Full Claude Sonnet 3.5 Support #234

Merged
merged 30 commits into from
Sep 23, 2024
Merged

Full Claude Sonnet 3.5 Support #234

merged 30 commits into from
Sep 23, 2024

Conversation

dillonalaird
Copy link
Member

@dillonalaird dillonalaird commented Sep 11, 2024

This PR adds several big items:

  • Adds AnthropicVisionAgent and makes it the default VisionAgent as well as VisionAgentCoder
  • Adds OpenAIVisionAgent and OpenAIVisionAgentCoder
  • Enables VisionAgent to view media to quickly answer questions about media
  • Adds fine_tune_id to owl_v2 and florence2_sam2_image, and changes meta_tools to handle fine_tune_ids for generic object detectors. This helps an issue where sometimes Claude 3.5 will prefer florence2_phrase_grounding because it can be fine tuned
  • Fixes issue with VisionAgent prompt that causes Claude 3.5 to occasionally output incorrect JSON responses
  • Passes the plan thoughts from planning to the coder, the plan thoughts now also include more information such as thresholds

Copy link
Collaborator

@shankar-vision-eng shankar-vision-eng left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Left some comments

vision_agent/agent/vision_agent.py Show resolved Hide resolved
vision_agent/agent/vision_agent_coder.py Show resolved Hide resolved
vision_agent/lmm/lmm.py Show resolved Hide resolved
vision_agent/tools/tools.py Show resolved Hide resolved
vision_agent/tools/tools.py Show resolved Hide resolved
Copy link
Collaborator

@shankar-vision-eng shankar-vision-eng left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@dillonalaird dillonalaird merged commit 696da6c into main Sep 23, 2024
8 checks passed
@dillonalaird dillonalaird deleted the full-claude-35-support branch September 23, 2024 18:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants