Add DINOv as a new tool #44

humpydonkey · 2024-04-09T16:43:31Z

Example usage

from vision_agent.tools.tools import DINOv


img_path = "/Users/asia/Downloads/data/bags.jpg"
request = {
    "prompt": [
        {
            "mask": "/Users/asia/Downloads/data/mask_prompt_bags0.jpg",
            "image": img_path,
        },
        {
            "mask": "/Users/asia/Downloads/data/mask_prompt_bags1.jpg",
            "image": img_path,
        },
    ],
    "image": img_path,
}
res = DINOv()(**request)
res

Co-authored-by: Yazhou Cao <[email protected]>

* Update prompts.py * Update vision_agent_prompts.py * Update reflexion_prompts.py * Update vision_agent_prompts.py * Update easytool_prompts.py * Update prompts.py * Update vision_agent_prompts.py

* get endpoint ready for demo fixed tools.json Update vision_agent/tools/tools.py Bug fixes * Fix linter errors * Fix a bug in result parsing * Include scores in the G-SAM model response * Removed tools.json , need to find better format * Fixing the endpoint for CLIP and adding thresholds for grounding tools * fix mypy errors * fixed example notebook --------- Co-authored-by: Yazhou Cao <[email protected]> Co-authored-by: shankar_ws3 <[email protected]>

Add a callback for reporting the chat progress of an agent

Co-authored-by: Yazhou Cao <[email protected]>

Fix another typo Co-authored-by: Yazhou Cao <[email protected]>

* fix visualization error * added font and score to viz * changed to smaller font file * Support streaming chat logs of an agent (#47) Add a callback for reporting the chat progress of an agent * fix visualize score issue * updated descriptions, fixed counter bug * added visualize_output * make feedback more concrete * made naming more consistent * replaced individual calc ops with calculator tool * fix random colors * fix prompts for tools * update reflection prompt * update readme * formatting fix * fixed mypy errors * fix merge issue --------- Co-authored-by: Asia <[email protected]>

added image caption tool

* Switch the host of model endpoint to api.dev.landing.ai * DRY/Abstract out the inference code in tools * Introduce LandingaiAPIKey and support loading from .env file * Add integration tests for four model tools * Minor tweaks/fixes * Remove dead code * Bump the minor version to 0.1.0

* visualized output/reflection to handle extract_frames_ * remove ipdb * added json mode for lmm, upgraded gpt-4-turbo * updated reflection prompt * refactor to make function simpler * updated reflection prompt, add tool usage doc * fixed format issue * fixed type issue * fixed test case

* Tweak frame extraction function * remove default motion detection, extract at 0.5 fps * lmm now take multiple images * removed counter * tweaked prompt * updated vision agent to reflect on multiple images * fix test case * added box distance * adjusted prompts --------- Co-authored-by: Yazhou Cao <[email protected]> Co-authored-by: Dillon Laird <[email protected]>

AsiaCao and others added 30 commits April 9, 2024 09:42

Add DINOv as a new tool

44feb20

Fix lint errors

b18eabe

Update docs

05b1ad5

Fix param name mismatch (#45)

cc2035c

Co-authored-by: Yazhou Cao <[email protected]>

Grammar/Spelling fixes (#46)

5ace291

* Update prompts.py * Update vision_agent_prompts.py * Update reflexion_prompts.py * Update vision_agent_prompts.py * Update easytool_prompts.py * Update prompts.py * Update vision_agent_prompts.py

Support streaming chat logs of an agent (#47)

073d40b

Add a callback for reporting the chat progress of an agent

Empty-Commit

dfce50b

Empty-Commit: attempt to fix release

1498307

[skip ci] chore(release): vision-agent 0.0.49

109eb87

Fix typo (#48)

32c4738

Co-authored-by: Yazhou Cao <[email protected]>

[skip ci] chore(release): vision-agent 0.0.50

a11c12d

Fix a typo in log (#49)

4da5d72

Fix another typo Co-authored-by: Yazhou Cao <[email protected]>

[skip ci] chore(release): vision-agent 0.0.51

e062992

[skip ci] chore(release): vision-agent 0.0.52

ec1a73b

Add image caption tool (#52)

fbe404c

added image caption tool

[skip ci] chore(release): vision-agent 0.0.53

9542d22

[skip ci] chore(release): vision-agent 0.1.1

66217a4

[skip ci] chore(release): vision-agent 0.1.2

1055aea

[skip ci] chore(release): vision-agent 0.1.3

da818ad

Merge branch 'main' into add-dinov

33cdf31

doc changes

94f94ba

fixed merge issues

d1d3268

fix color issue

81f3cf3

add dinov with updated endpoint

2a69082

formatting fix

34580e1

dillonalaird added 2 commits April 18, 2024 15:24

added reference mask support

6b232ed

fix linting

20a687a

dillonalaird approved these changes Apr 19, 2024

View reviewed changes

dillonalaird merged commit 7d72439 into main Apr 19, 2024
7 checks passed

dillonalaird deleted the add-dinov branch April 22, 2024 16:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add DINOv as a new tool #44

Add DINOv as a new tool #44

humpydonkey commented Apr 9, 2024

Add DINOv as a new tool #44

Add DINOv as a new tool #44

Conversation

humpydonkey commented Apr 9, 2024