Add tool testing #164

dillonalaird · 2024-07-08T22:52:24Z

This PR makes several changes:

The planner now outputs 3 plans.
A new tester agent tests the plans and picks the best one.
The LMM now supports mp4 files as media input.
Max tokens are increased, and color codes are stripped from tracebacks.
Reflection is removed.
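
For readers skimming the description, here is a rough sketch of the "test several plans, pick the best" flow together with the traceback cleanup. It is not the code in this PR: `strip_color_codes`, `run_plans`, and `pick_best_plan` are hypothetical names, each plan is assumed to be a plain callable that returns a numeric score, and the "best" plan is assumed to be the highest-scoring one that ran without error. The actual tester agent may choose plans quite differently.

```python
import re
import traceback
from typing import Callable, Dict, Optional

# Matches ANSI CSI escape sequences such as "\x1b[31m" (red) or "\x1b[0m" (reset).
ANSI_ESCAPE = re.compile(r"\x1b\[[0-9;?]*[ -/]*[@-~]")


def strip_color_codes(text: str) -> str:
    """Remove ANSI color/formatting escapes so the text is plain."""
    return ANSI_ESCAPE.sub("", text)


def run_plans(plans: Dict[str, Callable[[], float]]) -> Dict[str, dict]:
    """Run every candidate plan and record its score or its cleaned traceback.

    Assumption: a plan is a zero-argument callable returning a score.
    """
    results: Dict[str, dict] = {}
    for name, plan in plans.items():
        try:
            results[name] = {"score": plan(), "error": None}
        except Exception:
            # Keep the traceback as plain text, without terminal color codes.
            cleaned = strip_color_codes(traceback.format_exc())
            results[name] = {"score": None, "error": cleaned}
    return results


def pick_best_plan(results: Dict[str, dict]) -> Optional[str]:
    """Return the name of the highest-scoring plan that ran without error."""
    scores = {n: r["score"] for n, r in results.items() if r["error"] is None}
    return max(scores, key=scores.get) if scores else None
```

Keeping failed plans in the results, rather than dropping them, leaves the cleaned traceback available as feedback for a later attempt.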

shankar-vision-eng

Left some comments

shankar-vision-eng · 2024-07-09T21:42:10Z

vision_agent/agent/vision_agent_prompts.py

@@ -29,23 +29,110 @@
 {feedback}

 **Instructions**:
-1. Based on the context and tools you have available, write a plan of subtasks to achieve the user request.
-2. Go over the users request step by step and ensure each step is represented as a clear subtask in your plan.
+1. Based on the context and tools you have available, create a plan of subtasks to achieve the user request.


NIT: consider "create a set of different plans to achieve the target in the user request" instead of "create a plan of subtasks to achieve the user request". Subtasks read more like sub-plans. You might have to run this through the benchmark.

Good catch, that's a good suggestion. I'll test it out.

Not entirely sure why, but this change significantly lowered the benchmark results.

vision_agent/agent/vision_agent_prompts.py

vision_agent/lmm/lmm.py

shankar-vision-eng

Left a few more comments

vision_agent/agent/vision_agent.py

shankar-vision-eng

LGTM!

Regarding the change to the plan prompt, I'll leave it to you. Can you go through the benchmark cases that scored lower? I'm assuming the benchmark is also not deterministic, so there should be some standard deviation (SD). If the drop is within the SD, I would go with the prompt change.
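
The SD check suggested above could be sketched roughly as follows. This is only an illustration: `within_noise` and its parameters are hypothetical names, it assumes the benchmark has been run several times on both the old and the new prompt, and it treats "within the SD" as "the difference in mean scores is no larger than `num_sd` sample standard deviations of the baseline runs".

```python
import statistics
from typing import Sequence


def within_noise(
    baseline_runs: Sequence[float],
    candidate_runs: Sequence[float],
    num_sd: float = 1.0,
) -> bool:
    """Return True if the candidate's mean score is within num_sd baseline
    standard deviations of the baseline's mean score.

    Assumption: each sequence holds one overall benchmark score per run,
    and baseline_runs has at least two entries (stdev needs two points).
    """
    baseline_mean = statistics.mean(baseline_runs)
    baseline_sd = statistics.stdev(baseline_runs)  # sample standard deviation
    candidate_mean = statistics.mean(candidate_runs)
    return abs(candidate_mean - baseline_mean) <= num_sd * baseline_sd
```

With only a handful of runs the SD estimate is itself noisy, so a result close to the threshold is better treated as inconclusive than as a pass or fail.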

dillonalaird added 9 commits July 2, 2024 19:01

adding image support for va

c40cd0a

save

098fbb9

removed reflection

29e7ac3

image support for vision agent planning

479e037

handle more media types for lmm

08089a1

final changes

3c9d556

fix flake8

ea64cfa

format fix

7d0f7e9

add tool testing

ad20576

dillonalaird requested a review from shankar-vision-eng July 9, 2024 14:59

dillonalaird added 3 commits July 9, 2024 09:10

remove array types from printed tool results

e277003

fixed bug in prompt

c23df33

remove trailing space

db338b9

shankar-vision-eng reviewed Jul 9, 2024

vision_agent/agent/vision_agent.py (two resolved comment threads)

shankar-vision-eng approved these changes Jul 10, 2024

dillonalaird merged commit 169d650 into main Jul 10, 2024
8 checks passed

dillonalaird deleted the add-tool-testing branch July 10, 2024 20:04
