Add Long Term Memory and Feedback #80

dillonalaird · 2024-05-10T17:51:05Z

Adding the remaining two items, long term memory and feedback, for the programming Vision Agent. Tried to make the vision agent more stateless. Calling chat_with_workflow returns a lot of stuff now so that the agent doesn't have to hold on to it as state:

working_memory is the trial and error reflections the model creates when debugging failed code. You can obtain this and use it as long term memory for future usage:

output = agent.chat_with_workflow([{"role": "user", "content": "..."}])
wm = output["working_memory"]

# can save and load it
wm.save("working_mem")
wm = va.utils.load_sim("working_mem")

# merge with existing long term memory
new_ltm = va.utils.merge_sim(wm, ltm)

# can use working memory as long term memory
agent = va.agent.VisionAgentV2(long_term_memory=new_ltm)

If a subtask in the plan fails, it will return the partially completed code and plan early. You can pass a partially completed plan/conversation back to the agent to finish:

output = agent.chat_with_workflow([{"role": "user", "content": "can you code this?"}])

output = agent.chat_with_workflow(
    [{
        "role": "user",
        "content": "can you code this?"
    }, {
        "role": "assistant",
        "content": output["code"],
    }, {
        "role": "user",
        "content": "No, can you use this library?"
    }],
    plan=output["plan"],
)

Or if you want to converse with the agent (passing the old plan back is optional and probably only useful if some part of the original plan failed). This way the chat itself stays stateless, and you can track the conversation/plan.

shankar-vision-eng

Had some questions

vision_agent/agent/vision_agent_v2.py

shankar-vision-eng · 2024-05-13T21:01:37Z

vision_agent/agent/vision_agent_v2.py

+ data["desc"].append(key)
+ data["doc"].append("\n".join(value))
+ df = pd.DataFrame(data) # type: ignore
+ return Sim(df, sim_key="desc")


Can you describe what happens in this function ? Sim returns a df with embedding calculated on the given column. From what i see we build df from a working memory which contains desc and doc. Are the desc and doc are description and doc string of tools or are they something else ? because i was under the assumption working memory is the all the artifacts from a previous run.

Oh yeah, for memory desc and doc are probably not the best terminology. In this case, desc is the subtask string and doc is the debug attempts at trying to build code for that subtask. For example, you might have:

desc: "load the file 'dog.png'"
doc: """
image = open('cat.png')

reflection: You opened the wrong image name, it should be 'dog.png'

image = open('dog.png'
"""

So this context could be saved as long term memory. Then the next time the agent encounters the question "load the file 'dog.png'" it could retrieve this context to help it.

vision_agent/agent/vision_agent_v2.py

shankar-vision-eng

LGTM

dillonalaird added 3 commits May 9, 2024 16:26

fixed save and load

35db97e

added long term memory

8d3c60d

added dynamic re-planning

bf50f74

dillonalaird requested review from shankar-vision-eng and AsiaCao May 10, 2024 17:51

dillonalaird added 3 commits May 13, 2024 13:17

add gpt-4o

e7ef95d

update tests

fd55636

update tests

13df90f

shankar-vision-eng reviewed May 13, 2024

View reviewed changes

dillonalaird added 4 commits May 13, 2024 14:27

fixed exit loop early

3bfab15

add some extra parsing for code snippets

6e09b1f

fix formatting

6bd0bb6

fix typing error

ebd4d96

shankar-vision-eng approved these changes May 13, 2024

View reviewed changes

dillonalaird merged commit d6fd63e into main May 13, 2024
7 checks passed

dillonalaird deleted the add-mem-feedback branch May 14, 2024 21:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Long Term Memory and Feedback #80

Add Long Term Memory and Feedback #80

dillonalaird commented May 10, 2024

shankar-vision-eng left a comment

shankar-vision-eng May 13, 2024

dillonalaird May 13, 2024

shankar-vision-eng left a comment

Add Long Term Memory and Feedback #80

Add Long Term Memory and Feedback #80

Conversation

dillonalaird commented May 10, 2024

shankar-vision-eng left a comment

Choose a reason for hiding this comment

shankar-vision-eng May 13, 2024

Choose a reason for hiding this comment

dillonalaird May 13, 2024

Choose a reason for hiding this comment

shankar-vision-eng left a comment

Choose a reason for hiding this comment