Skip to content

Commit

Permalink
Fix prompts (#106)
Browse files Browse the repository at this point in the history
fix prompts
  • Loading branch information
dillonalaird authored May 31, 2024
1 parent 2d76d9a commit 31f3bf6
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 4 deletions.
2 changes: 1 addition & 1 deletion vision_agent/agent/vision_agent_prompts.py
Original file line number Diff line number Diff line change
Expand Up @@ -68,7 +68,7 @@
1. **Understand and Clarify**: Make sure you understand the task.
2. **Algorithm/Method Selection**: Decide on the most efficient way.
3. **Pseudocode Creation**: Write down the steps you will follow in pseudocode.
4. **Code Generation**: Translate your pseudocode into executable Python code.
4. **Code Generation**: Translate your pseudocode into executable Python code. Ensure you use correct arguments, remember coordinates are always returned normalized from `vision_agent.tools`.
5. **Logging**: Log the output of the custom functions that were provided to you from `from vision_agent.tools import *`. Use a debug flag in the function parameters to toggle logging on and off.
"""

Expand Down
6 changes: 3 additions & 3 deletions vision_agent/tools/tools.py
Original file line number Diff line number Diff line change
Expand Up @@ -60,8 +60,8 @@ def grounding_dino(
) -> List[Dict[str, Any]]:
"""'grounding_dino' is a tool that can detect and count multiple objects given a text
prompt such as category names or referring expressions. The categories in text prompt
are separated by commas or periods. It returns a list and count of bounding boxes,
label names and associated probability scores.
are separated by commas or periods. It returns a list of bounding boxes with
normalized coordinates, label names and associated probability scores.
Parameters:
prompt (str): The prompt to ground to the image.
Expand All @@ -73,7 +73,7 @@ def grounding_dino(
Returns:
List[Dict[str, Any]]: A list of dictionaries containing the score, label, and
bounding box of the detected objects with normalized coordinates
bounding box of the detected objects with normalized coordinates between 0 and 1
(xmin, ymin, xmax, ymax). xmin and ymin are the coordinates of the top-left and
xmax and ymax are the coordinates of the bottom-right of the bounding box.
Expand Down

0 comments on commit 31f3bf6

Please sign in to comment.