Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add DINOv as a new tool #44

Merged
merged 32 commits into from
Apr 19, 2024
Merged

Add DINOv as a new tool #44

merged 32 commits into from
Apr 19, 2024

Commits on Apr 9, 2024

  1. Add DINOv as a new tool

    AsiaCao committed Apr 9, 2024
    Configuration menu
    Copy the full SHA
    44feb20 View commit details
    Browse the repository at this point in the history
  2. Fix lint errors

    AsiaCao committed Apr 9, 2024
    Configuration menu
    Copy the full SHA
    b18eabe View commit details
    Browse the repository at this point in the history
  3. Update docs

    AsiaCao committed Apr 9, 2024
    Configuration menu
    Copy the full SHA
    05b1ad5 View commit details
    Browse the repository at this point in the history

Commits on Apr 17, 2024

  1. Fix param name mismatch (#45)

    Co-authored-by: Yazhou Cao <[email protected]>
    2 people authored and dillonalaird committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    cc2035c View commit details
    Browse the repository at this point in the history
  2. Grammar/Spelling fixes (#46)

    * Update prompts.py
    
    * Update vision_agent_prompts.py
    
    * Update reflexion_prompts.py
    
    * Update vision_agent_prompts.py
    
    * Update easytool_prompts.py
    
    * Update prompts.py
    
    * Update vision_agent_prompts.py
    cmaloney111 authored and dillonalaird committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    5ace291 View commit details
    Browse the repository at this point in the history
  3. Switch to the tools endpoint (#40)

    * get endpoint ready for demo
    
    fixed tools.json
    
    Update vision_agent/tools/tools.py
    
    Bug fixes
    
    * Fix linter errors
    
    * Fix a bug in result parsing
    
    * Include scores in the G-SAM model response
    
    * Removed tools.json , need to find better format
    
    * Fixing the endpoint for CLIP and adding thresholds for grounding tools
    
    * fix mypy errors
    
    * fixed example notebook
    
    ---------
    
    Co-authored-by: Yazhou Cao <[email protected]>
    Co-authored-by: shankar_ws3 <[email protected]>
    3 people authored and dillonalaird committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    0cd57ef View commit details
    Browse the repository at this point in the history
  4. Support streaming chat logs of an agent (#47)

    Add a callback for reporting the chat progress of an agent
    humpydonkey authored and dillonalaird committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    073d40b View commit details
    Browse the repository at this point in the history
  5. Empty-Commit

    AsiaCao authored and dillonalaird committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    dfce50b View commit details
    Browse the repository at this point in the history
  6. Empty-Commit: attempt to fix release

    AsiaCao authored and dillonalaird committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    1498307 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    109eb87 View commit details
    Browse the repository at this point in the history
  8. Fix typo (#48)

    Co-authored-by: Yazhou Cao <[email protected]>
    2 people authored and dillonalaird committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    32c4738 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    a11c12d View commit details
    Browse the repository at this point in the history
  10. Fix a typo in log (#49)

    Fix another typo
    
    Co-authored-by: Yazhou Cao <[email protected]>
    2 people authored and dillonalaird committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    4da5d72 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    e062992 View commit details
    Browse the repository at this point in the history
  12. Fix Baby Cam Use Case (#51)

    * fix visualization error
    
    * added font and score to viz
    
    * changed to smaller font file
    
    * Support streaming chat logs of an agent (#47)
    
    Add a callback for reporting the chat progress of an agent
    
    * fix visualize score issue
    
    * updated descriptions, fixed counter bug
    
    * added visualize_output
    
    * make feedback more concrete
    
    * made naming more consistent
    
    * replaced individual calc ops with calculator tool
    
    * fix random colors
    
    * fix prompts for tools
    
    * update reflection prompt
    
    * update readme
    
    * formatting fix
    
    * fixed mypy errors
    
    * fix merge issue
    
    ---------
    
    Co-authored-by: Asia <[email protected]>
    dillonalaird and humpydonkey committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    b7cdbee View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    ec1a73b View commit details
    Browse the repository at this point in the history
  14. Add image caption tool (#52)

    added image caption tool
    shankar-vision-eng authored and dillonalaird committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    fbe404c View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    9542d22 View commit details
    Browse the repository at this point in the history
  16. refactor: switch model endpoints (#54)

    * Switch the host of model endpoint to api.dev.landing.ai
    * DRY/Abstract out the inference code in tools
    * Introduce LandingaiAPIKey and support loading from .env file
    * Add integration tests for four model tools
    * Minor tweaks/fixes
    * Remove dead code
    * Bump the minor version to 0.1.0
    humpydonkey authored and dillonalaird committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    248070e View commit details
    Browse the repository at this point in the history
  17. Configuration menu
    Copy the full SHA
    66217a4 View commit details
    Browse the repository at this point in the history
  18. Pool Demo (#53)

    * visualized output/reflection to handle extract_frames_
    
    * remove ipdb
    
    * added json mode for lmm, upgraded gpt-4-turbo
    
    * updated reflection prompt
    
    * refactor to make function simpler
    
    * updated reflection prompt, add tool usage doc
    
    * fixed format issue
    
    * fixed type issue
    
    * fixed test case
    dillonalaird committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    fc94a2e View commit details
    Browse the repository at this point in the history
  19. Configuration menu
    Copy the full SHA
    1055aea View commit details
    Browse the repository at this point in the history
  20. feat: allow disable motion detection in frame extraction function (#55)

    * Tweak frame extraction function
    
    * remove default motion detection, extract at 0.5 fps
    
    * lmm now take multiple images
    
    * removed counter
    
    * tweaked prompt
    
    * updated vision agent to reflect on multiple images
    
    * fix test case
    
    * added box distance
    
    * adjusted prompts
    
    ---------
    
    Co-authored-by: Yazhou Cao <[email protected]>
    Co-authored-by: Dillon Laird <[email protected]>
    3 people committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    c439bde View commit details
    Browse the repository at this point in the history
  21. Configuration menu
    Copy the full SHA
    da818ad View commit details
    Browse the repository at this point in the history
  22. Configuration menu
    Copy the full SHA
    33cdf31 View commit details
    Browse the repository at this point in the history
  23. doc changes

    dillonalaird committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    94f94ba View commit details
    Browse the repository at this point in the history
  24. fixed merge issues

    dillonalaird committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    d1d3268 View commit details
    Browse the repository at this point in the history

Commits on Apr 18, 2024

  1. fix color issue

    dillonalaird committed Apr 18, 2024
    Configuration menu
    Copy the full SHA
    81f3cf3 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    2a69082 View commit details
    Browse the repository at this point in the history
  3. formatting fix

    dillonalaird committed Apr 18, 2024
    Configuration menu
    Copy the full SHA
    34580e1 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    6b232ed View commit details
    Browse the repository at this point in the history
  5. fix linting

    dillonalaird committed Apr 18, 2024
    Configuration menu
    Copy the full SHA
    20a687a View commit details
    Browse the repository at this point in the history