update README
zhang lu committed Jan 26, 2025
1 parent f8b1a20 commit fd627c3
Showing 3 changed files with 9 additions and 2 deletions.
7 changes: 6 additions & 1 deletion README.md
@@ -27,6 +27,7 @@ OmAgent is python library for building multimodal language agents with ease. We
- A flexible agent architecture that provides a graph-based workflow orchestration engine and various memory types enabling contextual reasoning.
- Native multimodal interaction support, including VLM models, real-time API, computer vision models, mobile connection, etc.
- A suite of state-of-the-art unimodal and multimodal agent algorithms that goes beyond simple LLM reasoning, e.g. ReAct, CoT, SC-CoT, etc.
- Supports local deployment of models. You can deploy your own models locally by using [Ollama](./docs/concepts/models/Ollama.md) or [LocalAI](./examples/video_understanding/docs/local-ai.md).


## 🛠️ How To Install
@@ -91,7 +92,11 @@ For more information about the container.yaml configuration, please refer to the

## 🤖 Example Projects
### 1. Video QA Agents
Build a system that can answer any questions about uploaded videos with video understanding agents. See Details [here](examples/video_understanding/README.md).
Build a system that can answer any questions about uploaded videos with video understanding agents. We provide a Gradio-based application; see details [here](examples/video_understanding/README.md).
<p >
<img src="docs/images/video_understanding_gradio.png" width="500"/>
</p>

More about the video understanding agent can be found in the [paper](https://arxiv.org/abs/2406.16620).
<p >
<img src="docs/images/OmAgent.png" width="500"/>
Binary file added docs/images/video_understanding_gradio.png
4 changes: 3 additions & 1 deletion omagent-core/pyproject.toml
@@ -1,7 +1,7 @@

[tool.poetry]
name = "omagent_core"
version = "0.2.2"
version = "0.2.3"
description = "Core package for OmAgent"
authors = ["OM AI Lab <[email protected]>"]
readme = "README.md"
@@ -49,6 +49,8 @@ gradio = "^5.7.1"
tavily-python = "^0.5.0"
opencv-python = "^4.10.0.84"
wikipedia = "^1.4.0"
func_timeout = "^4.3.5"
sympy = "^1.13.3"

[build-system]
requires = ["poetry-core"]