-
Notifications
You must be signed in to change notification settings - Fork 128
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* added new tools to README * added documentation for chat with workflow * fixed usage for image caption * formatting fix * remove datastore docs * add box contains
- Loading branch information
1 parent
cec32f7
commit 431dae4
Showing
5 changed files
with
48 additions
and
86 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
### LMMs | ||
One of the problems of dealing with image data is it can be difficult to organize and | ||
search. For example, you might have a bunch of pictures of houses and want to count how | ||
many yellow houses you have, or how many houses with adobe roofs. The vision agent | ||
library uses LMMs to help create tags or descriptions of images to allow you to search | ||
over them, or use them in a database to carry out other operations. | ||
|
||
To get started, you can use an LMM to start generating text from images. The following | ||
code will use the LLaVA-1.6 34B model to generate a description of the image you pass it. | ||
|
||
```python | ||
import vision_agent as va | ||
|
||
model = va.lmm.get_lmm("llava") | ||
model.generate("Describe this image", "image.png") | ||
>>> "A yellow house with a green lawn." | ||
``` | ||
|
||
**WARNING** We are hosting the LLaVA-1.6 34B model, if it times out please wait ~3-5 | ||
min for the server to warm up as it shuts down when usage is low. |
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters