Official code for Paper "Mantis: Multi-Image Instruction Tuning"
-
Updated
May 19, 2024 - Python
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development
The Freiburg Vision Test (FrACT) assesses visual acuities and contrast thresholds. It runs in any modern browser, or as webApp.
Blog and Portfolio page.
Google Gemini Voice/Vision Assistant with gemini-1.5-pro / gemini-1.5-flash modal !
FreeGenius AI, an advanced AI assistant that can talk and take multi-step actions. Supports numerous open-source LLMs via Llama.cpp or Ollama or Groq Cloud API, with optional integration with AutoGen agents, OpenAI API, Google Gemini Pro and unlimited plugins.
Automate browser-based workflows with LLMs and Computer Vision
A fully-annotated, open-design dataset of autonomous and piloted high-speed flight
In This Repo I've Built Vision Transformer using PyTorch
An integrated software for network physiology of visual circuits in behaving mice --- 🎥 🔧 💽
🔠️👁️👀️👁️🔡️📖️ The official documentation source repository for the Advanced Eye Chart project.
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
A python framework accelerating ML based discovery in the medical field by encouraging code reuse. Batteries included :)
📸 A powerful, high-performance React Native Camera library.
Recrafting Video Ads with Generative AI
Anthropic Claude API wrapper for Go
Add a description, image, and links to the vision topic page so that developers can more easily learn about it.
To associate your repository with the vision topic, visit your repo's landing page and select "manage topics."