#

vision

Here are 1,496 public repositories matching this topic...

TIGER-AI-Lab / Mantis

Official code for Paper "Mantis: Multi-Image Instruction Tuning"

language video vision mantis vlm multimodal lmm fuyu mllm llava-llama3 multi-image-understanding

Updated May 19, 2024
Python

alexdredmon / crayeye

Multimodal LLM visual analysis multitool

app mobile ai vision llm

Updated May 18, 2024
Dart

danny-avila / LibreChat

Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development

Updated May 18, 2024
TypeScript

GoogleCloudPlatform / java-docs-samples

Java and Kotlin Code samples used on cloud.google.com

kotlin java appengine video cdn auth samples vision translate automl

Updated May 18, 2024
Java

FrACT10

michaelbach / FrACT10

The Freiburg Vision Test (FrACT) assesses visual acuities and contrast thresholds. It runs in any modern browser, or as webApp.

contrast vision psychophysics cappuccino objective-j visual-acuity

Updated May 18, 2024
Objective-J

Sanj-bot / codingINCV

The repo contains projects and learning related to computer vision

opencv computer vision cv2

Updated May 18, 2024
Python

prajolshrestha / prajolshrestha.github.io

Blog and Portfolio page.

machine-learning deep-learning signal-processing artificial-intelligence computer vision

Updated May 18, 2024

youkpan / gemini-assistant

Google Gemini Voice/Vision Assistant with gemini-1.5-pro / gemini-1.5-flash modal !

computer-vision assistant gemini webapp vision llm google-gemini gemini-pro gemini-15-pro gpt-4o gemini-flash

Updated May 18, 2024
TypeScript

eliranwong / freegenius

FreeGenius AI, an advanced AI assistant that can talk and take multi-step actions. Supports numerous open-source LLMs via Llama.cpp or Ollama or Groq Cloud API, with optional integration with AutoGen agents, OpenAI API, Google Gemini Pro and unlimited plugins.

google ai gemini vision openai mistral autogen groq stable-diffusion chatgpt llava llamacpp ollama llama3

Updated May 18, 2024
Python

Skyvern-AI / skyvern

Automate browser-based workflows with LLMs and Computer Vision

python api workflow automation browser computer vision gpt browser-automation rpa playwright llm

Updated May 17, 2024
Python

tii-racing / drone-racing-dataset

A fully-annotated, open-design dataset of autonomous and piloted high-speed flight

control computer-vision robotics path-planning dataset vision motion-capture quadrotor visual-inertial-odometry motion-capture-data ros2 drone-racing autonomous-robots scene-understanding inertial-data

Updated May 17, 2024
Python

Amr-Abdellatif / Building-a-Vision-Transformer-from-scratch-using-PyTorch

In This Repo I've Built Vision Transformer using PyTorch

pytorch vision pytorch-implementation vision-transformer

Updated May 17, 2024
Jupyter Notebook

physion

yzerlaut / physion

An integrated software for network physiology of visual circuits in behaving mice --- 🎥 🔧 💽

neuroscience vision electrophysiology imaging

Updated May 17, 2024
Jupyter Notebook

yankailab / OpenKAI

OpenKAI: A modern framework for unmanned vehicle and robot control

framework robot drone pixhawk vision jetson unmanned

Updated May 17, 2024
C

seanpm2001 / Advanced_Eye_Chart_Docs

🔠️👁️👀️👁️🔡️📖️ The official documentation source repository for the Advanced Eye Chart project.

Updated May 17, 2024
Markdown

mees / calvin

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

natural-language-processing computer-vision deep-learning robotics pytorch vision manipulation vision-and-language grounding vision-language

Updated May 17, 2024
Python

BiomedSciAI / fuse-med-ml

A python framework accelerating ML based discovery in the medical field by encouraging code reuse. Batteries included :)

Updated May 17, 2024
Python

mrousavy / react-native-vision-camera

📸 A powerful, high-performance React Native Camera library.

Updated May 15, 2024
Swift

google-marketing-solutions / vigenair

Recrafting Video Ads with Generative AI

machine-learning video ai google-cloud vision video-editing video-ads video-generation vertex-ai large-language-models llm generative-ai video-to-video

Updated May 15, 2024
TypeScript

liushuangls / go-anthropic

Anthropic Claude API wrapper for Go

go golang ai vision streaming-api claude tool-use llm anthropic claude-ai claude-api function-calling

Updated May 15, 2024
Go

Improve this page

Add a description, image, and links to the vision topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vision topic, visit your repo's landing page and select "manage topics."