
Highlights
- Pro
Lists (4)
Sort Name ascending (A-Z)
Starred repositories
Reading list for research topics in multimodal machine learning
About Code release for "Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy" (ICLR 2022 Spotlight), https://openreview.net/forum?id=LzQQ89U1qm_
A Practical Course on Embeddings, RAG, Multimodal Models, and Agents with Amazon Nova.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
All the open source AI Agents hosted on the oTTomator Live Agent Studio platform!
Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs (ECCV 2024)
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
A modular graph-based Retrieval-Augmented Generation (RAG) system
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…
Empower Large Language Models (LLM) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge intensive tasks
This is a project, where I give you a way to use Autodesk Fusion 360 on Linux!
This repository contains the official firmware for Meshtastic, an open-source, off-grid mesh communication system.
A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT
⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other building blocks
Low cost motion capture system for room scale tracking
The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and self-supervise…
A modern desktop interface for Linux. Improve your user experience and get rid of the anarchy of traditional desktop workflows. Designed to simplify navigation and reduce the need to manipulate win…
Generative Agents: Interactive Simulacra of Human Behavior
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A python library for user-friendly forecasting and anomaly detection on time series.
A Unified Library for Parameter-Efficient and Modular Transfer Learning
Meta-Transformer for Unified Multimodal Learning
Dual Swin Transformer for video-time-series fusion