AnyGPT Unified Multimodal LLM with Discrete Sequence Modeling (last commit Feb 26, 2024)
BLIVA A Simple Multimodal LLM for Better Handling of Text-Rich Visual Question.pdf (last commit Aug 25, 2023)
InstructBLIP Towards General-purpose Vision-Language Models with Instruction Tuning.pdf (last commit May 15, 2023)
LLava Visual Instruction Tuning.pdf (last commit Jul 10, 2023)
LVLM eHub A Comprehensive Evaluation Benchmark for Large VisionLanguage Models.pdf (last commit Sep 4, 2023)
MiniGPT-4 Enhancing Vision-Language Understanding with Advanced Large Language Models.pdf (last commit Jul 10, 2023)
Mirasol3B A Multimodal Autoregressive model for time-aligned and contextual modalities.pdf (last commit Nov 29, 2023)
PaLM-E An Embodied Multimodal Language Model.pdf (last commit Mar 23, 2023)
TabLLM Few-shot Classification of Tabular Data with Large Language Models.pdf (last commit Jul 27, 2023)
Vary Scaling up the Vision Vocabulary for Large Vision-Language Models.pdf (last commit Jan 2, 2024)
Visual ChatGPT Talking, Drawing and Editing with Visual Foundation Models.pdf (last commit Mar 23, 2023)
mPLUG-Owl Modularization Empowers Large Language Models with Multimodality.pdf (last commit Aug 31, 2023)
Choose Your Weapon Survival Strategies for Depressed AI Academics.pdf