@dvlab-research

DV Lab

Deep Vision Lab

Pinned

  1. LISA Public

    Project Page for "LISA: Reasoning Segmentation via Large Language Model"

    Python · 1.5k stars · 101 forks

  2. LongLoRA Public

    Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

    Python · 2.5k stars · 251 forks

  3. MGM Public

    Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

    Python · 3k stars · 273 forks

  4. LLaMA-VID Public

    Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

    Python · 595 stars · 38 forks

  5. Video-P2P Public

    Video-P2P: Video Editing with Cross-attention Control

    Python · 333 stars · 22 forks

  6. LLMGA Public

    This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

    Python · 259 stars · 17 forks

Repositories

Showing 10 of 63 repositories
  • MGM Public

    Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

    Python · 3,016 stars · Apache-2.0 · 273 forks · 42 issues · 2 pull requests · Updated May 4, 2024
  • MR-GSM8K Public

    Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs

    Python · 35 stars · 0 forks · 2 issues · 0 pull requests · Updated Apr 25, 2024
  • LISA Public

    Project Page for "LISA: Reasoning Segmentation via Large Language Model"

    Python · 1,492 stars · Apache-2.0 · 101 forks · 52 issues · 1 pull request · Updated Apr 8, 2024
  • GroupContrast Public

    [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

    35 stars · MIT · 1 fork · 2 issues · 0 pull requests · Updated Mar 15, 2024
  • Video-P2P Public

    Video-P2P: Video Editing with Cross-attention Control

    Python · 333 stars · 22 forks · 5 issues · 0 pull requests · Updated Mar 12, 2024
  • Parametric-Contrastive-Learning Public

    Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)

    Python · 224 stars · MIT · 29 forks · 5 issues · 0 pull requests · Updated Feb 29, 2024
  • LongLoRA Public

    Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

    Python · 2,478 stars · Apache-2.0 · 251 forks · 40 issues · 1 pull request · Updated Feb 11, 2024
  • Prompt-Highlighter Public

    [CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs

    Python · 102 stars · MIT · 2 forks · 2 issues · 0 pull requests · Updated Jan 25, 2024
  • LLMGA Public

    This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

    Python · 259 stars · Apache-2.0 · 17 forks · 3 issues · 0 pull requests · Updated Jan 22, 2024
  • LLaMA-VID Public

    Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

    Python · 595 stars · Apache-2.0 · 38 forks · 28 issues · 0 pull requests · Updated Jan 10, 2024