Add support for DeepseekAI's DeepseekVL #36248

geetu040 · 2025-02-18T07:41:43Z

What does this PR do?

This PR adds DeepseekAI's DeepseekVL model to Hugging Face Transformers.

DeepseekVL is an open-source Vision-Language (VL) Model designed for real-world vision and language understanding applications. DeepSeek-VL possesses general multimodal understanding capabilities, capable of processing logical diagrams, web pages, formula recognition, scientific literature, natural images, and embodied intelligence in complex scenarios.

Relevant Links

Research Paper: DeepSeek-VL: Towards Real-World Vision-Language Understanding
Authors: Haoyu Lu, Wen Liu, Bo Zhang, et al.
Implementation: github.com/deepseek-ai/DeepSeek-VL
Models Weights: huggingface.co/collections/deepseek-ai/deepseek-vl

CC: @Benjamin-eecs, @RERV (github contributors of deepseek-ai/DeepSeek-VL)

Before submitting

Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@ArthurZucker, @Rocketknight1, @Cyrilvallez

TODOs

geetu040 and others added 11 commits February 18, 2025 12:27

upload initial code

f3d1896

update deepseek-vl adaptor

b904f22

update hierarchy of vision model classes

7d44bee

udpate aligner model

a3734d6

Merge branch 'main' into deepseek-vl

d0305b2

add text model

abea4eb

Added Image Processor

65886ec

Added Image Processor

19a7666

Added Image Processor

9c3c544

apply masks

1e49a1f

Merge remote-tracking branch 'fork/deepseek-vl' into deepseek-vl

972ee16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for DeepseekAI's DeepseekVL #36248

Add support for DeepseekAI's DeepseekVL #36248

geetu040 commented Feb 18, 2025

Add support for DeepseekAI's DeepseekVL #36248

Are you sure you want to change the base?

Add support for DeepseekAI's DeepseekVL #36248

Conversation

geetu040 commented Feb 18, 2025

What does this PR do?

Before submitting

Who can review?

TODOs