update: Update VLMs Data Collator #55

rs545837 · 2025-02-21T17:57:52Z

Addresses the first part of issue #1559.

Make text and image mixing work efficiently, so that some inputs can be text-only. Must work on Qwen, Llama, and Pixtral.

Works with all the models mentioned above.
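To illustrate the idea of mixing text-only and image+text samples in one batch, here is a minimal sketch in plain Python. It is not the code in this PR: the function name `collate_mixed`, the sample keys `input_ids` and `images`, and the `image_index` field are all hypothetical, and a real collator would call the model's processor and return tensors.

```python
from typing import Any


def collate_mixed(samples: list[dict[str, Any]], pad_id: int = 0) -> dict[str, Any]:
    """Pad token ids to a common length and gather images from the samples that have them.

    Hypothetical sample layout (an assumption, not the PR's format):
      {"input_ids": [int, ...], "images": [img, ...]}   # image + text
      {"input_ids": [int, ...]}                         # text only
    """
    max_len = max(len(s["input_ids"]) for s in samples)
    input_ids, attention_mask = [], []
    images, image_index = [], []

    for i, sample in enumerate(samples):
        ids = list(sample["input_ids"])
        pad = max_len - len(ids)
        # Right-pad the tokens and mask out the padding positions.
        input_ids.append(ids + [pad_id] * pad)
        attention_mask.append([1] * len(ids) + [0] * pad)
        # Text-only samples have no "images" key (or an empty list).
        for img in sample.get("images") or []:
            images.append(img)
            image_index.append(i)  # which row of the batch this image belongs to

    batch: dict[str, Any] = {"input_ids": input_ids, "attention_mask": attention_mask}
    # Only add image fields when at least one sample carries an image,
    # so an all-text batch can skip the vision path entirely.
    if images:
        batch["images"] = images
        batch["image_index"] = image_index
    return batch


mixed = collate_mixed([
    {"input_ids": [1, 2, 3], "images": ["img_a"]},
    {"input_ids": [4]},
])
text_only = collate_mixed([{"input_ids": [5, 6]}, {"input_ids": [7]}])
```

In this sketch an all-text batch simply omits the image fields, which is one way to let the training loop bypass the vision encoder for such batches.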

y22ma · 2025-07-08T20:19:23Z

This is awesome. I will try to single out this collator as a separate implementation so that I can actually train against my mixed text and image+text dataset!

Let me know if there's anything I could do to expedite this.

Update vision_utils.py

a1e5f08
