Feature Request: Janus-Pro integration #339

miasik · 2025-02-19T22:46:32Z

Janus-Pro is an advanced version of the previous work Janus. Specifically, Janus-Pro incorporates (1) an optimized training strategy, (2) expanded training data, and (3) scaling to a larger model size. With these improvements, Janus-Pro achieves significant advancements in both multimodal understanding and text-to-image instruction-following capabilities, while also enhancing the stability of text-to-image generation.
Janus is a novel autoregressive framework that unifies multimodal understanding and generation. It addresses the limitations of previous approaches by decoupling visual encoding into separate pathways, while still utilizing a single, unified transformer architecture for processing. The decoupling not only alleviates the conflict between the visual encoder’s roles in understanding and generation, but also enhances the framework’s flexibility. Janus surpasses previous unified models and matches or exceeds the performance of task-specific models. The simplicity, high flexibility, and effectiveness of Janus make it a strong candidate for next-generation unified multimodal models.
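
To make the "separate pathways, one transformer" idea concrete, here is a rough toy sketch in plain Python. It is not the Janus-Pro code or API: every function name, the embedding width, and the arithmetic are hypothetical stand-ins chosen for illustration. It only shows the shape of the design, where an understanding encoder and a generation encoder each get their own adaptor, and both adaptors emit vectors of the same width for a single shared backbone.

```python
# Toy illustration only: names, shapes, and arithmetic are hypothetical,
# not the Janus-Pro implementation or API.
from typing import List

EMBED_DIM = 4  # width of the shared backbone input space (assumed)

Vector = List[float]


def understanding_encoder(image: List[float]) -> List[Vector]:
    """Pathway 1: map pixels to continuous features (one vector per pixel here)."""
    return [[p, p * p] for p in image]


def generation_encoder(image: List[float], codebook_size: int = 8) -> List[int]:
    """Pathway 2: quantize pixels in [0, 1] to discrete token ids."""
    return [min(int(p * codebook_size), codebook_size - 1) for p in image]


def understanding_adaptor(features: List[Vector]) -> List[Vector]:
    """Project continuous features into the shared embedding space."""
    return [[f[0], f[1], 0.0, 0.0] for f in features]


def generation_adaptor(token_ids: List[int]) -> List[Vector]:
    """Embed discrete token ids into the same shared embedding space."""
    return [[0.0, 0.0, float(t), 1.0] for t in token_ids]


def unified_backbone(sequence: List[Vector]) -> Vector:
    """Stand-in for the single shared transformer: mean-pool the sequence."""
    n = len(sequence)
    return [sum(v[i] for v in sequence) / n for i in range(EMBED_DIM)]


image = [0.0, 0.5, 1.0]
understanding_input = understanding_adaptor(understanding_encoder(image))
generation_input = generation_adaptor(generation_encoder(image))

# Both pathways produce vectors of the same width, so one backbone serves both.
understanding_out = unified_backbone(understanding_input)
generation_out = unified_backbone(generation_input)
```

In this sketch the two encoders can be swapped or retrained independently, since the only contract between them and the backbone is the shared embedding width.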
