Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

native resolution support on larger size #17

Open
MonolithFoundation opened this issue Nov 27, 2024 · 4 comments
Open

native resolution support on larger size #17

MonolithFoundation opened this issue Nov 27, 2024 · 4 comments

Comments

@MonolithFoundation
Copy link

Hi, would consider opensource larger native resolution model especially the 1B one?

@DonkeyShot21
Copy link

Thank you for your interest! At the moment we do not have plans to release larger native resolution models. However, we appreciate your feedback and will keep this in mind.

@lucasjinreal
Copy link

lucasjinreal commented Nov 30, 2024

Hello, I am likewise extremely interested in large native models (even those of a size as extensive as 1B).

Currently, vision encoders (VEs) of fixed dimensions are not particularly efficacious when it comes to comprehending large resolutions, such as in document understanding and other related domains. (we can only employ interpolation which significantly sacrifices accuracy).

I hope that your team contemplates open-sourcing large native models to confer more advantages on the community.

@aelnouby
Copy link
Collaborator

aelnouby commented Dec 2, 2024

Thanks for your feedback, @MonolithFoundation and @lucasjinreal! We will consider adding native resolution support for the higher capacity models in our short/mid-term plans and will keep you updated.

@MonolithFoundation
Copy link
Author

Thank u so much for the consideration!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants