UDOP different models #397
Hi, Microsoft released 3 pre-trained UDOP models: https://huggingface.co/collections/microsoft/udop-65e625124aee97415b88b513. All three were pre-trained in a general way and are meant to be fine-tuned for tasks such as DocVQA, classification, or information extraction. The best-performing model is microsoft/udop-large-512-300k, since it uses the highest image resolution (512x512) and was pre-trained the longest.
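To make the "one checkpoint, several tasks" idea concrete, here is a rough sketch of how a single UDOP checkpoint can be steered with a task prompt through the `transformers` API. The helper names (`TASK_PREFIXES`, `build_prompt`, `run_udop`) are made up for illustration, and the classification prefix is an assumption that may not match the notebook exactly. Since the released checkpoints are only pre-trained, expect useful outputs only after fine-tuning on your task.

```python
from typing import List, Optional

# Task prefixes placed in front of the text input.
# "Question answering." follows the transformers UDOP docs example;
# the classification prefix is an assumption for illustration.
TASK_PREFIXES = {
    "qa": "Question answering.",
    "classification": "document classification.",
}


def build_prompt(task: str, question: Optional[str] = None) -> str:
    """Build the text prompt for a task (hypothetical helper)."""
    prefix = TASK_PREFIXES[task]
    return f"{prefix} {question}" if question else prefix


def run_udop(
    image,
    words: List[str],
    boxes: List[List[int]],
    prompt: str,
    checkpoint: str = "microsoft/udop-large-512-300k",
) -> str:
    """Generate text for one document with a UDOP checkpoint (hypothetical helper).

    `words` and `boxes` come from your own OCR step; boxes are expected
    as [x0, y0, x1, y1], normalized to the 0-1000 range.
    """
    # Imported lazily so the helpers above work without loading the model.
    from transformers import AutoProcessor, UdopForConditionalGeneration

    # apply_ocr=False because the OCR results are passed in explicitly.
    processor = AutoProcessor.from_pretrained(checkpoint, apply_ocr=False)
    model = UdopForConditionalGeneration.from_pretrained(checkpoint)

    encoding = processor(
        image, prompt, text_pair=words, boxes=boxes, return_tensors="pt"
    )
    predicted_ids = model.generate(**encoding)
    return processor.batch_decode(predicted_ids, skip_special_tokens=True)[0]
```

The same `run_udop` call would be used for both tasks; only the prompt changes, e.g. `build_prompt("qa", "What is the date on the form?")` versus `build_prompt("classification")`.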
Perfect! Thank you very much for your response!
Good morning @NielsRogge!
As I understand it, the UDOP model can be used for different tasks such as docvqa, classification or information extraction.
Looking at your notebooks for this model, I see that the inference notebook loads the Hugging Face model microsoft/udop-large and uses it for question answering.
My question is: are there pre-trained UDOP models for the different tasks? I haven't found any on Hugging Face.
I have seen that one prompt in the notebook is for classifying the image, but I assume there should be a separate model specific to that task. Does such a model exist, or is there another one I should use?
Thank you so much