A new SOTA text recognition architecture - SVIPTR #1826
milosacimovic
started this conversation in
Ideas
Replies: 3 comments 2 replies
-
Hi @milosacimovic 👋🏼, Thanks for sharing with us, I will have a look on it after vacation 😊 |
Beta Was this translation helpful? Give feedback.
0 replies
-
Hi @felixdittrich92 👋, Happy holidays! 🎄 |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
I would like to suggest possibly introducing another state-of-the-art text recognition architecture to docTR.
SVIPTR
It's promising accurate results at low latency.
Notably, the SVIPTR-T (Tiny) variant delivers highly competitive accuracy on par with other lightweight models and achieves SOTA inference speeds. Meanwhile, the SVIPTR-L (Large) attains SOTA accuracy in single-encoder-type models, while maintaining a low parameter count and favorable inference speed.
Thanks for your consideration.
Beta Was this translation helpful? Give feedback.
All reactions