-
Notifications
You must be signed in to change notification settings - Fork 28.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add FAST #35476
base: main
Are you sure you want to change the base?
Add FAST #35476
Conversation
everything matching! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great work! Thanks for adding the model 🚀.
I did an initial review, please see the comments below. I didn't highlight everything, however please apply the recommendations throughout the code. Looking forward for getting this merged!
## Overview | ||
|
||
Fast model proposes an accurate and efficient scene text detection framework, termed FAST (i.e., faster | ||
arbitrarily-shaped text detector). | ||
|
||
FAST has two new designs. (1) We design a minimalist kernel representation (only has 1-channel output) to model text | ||
with arbitrary shape, as well as a GPU-parallel post-processing to efficiently assemble text lines with a negligible | ||
time overhead. (2) We search the network architecture tailored for text detection, leading to more powerful features | ||
than most networks that are searched for image classification. | ||
|
||
## FastConfig | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We would need code snippet and some picture here
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
will leave this till the end yeah!
What does this PR do?
This PR adds FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation.
It should be merged after the first pr for its backbone, textnet, is merged: #34979
Colab to replicate the author's logits: https://colab.research.google.com/drive/1bdkNiRI2bl7rBcgGYXe2UeobX78TUGYY?usp=sharing
What's left: