SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

This repository contains source code of research paper "SAFL: A Self-Attention Scene Text Recognizer with Focal Loss", which is accepted as full-paper in 19th Internation Conference on Machine Learning and Applications. Tested on seven scene text recognition benchmarks including: IIIT5K, SVT, ICDAR03, ICDAR13, ICDAR15, SVT-P and CUTE.

@inproceedings{tran2020safl,
  title={SAFL: A Self-Attention Scene Text Recognizer with Focal Loss},
  author={Tran, Bao Hieu and Le-Cong, Thanh and Nguyen, Huu Manh and Le, Duc Anh and Nguyen, Thanh Hung and Le Nguyen, Phi},
  booktitle={2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA)},
  pages={1440--1445},
  year={2020},
  organization={IEEE}
}

Installation

conda env create -f environment.yml
conda install pytorch==1.2.0 torchvision==0.4.0 cudatoolkit=10.0 -c pytorch

Train

bash scripts/stn_att_rec.sh

Test

You can test with .lmdb files by

bash scripts/main_test_all.sh

Or test with single image by

bash scripts/main_test_image.sh

Data preparation

We give an example to construct your own datasets. Details please refer to tools/create_svtp_lmdb.py.

Citation

If you find this project helpful for your research, please cite the following papers:

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
lib		lib
scripts		scripts
.gitignore		.gitignore
README.md		README.md
config.py		config.py
demo.py		demo.py
environment.yml		environment.yml
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

Installation

Train

Test

Data preparation

Citation

About

Contributors 3

Languages

thanhlecongg/SAFL

Folders and files

Latest commit

History

Repository files navigation

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

Installation

Train

Test

Data preparation

Citation

About

Topics

Resources

Stars

Watchers

Forks

Contributors 3

Languages