An implementation of the "Attention Is All You Need" paper without extra bells and whistles or difficult syntax.
Note: The only extras added are Dropout regularization in some layers and an option to use a GPU. Many more steps might be needed to get the model to work very well, for instance improving the inference speed or using a schedule to shift away from teacher forcing. I haven't experimented much with those; this repository is meant as a reference for the basic transformer setup, complementary to other blog posts/videos/papers one might find online.
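As a rough illustration of the "shift away from teacher forcing" idea mentioned above, here is a minimal scheduled-sampling sketch in PyTorch. It is not this repo's code: the `model.encode`/`model.decode` methods and the linear decay schedule are hypothetical assumptions for the example.

```python
# A minimal sketch of scheduled sampling: as training progresses, the
# decoder is increasingly fed its own predictions instead of the
# ground-truth tokens. All names (model.encode, model.decode, etc.)
# are hypothetical, not from this repo.
import random
import torch

def teacher_forcing_ratio(epoch, total_epochs, floor=0.2):
    # Linearly decay from 1.0 (always teacher force) toward `floor`.
    return max(floor, 1.0 - epoch / total_epochs)

def decode_with_sampling(model, src, tgt, epoch, total_epochs):
    ratio = teacher_forcing_ratio(epoch, total_epochs)
    memory = model.encode(src)                 # hypothetical encoder call
    inputs = tgt[:, :1]                        # start with the BOS token
    outputs = []
    for t in range(1, tgt.size(1)):
        logits = model.decode(inputs, memory)  # hypothetical decoder call
        step_logits = logits[:, -1, :]         # logits for the next position
        outputs.append(step_logits)
        if random.random() < ratio:
            next_token = tgt[:, t:t + 1]       # teacher forcing: gold token
        else:
            next_token = step_logits.argmax(-1, keepdim=True)  # model's own guess
        inputs = torch.cat([inputs, next_token], dim=1)
    return torch.stack(outputs, dim=1)         # (batch, seq_len - 1, vocab)
```

One design note: decoding token by token like this is slower than the fully parallel teacher-forced pass a transformer normally uses during training, which is part of why such schedules are a trade-off rather than a default.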
python -m pip install -r requirements.txt
python train_toy_data.py
python train_translate.py
Training runs on a small subset of 1,000 sentences (included in this repo).