
feat: add gpt-neo model handling #42

Open · wants to merge 2 commits into master

Conversation

@onlurking commented Apr 18, 2021

This PR enables GPT-Neo model loading using the same API from Hugging Face transformers.

Related: #40

@onlurking (Author)

Google Colab notebook with GPT-Neo model support:

https://colab.research.google.com/drive/1xqEZeZY3aYl4w859Ej4sCsX-2LxBGU1l?usp=sharing

@paulbricman (Owner)

Thanks for the PR! As mentioned on Discord (https://discord.com/channels/817119487999606794/825717174257319974/833584636533407745), I think using the AutoModel class from transformers would make the implementation somewhat simpler: you can simply give it the local path to the model and it figures out what's in there. What do you think? I'm not sure about the tokenizer, though, but I think both GPT-2 and GPT-Neo use similar tokenizers?
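
A minimal sketch of what this suggestion could look like; the local directory name is hypothetical, and AutoModelForCausalLM is used here rather than bare AutoModel since text generation needs the language-modeling head:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical local checkpoint directory (must contain config.json,
# the weights, and the tokenizer files).
model_path = "models/gpt-neo-125M"

# The Auto classes inspect config.json and instantiate the matching
# architecture (GPT2LMHeadModel, GPTNeoForCausalLM, ...) automatically.
model = AutoModelForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path)
```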

@onlurking (Author)

Hi @paulbricman!

I've followed the transformers docs and both models use the same GPT2Tokenizer class, but it's entirely possible to replace the model-specific code with AutoTokenizer, AutoConfig and AutoModel instead (see the sketch below).

I'll take a look at this after work.
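
A rough sketch of that swap, assuming the existing code instantiates the model-specific classes directly; the checkpoint name is only an example and the project's actual loading code may be organized differently:

```python
from transformers import AutoConfig, AutoModelForCausalLM, AutoTokenizer

# Model-specific loading (both GPT-2 and GPT-Neo ship a GPT2Tokenizer):
#   from transformers import GPT2Tokenizer, GPTNeoForCausalLM
#   tokenizer = GPT2Tokenizer.from_pretrained("EleutherAI/gpt-neo-125M")
#   model = GPTNeoForCausalLM.from_pretrained("EleutherAI/gpt-neo-125M")

# Auto-class loading: the right implementation is picked from the config,
# so the same code path covers "gpt2", "EleutherAI/gpt-neo-125M",
# or a local directory.
checkpoint = "EleutherAI/gpt-neo-125M"
config = AutoConfig.from_pretrained(checkpoint)
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint, config=config)
```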

@onlurking (Author)

@paulbricman done!
