Possible to retrain/keep training an existing model? #102

Open
petulla opened this issue Jun 22, 2020 · 5 comments
Labels
feature-request question Further information is requested

Comments

@petulla
petulla commented Jun 22, 2020

Hi

Given a loaded model, is it possible to train it with more data?

@alvations
Contributor

Which model? Do you mean the truecasing model? Other than that, there's no real model training in sacremoses; it's mostly writing and testing regex rules =)

@alvations alvations added the question Further information is requested label Jun 23, 2020
@petulla
Author

petulla commented Jun 23, 2020

I meant a model already trained with sacremoses. In other words, can you load an existing model and keep training it (add more rules)?

@alvations
Contributor

May I ask which preprocessing task in sacremoses you are referring to? The truecaser?

For other tasks, there's no training involved and the rules are manually defined 😅

@petulla
Author

petulla commented Jun 23, 2020

Yep, the truecaser.

To clarify:

Let's say I load some text into sacremoses and train a truecasing model on it.

Then, two days later, I have some new text and want to update the model.

I want to keep training the existing model on the new text rather than start from scratch.

@petulla petulla closed this as completed Jul 9, 2020
@alvations
Contributor

P.S. I'm thinking about how to put this feature in. It's not hard, but I have to think a little about the user's usage logic =)

I'm a little busy these couple of days. But please keep this issue open, I'll look into it because I think it's worth a try.
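
For readers wondering what "keep training" could look like in practice, here is a minimal sketch of the general idea, assuming a count-based truecaser. It does not use sacremoses: the `IncrementalCaser` class and its `update`, `best_case`, and `truecase` methods are hypothetical names made up for illustration, and the real feature may end up working differently. The point is only that if the raw casing counts are kept around, new text can be folded in by adding to those counts instead of recounting the old corpus.

```python
from collections import Counter, defaultdict


class IncrementalCaser:
    """Toy count-based truecaser (hypothetical, not the sacremoses API)."""

    def __init__(self):
        # lowercased token -> Counter of the surface forms seen for it
        self.counts = defaultdict(Counter)

    def update(self, sentences):
        """Fold counts from new tokenized sentences into the existing ones."""
        for tokens in sentences:
            # Skip the sentence-initial token: its capital letter is
            # positional and says little about the word's usual casing.
            for token in tokens[1:]:
                self.counts[token.lower()][token] += 1

    def best_case(self, token):
        """Return the most frequent surface form, or the token itself if unseen."""
        forms = self.counts.get(token.lower())
        if not forms:
            return token
        return forms.most_common(1)[0][0]

    def truecase(self, tokens):
        return [self.best_case(token) for token in tokens]


caser = IncrementalCaser()
# First batch of text.
caser.update([["We", "visited", "paris", "today"]])
# Two days later: more text arrives, and the existing counts are extended.
caser.update([["Then", "Paris", "again"], ["I", "love", "Paris"]])
print(caser.truecase(["paris", "is", "nice"]))  # ['Paris', 'is', 'nice']
```

The design choice that makes resuming possible is storing counts rather than only the winning form per word: the second `update` call can overturn the earlier "paris" decision because the earlier evidence is still there to be outvoted.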
