Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

python3.6 train_lstm.py is giving Illegal instruction (core dumped) #1

Open
srini8080 opened this issue Dec 29, 2020 · 9 comments
Open

Comments

@srini8080
Copy link

python3.6 train_lstm.py is giving Illegal instruction (core dumped)

@srini8080
Copy link
Author

(567): (568): (571): --- modulename: _bootstrap_external, funcname: create_module
(921): (922): --- modulename: _bootstrap, funcname: _call_with_frames_removed
Illegal instruction (core dumped)

@safakkbilici
Copy link
Owner

It should work with the dependencies that I mentioned in the project. I think that this error is about hardware utilization. Did you run the code on CPU? May I learn your hardware properties?

@srini8080
Copy link
Author

srini8080 commented Dec 29, 2020 via email

@safakkbilici
Copy link
Owner

Okay, I don't know what is the problem, it works on my computer and my colleagues' computer. I suggest you to train the T5 model to make titles. We used seq2seq lstm as a baseline model, it didn't give good results. But T5 works very good. Also I suggest you to train T5 on colab.

@srini8080
Copy link
Author

srini8080 commented Dec 30, 2020 via email

@safakkbilici
Copy link
Owner

Yes, train.py is given for colab. It also contains script for mounting google drive inside, because it is more practical to save your checkpoints into arbitrary drive directory. By doing that you can use your model after with only loading checkpoints. In train.py model_args has

"best_model_dir": "/content/drive/My Drive/outputs/best_model",
"output_dir": "/content/drive/My Drive/outputs"

That means your checkpoints are saved into "/content/drive/My Drive/outputs" and "/content/drive/My Drive/outputs/best_model". Feel free to change them.

Then you can use generate.py for loading checkpoints with just a line:

model = T5Model("t5","/content/drive/My Drive/outputs/best_model", args=model_args)

But I do not know a dataset which contains strory/title pair :(

@srini8080
Copy link
Author

srini8080 commented Dec 30, 2020 via email

@safakkbilici
Copy link
Owner

If the data has article/title pairs within pandas dataframe, you can use it yes. Do not forget that make your input column name input_text and target column name (in this example titles) target_text.
Btw I haven't heard this data. May you send me a link?

@srini8080
Copy link
Author

srini8080 commented Dec 31, 2020 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants