How to reuse best checkpoint and turned hparams.yaml to predict results #39

YonDraco · 2024-04-08T01:57:56Z

I used best checkpoint after turning then used get_lag_llama_predictions function to predict from best checkpoint . However, the predicted result is much lower than when I turned and trained that checkpoint.

The text was updated successfully, but these errors were encountered:

ashok-arjun · 2024-04-08T21:44:16Z

Sorry, I don't understand. Can you elaborate? Which checkpoint did you use?

YonDraco · 2024-04-09T02:05:55Z

@ashok-arjun I saved the best checkpoint epoch=36-step=1850.ckpt after turning and used the get_lag_llama_predictions function as in colab demo 2 to make predictions from this checkpoint. However, when loading this checkpoint to make predictions, the results are worse than when turning. So I think I will have to load both checkpoint and hparams.yml but I don't know how to handle them

ashok-arjun · 2024-04-09T14:18:53Z

Sorry, I don't understand what "turning" is. Do you mean finetuning or training?

YonDraco closed this as completed Apr 8, 2024

YonDraco reopened this Apr 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to reuse best checkpoint and turned hparams.yaml to predict results #39

How to reuse best checkpoint and turned hparams.yaml to predict results #39

YonDraco commented Apr 8, 2024 •

edited

ashok-arjun commented Apr 8, 2024

YonDraco commented Apr 9, 2024

ashok-arjun commented Apr 9, 2024 •

edited

How to reuse best checkpoint and turned hparams.yaml to predict results #39

How to reuse best checkpoint and turned hparams.yaml to predict results #39

Comments

YonDraco commented Apr 8, 2024 • edited

ashok-arjun commented Apr 8, 2024

YonDraco commented Apr 9, 2024

ashok-arjun commented Apr 9, 2024 • edited

YonDraco commented Apr 8, 2024 •

edited

ashok-arjun commented Apr 9, 2024 •

edited