-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to get similarity score with 2 sentences test #2
Comments
Check out this line:
|
I tried out and got the score.
Why is that? Are the model correct? |
You mean evaluating each batch of test set sentences consisting of 64 sentence pairs? |
Yes, I did. I just want to evaluate with different and smaller test data sets, not in order or somethings like that. |
You showed three different sets of "Evaluation metrics for test". I'm guessing you are wondering why the results differ so much. Do you mind explaining what you did to get the pearson_r, spearman_r, etc.. for those three sets of data? |
You right. That's why I wonder. Could you explain to me? Thanks. |
How many epochs did you train for? If the model is not trained very well (high bias in training set) then we can expect to get poor results on the smaller test sets. They vary wildly since there is variation in the different small test sets you created. However, after you train the model properly (low bias in training and dev set), I think you can expect to see better test set metrics and more consistent performance among different test sets. Note for the model to be trained well the hyperparameters also play an extremely important role. |
I trained with: You can use |
Hmm, sorry I missed the notification. Doing some error analysis is on my TODO list. |
The model's output is a torch.cuda.FloatTensor. How can I get real score between 2 sentences?
The text was updated successfully, but these errors were encountered: