
Different mixup loss function between code and paper #18

lukk47 opened this issue Apr 23, 2019 · 3 comments


lukk47 commented Apr 23, 2019

The mixup loss function in the code is as below:
[screenshot of the mixup loss code]
while, according to the paper, the mixing should be done before the loss is computed.

Will these two loss functions give the same result?
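
For readers without the screenshot: the loss in the code is presumably the usual mixup-criterion form, while the paper mixes the (one-hot) targets before computing a single cross-entropy. A hypothetical sketch of the two formulations, with `criterion`, `pred`, `y_a`, `y_b`, and `lam` named as in typical mixup code (not the repo's exact code):

```python
import torch
import torch.nn.functional as F

def mixup_loss_from_code(criterion, pred, y_a, y_b, lam):
    # Formulation in the code: compute the loss against each label set, then mix the losses.
    return lam * criterion(pred, y_a) + (1 - lam) * criterion(pred, y_b)

def mixup_loss_from_paper(pred, y_a, y_b, lam, num_classes):
    # Formulation in the paper: mix the one-hot targets first, then compute one cross-entropy.
    y_mix = (lam * F.one_hot(y_a, num_classes).float()
             + (1 - lam) * F.one_hot(y_b, num_classes).float())
    return -(y_mix * F.log_softmax(pred, dim=1)).sum(dim=1).mean()
```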

@kleinzcy

@LokLu It is the same.

[image: kleinzcy's derivation]

So the two loss functions are the same.

Let me know if I am wrong.
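
The derivation image is not preserved, but the key step is presumably that cross-entropy is linear in the target: for softmax probabilities $p$ and one-hot targets $y_a$, $y_b$,

```math
\ell\bigl(p,\ \lambda y_a + (1-\lambda)\,y_b\bigr)
  = -\sum_k \bigl(\lambda y_{a,k} + (1-\lambda)\,y_{b,k}\bigr)\log p_k
  = \lambda\,\ell(p, y_a) + (1-\lambda)\,\ell(p, y_b)
```

so mixing the targets before the loss and mixing the two losses give the same value.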


lukk47 commented Dec 12, 2019

@kleinzcy Your equations are correct. But the problem is that `pred` in the code is the logits output of the model, not the softmax of `pred`.


lizc126 commented Nov 7, 2020

> @kleinzcy Your equations are correct. But the problem is that `pred` in the code is the logits output of the model, not the softmax of `pred`.

Hi, I just saw this and am curious as well. But shouldn't the criterion ideally take the logits output rather than the softmax of `pred`?
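
As a sanity check (a minimal sketch, assuming the criterion is `nn.CrossEntropyLoss`, which expects raw logits and applies log-softmax internally), the two formulations still agree numerically when fed logits, because the target only enters linearly after the log-softmax:

```python
# Minimal numerical check, assuming nn.CrossEntropyLoss as the criterion
# (it takes raw logits and applies log-softmax internally).
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)
batch, classes, lam = 8, 10, 0.7
logits = torch.randn(batch, classes)       # raw model outputs (logits, not softmax)
y_a = torch.randint(0, classes, (batch,))  # labels of the original batch
y_b = torch.randint(0, classes, (batch,))  # labels of the shuffled batch

criterion = nn.CrossEntropyLoss()

# Code formulation: mix the two losses.
loss_a = lam * criterion(logits, y_a) + (1 - lam) * criterion(logits, y_b)

# Paper formulation: mix the one-hot targets, then compute one cross-entropy.
y_mix = lam * F.one_hot(y_a, classes).float() + (1 - lam) * F.one_hot(y_b, classes).float()
loss_b = -(y_mix * F.log_softmax(logits, dim=1)).sum(dim=1).mean()

print(loss_a.item(), loss_b.item())  # the two values match
```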
