Hi,
I see batch_size = 2 in all experiments of the paper. To make training faster, I set batch_size = 128 or 256 on my dataset with the same model; however, the performance is very unsatisfactory.
Could you tell me why such a small batch size was chosen? Thanks.
bad-meets-joke changed the title from "Why does not large batch size like 128, 256 work well in my dataset?" to "Why does not large batch size like 128, 256 work well?" on Apr 19, 2023