Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hi, What if the prior probability of the noise label of the training set is not known? #2

Open
ZunxiaoXu opened this issue Oct 29, 2021 · 1 comment

Comments

@ZunxiaoXu
Copy link

No description provided.

@MuhammadSYahyaS
Copy link

MuhammadSYahyaS commented Dec 12, 2023

I also have similar issue. In the paper itself, it is being said that

The prior probability P(Y~|D~) is counted directly from the noisy dataset.

But, it is also being said that

Beyond their results, we attempt to propose a theoretically sound approach addressing a general instance-based noise regime without knowing or estimating noise rates.

In the implementation, if we set noise_prior = None, then

cores/phase1.py

Line 211 in cf0945e

noise_prior_cur = noise_prior*num_training_samples - noise_prior_delta
will raise an error. I have tried:

  • Initializing it with noise_prior = np.ones(num_classes)
  • Avoiding the two lines of noise_prior_cur update process

and all on CIFAR-10 dataset, but my best result was the best test acc simply dropped from 69.28 (using the dataset's known noise prior) to 46.54. Any lead about how to set the noise_prior on unknown prior probability of the dataset's noise? @haochenglouis

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants