How to train Naive Bayes Classifier? #4

Open
JQuags opened this issue Aug 22, 2020 · 5 comments

Comments

JQuags commented Aug 22, 2020

Is there more information on how to train the classifier?

I see from the source that classifier.json is currently private, which explains the broken links on the site.

The source suggests that removing classifier.json and setting SPAM_CATEGORY and SCAN_DIRECTOR should be all that is needed to train. Is that all, and do I then feed it a directory of spam or ham in EML or ARF format?
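
For anyone following along, the workflow asked about here (pick a category, point at a directory of messages, let the classifier learn from each one) can be sketched generically. This is a minimal Python illustration and not the project's own code: `NaiveBayes`, `tokenize`, `body_text`, and `train_from_directory` are hypothetical names, the tokenizer is deliberately simplistic, and the `category` and `directory` arguments only stand in for the two settings mentioned above. It handles plain EML files and does not unpack ARF reports.

```python
import json
import math
import re
from collections import Counter, defaultdict
from email import message_from_bytes, policy
from pathlib import Path


def tokenize(text):
    # Simplistic, hypothetical tokenizer: lowercase word characters only.
    return re.findall(r"[a-z0-9']+", text.lower())


class NaiveBayes:
    def __init__(self):
        self.word_counts = defaultdict(Counter)  # category -> token -> count
        self.doc_counts = Counter()              # category -> documents seen

    def learn(self, category, text):
        self.doc_counts[category] += 1
        self.word_counts[category].update(tokenize(text))

    def classify(self, text):
        vocab = {t for counts in self.word_counts.values() for t in counts}
        total_docs = sum(self.doc_counts.values())
        tokens = [t for t in tokenize(text) if t in vocab]  # skip unseen tokens
        best, best_score = None, -math.inf
        for category, docs in self.doc_counts.items():
            counts = self.word_counts[category]
            total = sum(counts.values())
            # Log prior plus log likelihoods with Laplace (add-one) smoothing.
            score = math.log(docs / total_docs)
            for token in tokens:
                score += math.log((counts[token] + 1) / (total + len(vocab)))
            if score > best_score:
                best, best_score = category, score
        return best

    def to_json(self):
        # Serialize the learned counts, e.g. to write a model file.
        return json.dumps({"docs": self.doc_counts, "words": self.word_counts})


def body_text(raw_bytes):
    # Parse one EML message and return its subject plus plain-text body.
    msg = message_from_bytes(raw_bytes, policy=policy.default)
    part = msg.get_body(preferencelist=("plain",))
    body = part.get_content() if part is not None else ""
    return f"{msg.get('subject', '')}\n{body}"


def train_from_directory(classifier, category, directory):
    # Feed every .eml file under `directory` to the classifier as `category`.
    for path in sorted(Path(directory).rglob("*.eml")):
        classifier.learn(category, body_text(path.read_bytes()))


nb = NaiveBayes()
nb.learn("spam", "win free money now")
nb.learn("ham", "meeting notes attached")
print(nb.classify("free money"))  # spam
```

In this sketch, training on a real corpus would mean calling `train_from_directory` once per category (one directory of spam, one of ham) and then saving the result of `to_json()`.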

wis commented Sep 13, 2020

I thought you provided a well-trained classifier.json, but the link in the README returns a 404. Why was it removed? @niftylettuce

Author

JQuags commented Sep 14, 2020

  • "(spam dataset is private at the moment)" is what the comments in the source say.

I suspect it has never been provided, and there may be a privacy reason.

Contributor

niftylettuce commented Sep 14, 2020

I should have this published in the near future; for now I have had to put my focus on something else. It is no longer a privacy concern, though, as I have SHA-256 hashed all the tokens.
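
To make the hashing idea concrete, here is a minimal Python sketch of counting SHA-256 digests instead of raw words, so that a published model would not contain the original text. It is an illustration under assumptions, not the project's code: `hash_token` and `hashed_counts` are hypothetical names, and the tokenizer and lowercasing step are placeholders for whatever the real pipeline does.

```python
import hashlib
import re
from collections import Counter


def hash_token(token: str) -> str:
    # Hex SHA-256 digest of the lowercased token (64 hex characters).
    return hashlib.sha256(token.lower().encode("utf-8")).hexdigest()


def hashed_counts(text: str) -> Counter:
    # Simplistic, hypothetical tokenizer; each token is counted by its hash.
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    return Counter(hash_token(t) for t in tokens)


counts = hashed_counts("Free money! Claim your FREE prize")
print(counts[hash_token("free")])  # 2
```

Because the same token always produces the same digest, a classifier can still match incoming tokens against the stored counts by hashing them the same way before lookup.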

wis commented Sep 16, 2020

Good! Can we contribute to the training data by forwarding spam emails from our inboxes to an email address you set up?

Contributor

niftylettuce commented Sep 16, 2020 via email
