implement stopword filter #8

Open
Tracked by #33 ...
cwulfman opened this issue Mar 6, 2023 · 1 comment

Comments

Contributor

cwulfman commented Mar 6, 2023

Use the NLTK English stopword list in the initial implementation.

Then make it possible to choose other stopword lists from a database of stopword lists.
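
A minimal sketch of what the initial filter could look like, not the project's actual implementation. The names `filter_stopwords` and `nltk_english_stopwords` are hypothetical, and case-insensitive matching is an assumption. The filter takes the stopword list as an argument so that a different list can be swapped in later; loading the NLTK list requires `nltk` to be installed and `nltk.download("stopwords")` to have been run once.

```python
from typing import Collection, Iterable, List


def filter_stopwords(tokens: Iterable[str], stopwords: Collection[str]) -> List[str]:
    """Return the tokens whose lowercase form is not in `stopwords`."""
    # Assumption: matching is case-insensitive, so both sides are lowercased.
    stop = frozenset(word.lower() for word in stopwords)
    return [tok for tok in tokens if tok.lower() not in stop]


def nltk_english_stopwords() -> List[str]:
    """Load NLTK's English stopword list (needs the `stopwords` corpus downloaded)."""
    # Imported lazily so the filter itself has no hard dependency on NLTK.
    from nltk.corpus import stopwords
    return stopwords.words("english")
```

For example, `filter_stopwords(["The", "cat", "and", "the", "hat"], ["the", "and"])` returns `["cat", "hat"]`; passing `nltk_english_stopwords()` as the second argument would use the NLTK list instead.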

@jswatsch

These are the stopword lists we are using: https://github.com/nltk/nltk_data/blob/gh-pages/packages/corpora/stopwords.zip
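
As a rough sketch of how such an archive could back a choice of stopword lists, the function below reads every list in a local copy of the zip into a dictionary keyed by list name. It assumes the archive holds one plain-text file per language with one word per line, plus a README that is skipped; `load_stopword_lists` is a hypothetical name, not an existing API.

```python
import zipfile
from typing import Dict, FrozenSet


def load_stopword_lists(zip_path: str) -> Dict[str, FrozenSet[str]]:
    """Map each list name (the file name inside the archive) to its set of words."""
    lists: Dict[str, FrozenSet[str]] = {}
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            # Use the last path component as the list name, e.g. "english".
            name = info.filename.rsplit("/", 1)[-1]
            # Skip directory entries and the README (assumed not to be a word list).
            if info.is_dir() or not name or name.upper() == "README":
                continue
            text = archive.read(info).decode("utf-8")
            # One word per line is assumed; blank lines are ignored.
            lists[name] = frozenset(
                line.strip() for line in text.splitlines() if line.strip()
            )
    return lists
```

A caller could then pick a list by name, for instance `load_stopword_lists("stopwords.zip")["english"]`, and use it in place of the default English list.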
