Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Look at using Word2vec #9

Open
waldoj opened this issue Sep 24, 2015 · 1 comment
Open

Look at using Word2vec #9

waldoj opened this issue Sep 24, 2015 · 1 comment
Labels

Comments

@waldoj
Copy link
Member

waldoj commented Sep 24, 2015

I speculate that it might be helpful to use Word2vec as a text pre-processor. Caveat: I barely know what I'm talking about.

@waldoj
Copy link
Member Author

waldoj commented Oct 27, 2015

I've been exploring this a bit today. It's a fine little program, quite speedy, but feeding it unstructured text yields unstructured results. Another problem is that, at present, I'm only giving it a single source of data. Once we're feeding it multiple dictionaries, it should be able to extract better results.

Again, the text needs to be pre-processed (per #10), and we've really got to gin up some method of isolating each term and its definition, instead of blurring them together with the prior and following term and definition.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant