Deep-Roo

wrangling kangaroo words

Overview

a kangaroo word contains a subsequence of characters (letters in their original order, not necessarily adjacent) that forms a different word with the same meaning as the kangaroo word - the idea being that a kangaroo carries a little kangaroo in its pouch.

abnormality is a kangaroo word containing anomaly.
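
the subsequence condition on its own is easy to check. a minimal sketch follows; the function names are hypothetical and not taken from the deep_roo package, and the check covers letters only, it says nothing about meaning:

```python
def is_subsequence(small: str, big: str) -> bool:
    """Return True if the letters of `small` appear in `big` in the same order."""
    remaining = iter(big)
    # `ch in remaining` consumes the iterator up to the first match,
    # so each later letter must be found after the previous one
    return all(ch in remaining for ch in small)


def is_kangaroo_candidate(inner: str, outer: str) -> bool:
    """A word does not count as a candidate inside itself."""
    return inner != outer and is_subsequence(inner, outer)


print(is_kangaroo_candidate("anomaly", "abnormality"))  # True
print(is_kangaroo_candidate("normal", "abnormality"))   # True: a subsequence, but not a synonym
```

the second result shows why a similarity measure is still needed: plenty of words pass the letter test without sharing the meaning.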

i want to discover kangaroo words, but rather than use a thesaurus, i can compute vector embeddings of words using word2vec. by ranking each word's valid word subsequences by their proximity to the containing word in embedding space, approximate kangaroo words can be found. because word2vec embeddings can be used to infer analogous relationships, it might even be possible to find classes of kangaroo-like words containing subsequences that fit particular analogies.
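
the ranking step could look roughly like the sketch below. it assumes cosine similarity as the proximity measure (the text above does not fix one), uses made-up 3-dimensional vectors in place of a trained model, and all names are hypothetical:

```python
import math


def is_subsequence(small, big):
    """Return True if the letters of `small` appear in `big` in the same order."""
    remaining = iter(big)
    return all(ch in remaining for ch in small)


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_contained_words(word, vectors):
    """Rank vocabulary words that are subsequences of `word`, most similar first."""
    candidates = [w for w in vectors if w != word and is_subsequence(w, word)]
    scored = [(w, cosine(vectors[word], vectors[w])) for w in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# toy vectors invented for illustration; a real run would look these up
# in the trained word2vec model
toy_vectors = {
    "abnormality": [0.9, 0.1, 0.3],
    "anomaly": [0.8, 0.2, 0.3],
    "normal": [-0.7, 0.4, 0.1],
    "banana": [0.0, 0.9, 0.1],
}

ranked = rank_contained_words("abnormality", toy_vectors)
print([w for w, _ in ranked])  # ['anomaly', 'normal']
```

"banana" is dropped because it is not a subsequence of "abnormality", and "normal" ranks below "anomaly" because its toy vector points away from the containing word.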

my plan is to train a word2vec model on wikipedia's data, and then use that model to rank vocabulary words' subsequences.

Deep-Roo Docker

build Deep-Roo

from this directory run:

docker build -t deep_roo .

run Deep-Roo

copy config.example.sh to config.sh:

cp config.example.sh config.sh

in your config.sh file, replace the values of these environment variables:

CORPUS_DIR: local path where your word2vec training corpus lives
  - either include your own corpus, or use the wikipedia parser (TODO)
MODEL_DIR: local path where the word2vec model will be written and persisted

source your config file:

. config.sh

then spin up the container:

docker-compose up -d

Deep-Roo commands

TODO
