Jupyter Python notebook project for creating a topic modelling corpus from the BNC

lawsofthought/tantalum

Jupyter Python notebook to make a topic modelling corpus

The notebook creates a corpus of short documents from the British National Corpus (BNC) for use with probabilistic topic models, in particular with the Gustav topic modelling toolbox.

It requires a Python package called bnctools, which will be installed along with all other requirements if you run `pip install -r requirements.txt`.
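
After installing, a quick check that the dependency is visible to Python can save a failed notebook run. The following is only a minimal sketch: the helper `missing_modules` is hypothetical (not part of this repository), and it assumes the package imports under the same name, `bnctools`, as it is called above.

```python
import importlib.util


def missing_modules(names):
    """Return the names from `names` that cannot be found on the import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumption: the package is importable as "bnctools".
    missing = missing_modules(["bnctools"])
    if missing:
        print("Missing:", ", ".join(missing))
        print("Try: pip install -r requirements.txt")
    else:
        print("bnctools is importable.")
```

If the check reports a missing module, rerunning the `pip install` command in the same environment that Jupyter uses is the likely fix.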
