This version switches to a modern build system and adds a utility for stripping remaining tags that may be left by WikiExtractor for older Wikipedia dumps.
- The build system is defined by the
pyproject.toml
file. - Setup options are now mostly set by
setup.cfg
. - The
wiki_remove_tags
script can be used to strip out some common remaining tags from extracted Wikipedia text.