dwvisser/word_cloud

word_cloud

A little word cloud generator in Python. Read more about it in the blog post or on the website.

The code is tested against Python 3.7, 3.8, 3.9, 3.10, 3.11, 3.12.

Installation

If you are using pip:

pip install wordcloud

If you are using conda, you can install from the conda-forge channel:

conda install -c conda-forge wordcloud

Installation notes

wordcloud depends on numpy, pillow, and matplotlib.

If no wheels are available for your version of Python, installing the package requires a C compiler. Before installing a compiler, report an issue describing the version of Python and the operating system being used.

Examples

Check out examples/simple.py for a short intro. A sample output is:

Constitution

Or run examples/masked.py to see more options. A sample output is:

Alice in Wonderland

Getting fancy with some colors:

Parrot with rainbow colors

Generating wordclouds for Arabic:

Arabic wordcloud

Command-line usage

The wordcloud_cli tool can be used to generate word clouds directly from the command-line:

$ wordcloud_cli --text mytext.txt --imagefile wordcloud.png

If you're dealing with PDF files, then pdftotext, included by default with many Linux distributions, comes in handy:

$ pdftotext mydocument.pdf - | wordcloud_cli --imagefile wordcloud.png

In the previous example, the - argument tells pdftotext to write the resulting text to stdout, which is then piped to the stdin of wordcloud_cli.

Use wordcloud_cli --help to see all available options.

Licensing

The wordcloud library is MIT licensed, but contains DroidSansMono.ttf, a TrueType font by Google that is Apache licensed. The font is by no means integral, and any other font can be used by setting the font_path parameter when creating a WordCloud object.
