Lemmy Mod Bot

A Lemmy bot written in Python that helps you automatically moderate communities by detecting toxicity, duplicate posts, and CSAM content.

This bot is a work in progress and is expected to expand and grow.

Running the bot

A Docker image is provided so the moderation bot can easily be run in a containerised environment. An example docker-compose file is provided for running the bot with only toxicity detection enabled. To set up further modules (as detailed below), mount a replacement main.py file at /app/main.py; a hypothetical sketch of such a file is shown at the end of this section.

The bot can also be run un-containerised, either by cloning the repository or by installing the pip package.
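As a rough guide, a replacement main.py might look something like the sketch below. The package layout, class names, and constructor arguments shown here are assumptions for illustration only; consult the repository's own main.py and processor modules for the actual interface.

  # Hypothetical main.py sketch. The imports, class names, and constructor
  # arguments are illustrative assumptions, not the bot's confirmed API;
  # check the repository's own main.py for the real interface.
  from lemmymodbot import LemmyBot                  # assumed entry-point class
  from lemmymodbot.processors import (              # assumed module layout
      ToxicityProcessor,
      BlacklistProcessor,
      TitleConformityProcessor,
  )

  bot = LemmyBot(
      instance="https://lemmy.example.org",  # Lemmy instance to connect to
      username="modbot",                     # bot account credentials
      password="change-me",
      community="example_community",         # community to moderate
      processors=[
          # Report posts/comments scoring above a toxicity threshold.
          ToxicityProcessor(threshold=0.8),
          # Report posts/comments containing blacklisted words.
          BlacklistProcessor(words=["spamword", "another-spamword"]),
          # Warn posters whose titles do not match the required pattern.
          TitleConformityProcessor(pattern=r"^\[[A-Z]+\] .+"),
      ],
  )

  if __name__ == "__main__":
      bot.run()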

Modules

Different aspects of moderation are divided into "Processors". Each processor scans and reports content for a single kind of violation and can be configured individually. Currently, there are nine different processors:

  • BlacklistProcessor - This processor allows moderators to provide a list of blacklisted words. If they are encountered in any comment or post, a report is generated.
  • MimeWhitelistProcessor - This processor whitelists the MIME types of files permitted in the community. If a post or comment contains a file whose MIME type is not on the whitelist, a report is generated.
  • MimeBlacklistProcessor - The same as the whitelist above, except that files with listed MIME types are reported rather than permitted.
  • PhashProcessor - Calculates and tracks a perceptual hash (phash) for each post. If the image has been uploaded before, the bot posts a comment linking to the duplicates (see the phash sketch after this list).
  • TitleConformityProcessor - Enforces a certain title structure (specified using Regex) and warns a poster if their title does not match.
  • UserProcessor - Automatically reports content posted by a listed user.
  • ToxicityProcessor - Assesses the toxicity of any post/comment and automatically files a report if it exceeds a specified threshold (see the toxicity sketch after this list).
  • PhotoDNAProcessor (untested) - Given a Microsoft PhotoDNA API Key, this processor checks posts for Child Sexual Abuse Material (CSAM). If any violating content is found, the post is automatically deleted. Reports of this nature are sent to the provided Matrix server. This processor is untested as I am unable to obtain an API key.
  • SpamImageProcessor - Given a collection of phashes (which can be calculated using the included tool phash_tool.py), this processor scans all posts and comments in the community and deletes any that contain the blacklisted images (see the phash sketch after this list).
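
Both PhashProcessor and SpamImageProcessor rely on perceptual hashes. The snippet below is only a rough illustration of what a phash comparison looks like, using the third-party Pillow and imagehash packages; the included phash_tool.py may compute hashes with different parameters, so hashes produced by this snippet are not guaranteed to be interchangeable with the bot's.

  # Illustration only: computes perceptual hashes with the imagehash package.
  # The bot's bundled phash_tool.py may use different hash parameters.
  from PIL import Image
  import imagehash

  hash_a = imagehash.phash(Image.open("original.jpg"))
  hash_b = imagehash.phash(Image.open("reupload.jpg"))

  # Visually similar images produce hashes that differ in only a few bits,
  # so a small Hamming distance suggests a duplicate or blacklisted image.
  if hash_a - hash_b <= 4:
      print("Likely duplicate:", hash_a, hash_b)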
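
The README does not state which model ToxicityProcessor uses; the sketch below shows what thresholded toxicity scoring generally looks like, using the third-party Detoxify library purely as an example.

  # Illustration only: the bot may use a different toxicity model entirely.
  from detoxify import Detoxify

  THRESHOLD = 0.8  # example value; the processor's threshold is configurable

  scores = Detoxify("original").predict("an example comment to assess")
  if scores["toxicity"] > THRESHOLD:
      print("Would file a report; toxicity score:", scores["toxicity"])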
