-
Notifications
You must be signed in to change notification settings - Fork 479
FEAT: Toxic Sentence Generator #893
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
@microsoft-github-policy-service agree |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great, a couple of comments
Co-authored-by: Roman Lutz <[email protected]>
- Add Optional type hint to prompt_template parameter - Add unit test for case when no template is provided - Update API reference documentation to include ToxicSentenceGeneratorConverter
…ference - Remove documentation notebook and Python file as recommended by reviewer - Add explicit GitHub link to Project Moonshot's toxic sentence generator module
#901 should fix the build issues you're seeing here. Once it is merged, just merge main into your branch and it should take care of the build issues |
I think you need to run pre-commit to fix the formatting |
Do I run |
Description
This PR adds the
ToxicSentenceGeneratorConverter
converter adapted from Project Moonshot's attack modules. The converter generates toxic sentence starters based on seed prompts to test language model safety guardrails within the PyRIT framework.Work Completed
ToxicSentenceGeneratorConverter
class intoxic_sentence_generator_converter.py
ToxicSentenceGeneratorConverter
classRelated Issue
Contributes to issue #428 and its parent issue #376