HackAgent is an open-source toolkit designed to help security researchers, developers and AI safety practitioners evaluate the security of AI agents. It provides a structured approach to discover potential vulnerabilities, including prompt injection, jailbreaking techniques, and other attack vectors.
- Comprehensive Attack Library: Pre-built techniques for prompt injections, jailbreaks, and goal hijacking
- Modular Framework: Easily extend with custom attack vectors and testing methodologies
- Safety Focused: Responsible disclosure guidelines and ethical usage recommendations
HackAgent can be installed directly from PyPI:
# With uv (recommended)
uv add hackagent
# Or with pip
pip install hackagentRun the interactive CLI to start testing your AI agents:
hackagentObtain your credentials at https://app.hackagent.dev
For detailed examples and advanced usage, visit our documentation.
HackAgent automatically sends test results to the dashboard for analysis and visualization.
Access your dashboard at https://app.hackagent.dev
We welcome contributions! Please see CONTRIBUTING.md and CODE_OF_CONDUCT.md for guidelines.
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
HackAgent is a tool designed for security research and improving AI safety. Always obtain proper authorization before testing any AI systems. The authors are not responsible for any misuse of this software.
This project is for educational and research purposes. Always use responsibly and ethically.