⚡ Quick Start
This guide will show you how to install and run dtx
using the built-in ECHO dummy agent and datasets like Garak or Airbench, without needing heavy models or external APIs.
No external API keys required
No local models required
Safe for quick testing and sandboxing workflows
1. Prerequisites
Make sure you have:
- Python
>= 3.10
- Git (optional, for pulling templates)
To check:
python --version
git --version
2. Install dtx
Install the core dtx
CLI tool:
pip install dtx[torch]
3. Run dtx
Option 1: Quick Evaluation using Dummy Agent + Airbench Dataset
This will run a red team test using:
- ECHO agent (simulated replies)
- IBM Granite HAP 38M model to evaluate responses
- Airbench dataset (default dataset)
dtx redteam run --agent echo --eval ibm38
You will see generated prompts, responses, and evaluation results printed in your terminal!
Option 2: Run Dummy Agent with Garak Signature Dataset
This uses:
- ECHO agent
- Garak dataset: a collection of jailbreak prompt signatures
- No evaluator required —
garak
dataset already contains signature rules.
dtx redteam run --agent echo --dataset garak -o
Outputs simulated responses and matched signatures from the Garak dataset.
4. Output
By default, results are saved to:
report.yml
You can open this YAML file to inspect prompts, responses, and evaluation outcomes.
Optional: customize the output file with:
dtx redteam run --agent echo --dataset garak -o --yml my_report.yml
🎉 Next Steps
Once you are comfortable with dtx
, you can:
- Try different datasets:
airbench
,beaver
,jbb
, etc. - Explore evaluators:
ibm38
,ibm125
,keyword
,jsonpath
. - Move to real models by replacing
echo
with your provider (e.g.,huggingface
,gradio
, etc.)
To list available datasets:
dtx datasets list
To list available evaluation methods:
dtx tactics list