
🚀 Quick Start

info

To run this tutorial, you will need an API key. You can request one by following the authentication instructions. Use the Community channel to bump up your API key request.

In this tutorial, we will use the detoxio.ai command-line tool to scan a large language model (LLM) from the Hugging Face repository and identify vulnerabilities. To complete this tutorial, you will need Python 3.8+ installed on your system.

note

You can try out this tutorial using Docker if you do not already have the required version of Python installed on your system.

docker run --rm -it python:3 bash

Start by installing detoxio-dtx:

python3 -m pip install detoxio-dtx \
detoxio-api-protocolbuffers-python detoxio-api-grpc-python \
--upgrade --extra-index-url https://buf.build/gen/python
note

You must also install PyTorch if it is not already available in your runtime environment.

pip install torch

Google Colab and Kaggle provide free GPU/TPU compute for ML research. You can use either to try out the examples in this tutorial.
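Once PyTorch is installed, you can quickly confirm which device your runtime exposes. This is a generic PyTorch check, not part of the dtx tool itself:

```python
import torch

# Report the installed PyTorch version and whether a CUDA GPU is visible.
# On a CPU-only runtime this selects "cpu"; on a Colab GPU runtime, "cuda".
device = "cuda" if torch.cuda.is_available() else "cpu"
print("PyTorch", torch.__version__, "using device:", device)
```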

Get an API key for api.detoxio.ai by following the instructions for API Authentication. Export the API key as an environment variable for use by the tool:

export DETOXIO_API_KEY='<your-api-key>'
note

If you are scanning a private or gated model from Hugging Face, you must export HF_TOKEN as well. This is not required if the model is accessible without gating.

export HF_TOKEN='<your-hf-token>'

Test a model for security vulnerabilities:

dtx scan --model gpt2 --jsonl /tmp/report.jsonl --verbose
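The scan writes its findings to the JSONL file passed via `--jsonl`, one JSON record per line. The exact fields of each record depend on the dtx report format, so the sketch below simply parses each line and prints its keys for inspection:

```python
import json
from pathlib import Path

def iter_report(path):
    """Yield one parsed record per line of a JSONL report file."""
    report = Path(path)
    if not report.exists():
        return
    with report.open() as fh:
        for line in fh:
            line = line.strip()
            if line:  # skip blank lines
                yield json.loads(line)

# After running the scan, inspect what fields each entry carries.
for record in iter_report("/tmp/report.jsonl"):
    print(sorted(record.keys()))
```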


You can also use detoxio as an SDK in your own application and use case:

from detoxio.scanner import LLMScanner, LLMPrompt, LLMResponse

# This is a placeholder function. Here you should actually
# load an LLM model and perform inference.
def llm(input: str) -> str:
    return input

def prompt_handler(prompt: LLMPrompt) -> LLMResponse:
    model_output = llm(prompt.content)
    return LLMResponse(content=model_output)

scanner = LLMScanner()
scanner.start(prompt_handler=prompt_handler)

For more information on the Python SDK, refer to the documentation.