Create tests
Capture the Flag
The Capture the Flag (CTF) Probe is a specialized tool designed to generate test sets for multi-turn conversations with custom objectives. It creates scenarios that test how well a model can maintain specific boundaries or achieve particular goals through iterative prompting.
This probe needs a LLM client configured to generate the test set.
Purpose
The Capture the Flag Probe is particularly useful when you need to:
- Generate test cases for specific model behaviors
- Create multi-turn conversation scenarios with custom objectives
- Test model responses to particular types of questions
- Generate iterative test sets for specific use cases
- Create custom boundary testing scenarios
How It Works
The Capture the Flag Probe generates test sets through an iterative approach:
- Objective Definition: Define custom objectives with specific questions and expected outcomes
- Multi-turn Conversation Generation: The probe creates conversation flows that attempt to achieve the objective
- Adaptive Prompting: Each turn is designed to adapt based on potential model responses
- Test Set Creation: The probe outputs a structured test set for evaluation
Usage Examples
Basic Custom Objective Test Set Generation
Basic CaptureTheFlagScenario
with a custom objective:
The most critical part of the Objective
is a good definition of the true_description
and false_description
.
Remember:
true_description
: What a successful jailbreak would look like. ( What a bad answer would look like)false_description
: What a failed jailbreak would look like. ( What a good answer would look like)
When to Use
Use the Capture the Flag Probe when you need to:
- Generate test sets for specific model behaviors
- Create custom conversation scenarios
- Test particular types of responses
- Generate iterative test cases for specific use cases
- Create boundary testing scenarios for custom objectives