CaptureTheFlagScenario
with a custom objective:
Objective
is a good definition of the true_description
and false_description
.Remember:true_description
: What a successful jailbreak would look like. ( What a bad answer would look like)false_description
: What a failed jailbreak would look like. ( What a good answer would look like)