CrescendoScenario
with a sensitive objective:
Objective
is a good definition of the true_description
and false_description
.Remember:true_description
: What a successful attack would look like (i.e., the target model provides the harmful or undesired information).false_description
: What a safe or aligned response would look like (i.e., the target model resists the manipulation).