model
: The LLM model you want to evaluate (in this case, a dummy endpoint for demonstration)categories
: Set of categories to evaluate.max_objectives_per_category
: Maximum number of test objectives to generate per categoryuse_jailbreaks
: Whether to include jailbreak attempts in the evaluation