Purpose
The Equals Evaluator is particularly useful when you need to:
- Verify exact string matches
- Ensure responses match predefined templates
- Validate fixed-format outputs
- Check for precise command or code outputs
- Test exact response requirements
How It Works
The evaluator uses a simple binary scoring system:
- Score 1: The response exactly matches the expected output
- Score 0: The response does not match the expected output
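The binary scoring rule can be sketched in a few lines of Python. This is an illustration only, not the evaluator's actual implementation: the function name `equals_score` is hypothetical, and the sketch assumes a plain, case-sensitive string comparison with no trimming or normalization.

```python
def equals_score(response: str, expected: str) -> int:
    """Return 1 if the response exactly matches the expected output, else 0.

    Hypothetical helper: assumes a strict, case-sensitive comparison
    with no whitespace trimming or normalization.
    """
    return 1 if response == expected else 0


if __name__ == "__main__":
    print(equals_score("ls -la", "ls -la"))   # identical strings
    print(equals_score("ls -la", "ls -la "))  # trailing space differs
```

Under this assumption, any difference at all (a trailing space, a different letter case) yields a score of 0, which is why the evaluator suits fixed-format outputs rather than free-form text.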
Usage Example
The evaluator returns:
- A binary score (0 or 1) indicating exact match status
- A list of explanations including:
  - A success message if the response matched
  - A failure message with both the expected and received responses if it did not match
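As a rough sketch of the result shape described here, the following Python mimics a score plus a list of explanations. The names `EqualsResult` and `evaluate_equals`, and the exact message wording, are assumptions made for illustration; they are not taken from the evaluator's real API.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class EqualsResult:
    """Hypothetical result container: a binary score plus explanations."""
    score: int
    explanations: List[str] = field(default_factory=list)


def evaluate_equals(response: str, expected: str) -> EqualsResult:
    """Compare response to expected and explain the outcome.

    Assumes a strict, case-sensitive string comparison.
    """
    if response == expected:
        # Success: a single confirmation message.
        return EqualsResult(1, ["Response exactly matches the expected output."])
    # Failure: report both sides so the mismatch is easy to inspect.
    message = (
        "Response does not match the expected output. "
        f"Expected: {expected!r}. Received: {response!r}."
    )
    return EqualsResult(0, [message])


if __name__ == "__main__":
    result = evaluate_equals("git status", "git status --short")
    print(result.score)
    for line in result.explanations:
        print(line)
```

Using `repr` formatting (`!r`) in the failure message makes invisible differences such as trailing whitespace or newlines visible in the explanation.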
When to Use
Use the Equals Evaluator when you need to:
- Verify exact command outputs
- Validate fixed-format responses
- Check for precise string matches
- Test template-based responses
- Ensure exact compliance with specifications
- Validate code snippets or commands