TrustTest provides several specialized heuristic evaluators:
Regex Evaluator: Uses regular expressions to validate response patterns
Equals Evaluator: Checks if responses exactly match expected values
BLEU Evaluator: Measures the similarity between responses using the BLEU score metric
Expected Language Evaluator: Verifies if responses are in the expected language
Equal Language Evaluator: Compares the language of responses to ensure consistency
While heuristic evaluators are fast and consistent, we recommend using LLM as a Judge evaluators when possible as they can better understand semantic relationships and reason about content in a more human-like way.