EvaluationScenario, TestSet and EvaluationTestResult.
Client
To track all our evaluation result we use thetrusttest.client.
Currently, we support two client types:
- file-system: Save the results in the local filesystem.
- neuraltrust: Save the results in the remote NeuralTrust server.
neuraltrust client, target_id is required so saved scenarios, test sets, and runs are scoped to the right NeuralTrust target. You can pass it directly or load it from NEURALTRUST_TARGET_ID.
Save a scenario results
First we need to define our scenario, run the evaluation and get the results.TestSet:
EvaluationTestResult:
file-system client, you can see the results in the trusttest_db folder.