Step By Step Reasoning Bad Final Answer
metrics.llm_as_judge.direct.criteria.step_by_step_reasoning_bad_final_answer
CriteriaWithOptions(
    name="step_by_step_reasoning_bad_final_answer",
    description="Does this step contain a final step with an incorrect final answer? (If an explicit 'yes/no' is not provided, an exact match of the correct answer with respect to the question in the context must be given.)",
    prediction_field="step",
    context_fields=[
        "question",
        "premise",
        "hypothesis",
        "model reasoning",
        "correct answer",
    ],
    options=[
        CriteriaOption(
            name="Yes",
            description="",
        ),
        CriteriaOption(
            name="No",
            description="",
        ),
    ],
    option_map={
        "Yes": 0.0,
        "No": 1.0,
    },
)
from unitxt.llm_as_judge_constants import CriteriaOption, CriteriaWithOptions
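The option_map in the definition above assigns 0.0 to "Yes" (the step ends in an incorrect final answer) and 1.0 to "No", so a higher score means a better step. The following is a minimal sketch of that mapping in plain Python, not the unitxt implementation: the helper names score_for_option and mean_score are hypothetical, and the dictionary is simply copied from the criteria definition.

```python
# Illustrative sketch only: a plain dict stands in for the criteria's option_map.
# The helper names below are hypothetical and are not part of the unitxt API.
OPTION_MAP = {"Yes": 0.0, "No": 1.0}  # copied from the criteria definition


def score_for_option(selected_option: str) -> float:
    """Return the numeric score for the option a judge selected."""
    if selected_option not in OPTION_MAP:
        raise ValueError(f"Unknown option: {selected_option!r}")
    return OPTION_MAP[selected_option]


def mean_score(selected_options: list[str]) -> float:
    """Average the scores of several judged steps."""
    scores = [score_for_option(option) for option in selected_options]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # "No" means the step does NOT end in an incorrect final answer.
    print(score_for_option("No"))
    print(mean_score(["No", "No", "Yes", "No"]))
```

Because "Yes" flags a defect, the map is inverted relative to the option names: a judge answering "Yes" lowers the score rather than raising it.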
Explanation about CriteriaWithOptions
Criteria used by DirectLLMJudge to run evaluations.
Explanation about CriteriaOption
A criteria option.
Read more about catalog usage here.