π Step By Step Reasoning ArithmeticΒΆ
metrics.llm_as_judge.direct.criteria.step_by_step_reasoning_arithmetic
CriteriaWithOptions(
name="step_by_step_reasoning_arithmetic",
description="Does this step contain any math equation errors? Note that you should consider only current step in isolation, rather than issues propagated from prior steps.",
prediction_field="step",
context_fields=[
"question",
"premise",
"hypothesis",
"model reasoning",
"correct answer",
],
options=[
CriteriaOption(
name="Yes",
description="",
),
CriteriaOption(
name="No",
description="",
),
],
option_map={
"Yes": 0.0,
"No": 1.0,
},
)
[source]from unitxt.llm_as_judge_constants import CriteriaOption
Explanation about CriteriaWithOptionsΒΆ
Criteria used by DirectLLMJudge to run evaluations.
Explanation about CriteriaOptionΒΆ
A criteria option.
Read more about catalog usage here.