π Step By Step Reasoning CoherencyΒΆ
metrics.llm_as_judge.direct.criteria.step_by_step_reasoning_coherency
CriteriaWithOptions(
name="step_by_step_reasoning_coherency",
description="On a scale of 1 (sounds like nonsense) to 5 (easy to parse), does the whole generated response make sense? (Ie, does it sound understandable/non-contradictory/sensible, even if it fails to address the context?)",
prediction_field="generated response",
context_fields=[
"question",
"premise",
"hypothesis",
"correct answer",
],
options=[
CriteriaOption(
name="1",
description="",
),
CriteriaOption(
name="2",
description="",
),
CriteriaOption(
name="3",
description="",
),
CriteriaOption(
name="4",
description="",
),
CriteriaOption(
name="5",
description="",
),
],
option_map={
"1": 0.0,
"2": 0.25,
"3": 0.5,
"4": 0.75,
"5": 1.0,
},
)
[source]from unitxt.llm_as_judge_constants import CriteriaOption
Explanation about CriteriaWithOptionsΒΆ
Criteria used by DirectLLMJudge to run evaluations.
Explanation about CriteriaOptionΒΆ
A criteria option.
Read more about catalog usage here.