π Step By Step Reasoning Overall QualityΒΆ
metrics.llm_as_judge.direct.criteria.step_by_step_reasoning_overall_quality
CriteriaWithOptions(
name="step_by_step_reasoning_overall_quality",
description="On a scale of 1 (incomprehensible and wrong) to 5 (clear and correct), does the generated response answer the question in a well-justified manner?",
prediction_field="generated response",
context_fields=[
"question",
"premise",
"hypothesis",
"correct answer",
],
options=[
CriteriaOption(
name="1",
description="",
),
CriteriaOption(
name="2",
description="",
),
CriteriaOption(
name="3",
description="",
),
CriteriaOption(
name="4",
description="",
),
CriteriaOption(
name="5",
description="",
),
],
option_map={
"1": 0.0,
"2": 0.25,
"3": 0.5,
"4": 0.75,
"5": 1.0,
},
)
[source]from unitxt.llm_as_judge_constants import CriteriaOption
Explanation about CriteriaWithOptionsΒΆ
Criteria used by DirectLLMJudge to run evaluations.
Explanation about CriteriaOptionΒΆ
A criteria option.
Read more about catalog usage here.