π Question Answer QualityΒΆ
metrics.llm_as_judge.direct.criteria.question_answer_quality
CriteriaWithOptions(
name="question_answer_quality",
description="Does the response directly answer the question?",
options=[
CriteriaOption(
name="Excellent",
description="The response directly answers the question.",
),
CriteriaOption(
name="Acceptable",
description="The response is adequate but could be better.",
),
CriteriaOption(
name="Could be Improved",
description="The response relates to the questions but does not directly answer it.",
),
CriteriaOption(
name="Bad",
description="The response does not answer the question at all.",
),
],
option_map={
"Excellent": 1.0,
"Acceptable": 0.75,
"Could be Improved": 0.5,
"Bad": 0.0,
},
)
[source]from unitxt.llm_as_judge_constants import CriteriaOption
Read more about catalog usage here.