πŸ“„ Step By Step Reasoning Overall QualityΒΆ

metrics.llm_as_judge.direct.criteria.step_by_step_reasoning_overall_quality

CriteriaWithOptions(
    name="step_by_step_reasoning_overall_quality",
    description="On a scale of 1 (incomprehensible and wrong) to 5 (clear and correct), does the generated response answer the question in a well-justified manner?",
    prediction_field="generated response",
    context_fields=[
        "question",
        "premise",
        "hypothesis",
        "correct answer",
    ],
    options=[
        CriteriaOption(
            name="1",
            description="",
        ),
        CriteriaOption(
            name="2",
            description="",
        ),
        CriteriaOption(
            name="3",
            description="",
        ),
        CriteriaOption(
            name="4",
            description="",
        ),
        CriteriaOption(
            name="5",
            description="",
        ),
    ],
    option_map={
        "1": 0.0,
        "2": 0.25,
        "3": 0.5,
        "4": 0.75,
        "5": 1.0,
    },
)
[source]

from unitxt.llm_as_judge_constants import CriteriaOption

Explanation about CriteriaWithOptionsΒΆ

Criteria used by DirectLLMJudge to run evaluations.

Explanation about CriteriaOptionΒΆ

A criteria option.

Read more about catalog usage here.