π End To EndΒΆ
- This is a task corresponding to an end to end RAG evaluation. It assumes the user provides a question, and
the RAG system returns an answer and a set of retrieved contexts (documents or passages). For details of RAG see: https://www.unitxt.ai/en/latest/docs/rag_support.html.
tasks.rag.end_to_end
type: Task
input_fields:
question: Union[str, Dialog]
question_id: Any
metadata_field: str
reference_fields:
reference_answers: List[str]
reference_contexts: List[str]
reference_context_ids: Union[List[int], List[str]]
is_answerable_label: bool
metrics:
- metrics.rag.end_to_end.answer_correctness
- metrics.rag.end_to_end.answer_faithfulness
- metrics.rag.end_to_end.answer_reward
- metrics.rag.end_to_end.context_correctness
- metrics.rag.end_to_end.context_relevance
prediction_type: RagResponse
augmentable_inputs:
- question
defaults:
question_id:
metadata_field:
reference_answers: []
reference_contexts: []
reference_context_ids: []
is_answerable_label: True
default_template: templates.rag.end_to_end.json_predictions
[source]Explanation about TaskΒΆ
Task packs the different instance fields into dictionaries by their roles in the task.
- Args:
- input_fields (Union[Dict[str, str], List[str]]):
Dictionary with string names of instance input fields and types of respective values. In case a list is passed, each type will be assumed to be Any.
- reference_fields (Union[Dict[str, str], List[str]]):
Dictionary with string names of instance output fields and types of respective values. In case a list is passed, each type will be assumed to be Any.
- metrics (List[str]):
List of names of metrics to be used in the task.
- prediction_type (Optional[str]):
Need to be consistent with all used metrics. Defaults to None, which means that it will be set to Any.
- defaults (Optional[Dict[str, Any]]):
An optional dictionary with default values for chosen input/output keys. Needs to be consistent with names and types provided in βinput_fieldsβ and/or βoutput_fieldsβ arguments. Will not overwrite values if already provided in a given instance.
- The output instance contains three fields:
βinput_fieldsβ whose value is a sub-dictionary of the input instance, consisting of all the fields listed in Arg βinput_fieldsβ.
βreference_fieldsβ β for the fields listed in Arg βreference_fieldsβ.
βmetricsβ β to contain the value of Arg βmetricsβ
References: metrics.rag.end_to_end.context_correctness, metrics.rag.end_to_end.answer_faithfulness, metrics.rag.end_to_end.answer_correctness, templates.rag.end_to_end.json_predictions, metrics.rag.end_to_end.context_relevance, metrics.rag.end_to_end.answer_reward
Read more about catalog usage here.