📄 End To End

tasks.rag.end_to_end

type: Task
input_fields: 
  question: str
  question_id: Any
  metadata_field: str
reference_fields: 
  reference_answers: List[str]
  reference_contexts: List[str]
  reference_context_ids: List[str]
  is_answerable_label: bool
metrics: 
  - metrics.rag.end_to_end.answer_correctness
  - metrics.rag.end_to_end.answer_faithfulness
  - metrics.rag.end_to_end.answer_reward
  - metrics.rag.end_to_end.context_correctness
  - metrics.rag.end_to_end.context_relevance
prediction_type: Dict[str, Any]
augmentable_inputs: 
  - question
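For illustration, an instance matching the field declarations above might look like the following. This is only a sketch: the values are invented, and the `missing_fields` helper is hypothetical (it is not part of the library). It checks nothing beyond the presence of each declared field.

```python
from typing import Any, Dict, List

# Field names copied from the task definition above.
INPUT_FIELDS = ["question", "question_id", "metadata_field"]
REFERENCE_FIELDS = [
    "reference_answers",
    "reference_contexts",
    "reference_context_ids",
    "is_answerable_label",
]

# Invented example values, typed as declared in the task definition.
instance: Dict[str, Any] = {
    "question": "What is the capital of France?",
    "question_id": 17,
    "metadata_field": "",
    "reference_answers": ["Paris"],
    "reference_contexts": ["Paris is the capital of France."],
    "reference_context_ids": ["doc-1"],
    "is_answerable_label": True,
}


def missing_fields(inst: Dict[str, Any]) -> List[str]:
    """Return the declared field names that are absent from the instance."""
    return [name for name in INPUT_FIELDS + REFERENCE_FIELDS if name not in inst]


print(missing_fields(instance))  # []
```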

Explanation about Task

Task packs the different instance fields into dictionaries by their roles in the task.

Attributes:
input_fields (Union[Dict[str, str], List[str]]):

Dictionary with string names of instance input fields and types of respective values. In case a list is passed, each type will be assumed to be Any.

reference_fields (Union[Dict[str, str], List[str]]):

Dictionary with string names of instance output fields and types of respective values. In case a list is passed, each type will be assumed to be Any.

metrics (List[str]):

List of names of metrics to be used in the task.

prediction_type (Optional[str]):

Type of the prediction. Needs to be consistent with all used metrics. Defaults to None, which means that it will be set to Any.

defaults (Optional[Dict[str, Any]]):

An optional dictionary with default values for chosen input/reference keys. Needs to be consistent with the names and types provided in the ‘input_fields’ and/or ‘reference_fields’ arguments. Will not overwrite values that are already provided in a given instance.
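Two of the behaviors described in the attributes above can be sketched in plain Python: a list of field names is treated as a mapping to Any, and defaults fill in only the keys an instance lacks. The helpers `normalize_fields` and `apply_defaults` are hypothetical names chosen for this sketch, not the library's own functions.

```python
from typing import Any, Dict, List, Union


def normalize_fields(fields: Union[Dict[str, str], List[str]]) -> Dict[str, str]:
    """Turn a list of field names into a name -> type mapping, using "Any"."""
    if isinstance(fields, list):
        return {name: "Any" for name in fields}
    return dict(fields)


def apply_defaults(instance: Dict[str, Any], defaults: Dict[str, Any]) -> Dict[str, Any]:
    """Fill in missing keys from defaults; values already present win."""
    return {**defaults, **instance}


# A list of names becomes a mapping with every type set to "Any".
fields = normalize_fields(["question", "question_id"])

# "question" is kept as provided; only "metadata_field" is filled in.
filled = apply_defaults(
    {"question": "Who wrote Hamlet?"},
    {"question": "default question", "metadata_field": ""},
)
```

Merging with `{**defaults, **instance}` keeps the instance's values on any key collision, which mirrors the "will not overwrite" rule stated for `defaults`.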

The output instance contains three fields:

“input_fields”, whose value is a sub-dictionary of the input instance consisting of all the fields listed in the ‘input_fields’ argument.

“reference_fields”, the corresponding sub-dictionary for the fields listed in the ‘reference_fields’ argument.

“metrics”, which contains the value of the ‘metrics’ argument.
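The packing step described above can be approximated as follows. This is a minimal sketch under the assumption that packing simply selects the listed keys from a flat instance; `pack_instance` is a hypothetical helper, and the field subset and values are chosen only to keep the example short.

```python
from typing import Any, Dict, List


def pack_instance(
    instance: Dict[str, Any],
    input_fields: List[str],
    reference_fields: List[str],
    metrics: List[str],
) -> Dict[str, Any]:
    """Group a flat instance into the three fields described above."""
    return {
        "input_fields": {name: instance[name] for name in input_fields},
        "reference_fields": {name: instance[name] for name in reference_fields},
        "metrics": list(metrics),
    }


# Invented flat instance using a subset of this task's fields.
flat = {
    "question": "What is the capital of France?",
    "question_id": 17,
    "reference_answers": ["Paris"],
    "is_answerable_label": True,
}

packed = pack_instance(
    flat,
    input_fields=["question", "question_id"],
    reference_fields=["reference_answers", "is_answerable_label"],
    metrics=["metrics.rag.end_to_end.answer_correctness"],
)
```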

References: metrics.rag.end_to_end.answer_faithfulness, metrics.rag.end_to_end.context_correctness, metrics.rag.end_to_end.answer_correctness, metrics.rag.end_to_end.context_relevance, metrics.rag.end_to_end.answer_reward

Read more about catalog usage here.