π ValidityΒΆ
A metric that assesses tool call predictions for their conformity to the tool schema.
metrics.tool_calling.multi_turn.validity
Explanation about MultiTurnToolCallingMetricΒΆ
Compares each predicted tool call with list of references tool call.
Read more about catalog usage here.