π ValidityΒΆ
Metric that evaluates tool call predictions for the validity with regards to the tools schema.
metrics.tool_calling.multi_turn.validity
Explanation about MultiTurnToolCallingMetricΒΆ
Compares each predicted tool call with list of references tool call.
Read more about catalog usage here.