πŸ“„ ValidityΒΆ

Metric that evaluates tool call predictions for the validity with regards to the tools schema.

metrics.tool_calling.multi_turn.validity

Explanation about MultiTurnToolCallingMetricΒΆ

Compares each predicted tool call with list of references tool call.

Read more about catalog usage here.