Metrics
- Bert_score
- Perplexity
- Perplexity_a
- Perplexity_chat
- Perplexity_nli
- Perplexity_q
- Rag
- Reward
- Robustness
- Sentence_bert
- accuracy
- accuracy_binary
- bleu
- char_edit_dist_accuracy
- char_edit_distance
- f1_binary
- f1_macro
- f1_macro_multi_label
- f1_micro
- f1_micro_multi_label
- f1_weighted
- kendalltau_b
- kpa
- map
- matthews_correlation
- max_accuracy_binary
- max_f1_binary
- mrr
- ndcg
- ner
- normalized_sacrebleu
- precision_binary
- precision_macro_multi_label
- precision_micro_multi_label
- recall_binary
- recall_macro_multi_label
- recall_micro_multi_label
- regard
- rerank_recall
- retrieval_at_k
- roc_auc
- rouge
- rouge_with_confidence_intervals
- sacrebleu
- safety
- spearman
- squad
- string_containment
- token_overlap
- token_overlap_with_context
- unsorted_list_exact_match
- wer
Read more about catalog usage here.