Rouge
This is the classical NLP ROUGE metric, based on the RougeScorer library (https://github.com/google-research/google-research/tree/master/rouge). It computes several metrics (rouge1, rouge2, rougeL, and rougeLsum) based on lexical (word) overlap between the prediction and the ground-truth references.
Tags: flags:['reference-based-metric', 'cpu-metric'], category:metric
Explanation about Rouge
Computes ROUGE scores for text summarization evaluation.
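Range: [0, 1] (higher is better).
Measures n-gram overlap between prediction and reference texts (rouge1 and rouge2 count shared unigrams and bigrams; rougeL and rougeLsum use the longest common subsequence).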
Reference: https://en.wikipedia.org/wiki/ROUGE_(metric)
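To make the overlap computation concrete, here is a minimal pure-Python sketch of ROUGE-N and ROUGE-L F1 scores. It is not the RougeScorer implementation: the helper names (`rouge_n_f1`, `rouge_l_f1`) are hypothetical, and tokenization is assumed to be plain lowercase whitespace splitting, whereas the library applies its own tokenizer and optional stemming. Scores from the real metric may therefore differ.

```python
from collections import Counter


def _ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def _f1(matches, pred_total, ref_total):
    """Combine match counts into an F1 score; 0.0 when nothing overlaps."""
    if matches == 0 or pred_total == 0 or ref_total == 0:
        return 0.0
    precision = matches / pred_total
    recall = matches / ref_total
    return 2 * precision * recall / (precision + recall)


def rouge_n_f1(prediction, reference, n=1):
    """Sketch of ROUGE-N: F1 over clipped n-gram matches.

    Assumption: lowercase whitespace tokenization, no stemming.
    """
    pred = _ngrams(prediction.lower().split(), n)
    ref = _ngrams(reference.lower().split(), n)
    # Counter intersection keeps the minimum count of each shared n-gram.
    matches = sum((pred & ref).values())
    return _f1(matches, sum(pred.values()), sum(ref.values()))


def rouge_l_f1(prediction, reference):
    """Sketch of ROUGE-L: F1 over the longest common subsequence (LCS)."""
    pred = prediction.lower().split()
    ref = reference.lower().split()
    # Row-by-row dynamic programming for the LCS length.
    prev = [0] * (len(ref) + 1)
    for p in pred:
        curr = [0]
        for j, r in enumerate(ref, start=1):
            curr.append(prev[j - 1] + 1 if p == r else max(prev[j], curr[j - 1]))
        prev = curr
    return _f1(prev[-1], len(pred), len(ref))


if __name__ == "__main__":
    prediction = "the cat sat on the mat"
    reference = "the cat is on the mat"
    print("rouge1:", rouge_n_f1(prediction, reference, n=1))
    print("rouge2:", rouge_n_f1(prediction, reference, n=2))
    print("rougeL:", rouge_l_f1(prediction, reference))
```

In this example five of the six unigrams match, so rouge1 is 5/6; three of the five bigrams match, so rouge2 is 0.6; and the longest common subsequence ("the cat on the mat") has length 5, so rougeL is also 5/6. All values fall in the [0, 1] range described above.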
Read more about catalog usage here.