πŸ“„ RougeΒΆ

This is the classical NLP Rouge metric based on the RougeScorer library (https://github.com/google-research/google-research/tree/master/rouge). It computes metrics several metrics (rouge1, rouge2, roughL, and rougeLsum) based lexical (word) overlap between the prediction and the ground truth references.”

Tags: flags:['reference-based-metric', 'cpu-metric'], category:metric

metrics.rouge

Explanation about RougeΒΆ

Computes ROUGE scores for text summarization evaluation.

Range: [0, 1] (higher is better) Measures n-gram overlap between prediction and reference texts.

Reference: https://en.wikipedia.org/wiki/ROUGE_(metric)

Read more about catalog usage here.