Add a new ROUGE metric to Evidently #1318

Open
elenasamuylova opened this issue Sep 23, 2024 · 2 comments
Labels
enhancement (New feature or request), hacktoberfest (Accepted contributions will count towards your hacktoberfest PRs)

Comments

@elenasamuylova
Collaborator

elenasamuylova commented Sep 23, 2024

About Hacktoberfest contributions: https://github.com/evidentlyai/evidently/wiki/Hacktoberfest-2024

Description

The ROUGE (Recall-Oriented Understudy for Gisting Evaluation) metric evaluates the quality of a generated text by comparing it to a reference text (typically a summary). It measures how much of the reference text is covered by the generated summary through n-gram overlap. Several common ROUGE variants exist:

  • ROUGE-1: Measures unigram (word-level) overlap.
  • ROUGE-2: Measures bigram (two-word sequence) overlap.
  • ROUGE-N: Measures n-gram overlap between the candidate and reference text.

We can implement a ROUGE metric that takes the parameter n and computes both the descriptor values (overlap) for each row and a summary ROUGE metric for the dataset.
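
As a rough illustration of the idea above, here is a minimal sketch of a recall-style ROUGE-N that returns a score per row plus one summary value. It is not Evidently code: the function names (`ngrams`, `rouge_n`, `rouge_n_summary`) are hypothetical, tokenization is assumed to be lowercased whitespace splitting, and the summary is assumed to be the plain mean of the per-row scores. The actual implementation may choose differently on all three points.

```python
from collections import Counter
from statistics import mean


def ngrams(text: str, n: int) -> Counter:
    """Count n-grams over lowercased, whitespace-split tokens (a simplifying assumption)."""
    tokens = text.lower().split()
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n(candidate: str, reference: str, n: int = 1) -> float:
    """Recall-style ROUGE-N: share of reference n-grams also found in the candidate."""
    ref = ngrams(reference, n)
    cand = ngrams(candidate, n)
    total = sum(ref.values())
    if total == 0:
        # Reference is shorter than n tokens, so there is nothing to cover.
        return 0.0
    # Counter intersection keeps the minimum count per n-gram (clipped overlap).
    overlap = sum((ref & cand).values())
    return overlap / total


def rouge_n_summary(candidates, references, n: int = 1):
    """Return (per-row scores, dataset-level score); the mean is an assumed aggregation."""
    per_row = [rouge_n(c, r, n) for c, r in zip(candidates, references)]
    return per_row, (mean(per_row) if per_row else 0.0)


if __name__ == "__main__":
    generated = ["the cat sat on the mat", "hello world"]
    reference = ["the cat is on the mat", "hello there world"]
    rows, summary = rouge_n_summary(generated, reference, n=1)
    print(rows, summary)
```

In this sketch the per-row values play the role of the descriptor, and the second return value is what a dedicated dataset-level Metric would compute and visualize.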

Note that this implementation would require creating a new Metric (instead of defaulting to ColumnSummaryMetric to aggregate descriptor values) to compute and visualize the summary ROUGE score. You can check other dataset-level metrics (e.g., from classification or ranking) for inspiration.

elenasamuylova added the enhancement and hacktoberfest labels Sep 23, 2024
@pmittaldev

Hey @elenasamuylova, I am working on it.

@pmittaldev

[Screenshot 2024-10-02 at 02 45 59]

Do we need to show the text as well for comparison, or is the score per row, along with the summary score, enough?

pmittaldev added a commit to pmittaldev/evidently that referenced this issue Oct 28, 2024
pmittaldev added a commit to pmittaldev/evidently that referenced this issue Oct 28, 2024