
Discrepancy between my evaluation results and README for MNLI in evaluation.py #40

TinaChen95 opened this issue Mar 7, 2023 · 4 comments


@TinaChen95 commented Mar 7, 2023

Hi, I'm running evaluation.py on MNLI as described in the README, but I'm getting different results from what is reported there. I'm running it on Google Colab; my notebook is here: https://colab.research.google.com/drive/1UahAOTIwALfEC_DXE11mVOp5iSgwHoYH?usp=sharing

When I run evaluation.py, it shows the following results:
Task: mnli
Model path: ../CoFi-MNLI-s95
Model size: 4330279
Sparsity: 0.949
Accuracy: 0.091
Seconds/example: 0.000561

However, in the README file, the results for the same evaluation are different:
Task: MNLI
Model path: princeton-nlp/CoFi-MNLI-s95
Model size: 4920106
Sparsity: 0.943
mnli/acc: 0.8055
Seconds/example: 0.010151

I'd appreciate help figuring out why my results differ from those in the README. I've followed its instructions as closely as possible, but I may have missed something. Thank you for any assistance.
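Since the reported model size (4330279 vs 4920106) and sparsity (0.949 vs 0.943) also differ, one quick sanity check is whether the local directory really contains the princeton-nlp/CoFi-MNLI-s95 weights. Below is a minimal sketch, assuming the checkpoint is a standard pytorch_model.bin state dict; the absolute count will not necessarily equal the "Model size" printed by evaluation.py, but running the same count on both copies shows whether they hold the same weights.

```python
# Rough sketch (not the repo's official measurement): count the tensor
# entries stored in a checkpoint file without instantiating a model class,
# so it works even though CoFi uses a pruned, non-standard architecture.
import torch

def checkpoint_param_count(path):
    # "pytorch_model.bin" is the usual transformers checkpoint filename;
    # adjust the path if the local copy uses a different name.
    state_dict = torch.load(path, map_location="cpu")
    return sum(t.numel() for t in state_dict.values() if torch.is_tensor(t))

print(checkpoint_param_count("../CoFi-MNLI-s95/pytorch_model.bin"))
```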

@xiamengzhou (Collaborator)

This is interesting. What does the model path ../CoFi-MNLI-s95 point to?

@gaishun commented Mar 26, 2023

Have you solved this problem? I ran into the same issue.

@xiamengzhou (Collaborator)

It seems to be an issue with the transformers version. The code should be compatible with transformers==4.17.0 and datasets==1.14.0, but it might not work with later versions.
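If version skew is the culprit, it is worth confirming that the running interpreter actually has the pinned releases. A minimal sketch (note that on Colab the runtime must be restarted after reinstalling packages for the new versions to be imported):

```python
# Check that the pinned releases mentioned above are the ones being imported.
import transformers
import datasets

print("transformers:", transformers.__version__)  # expected: 4.17.0
print("datasets:", datasets.__version__)          # expected: 1.14.0
```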

@SHUSHENGQIGUI

Regarding the suggestion to pin transformers==4.17.0 and datasets==1.14.0: that setting always conflicts with the huggingface version on my side. Could you share your Python environment? I'm going crazy over this issue, please help.
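Until the exact environment is shared, here is a small sketch for dumping the versions most likely to matter for this comparison. The package list is only a guess at what is relevant; `pip freeze` gives the complete picture.

```python
# Print the installed versions of the packages most likely to affect the
# evaluation; this is a convenience sketch, not an exhaustive env export.
from importlib.metadata import PackageNotFoundError, version

for pkg in ("transformers", "datasets", "torch", "tokenizers", "huggingface-hub"):
    try:
        print(f"{pkg}=={version(pkg)}")
    except PackageNotFoundError:
        print(f"{pkg}: not installed")
```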
