feat: add faithfulness metric based on Bespoke Labs MiniCheck model #1269
Conversation
Thanks for the PR @vutrung96, I will take a look at it shortly.
@shahules786 thanks! One thing I'm running into is that for `make type`, I'm getting an import-not-found error for this line: `import einops as einops`. I think it's because in CI, the command …
@vutrung96 you can add that as part of the dev dependencies in requirements/dev.txt?
@jjmachan thanks for the suggestion! I've added the dependencies to requirements/dev.txt.
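For reference, the fix amounts to one added line in the dev requirements file (a sketch; the file's other contents are omitted and its exact layout is an assumption):

```
# requirements/dev.txt — added so `make type` can resolve the import in CI
einops
```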
@shahules786 ping on review. Please lmk if you need any clarifications :)
@vutrung96 I'm rethinking and reworking parts of the faithfulness metric. I'm also thinking about the best ways to allow users to use any NLI model within it without adding code to ragas.
Hi @vutrung96, thanks again for the PR. We want to enable developers to use any model of their choice, regardless of the metric they use. There are two types of models in this context:

1. General-purpose LLMs that power the metrics.
2. Specialized models (such as NLI models) that a particular metric depends on.
For the former, we already support the use of any model with ragas. For the latter, currently either we or the user has to modify the code in ragas to integrate the specialized model. It is fine if the user does this in their own version of ragas, but merging that code into the main ragas repository transfers the responsibility of maintaining and updating it to us (as is the case with this PR), which is not something we can take on. Therefore, we are introducing … In your case, I think the model can be used as a component by passing it as a HuggingfacePipeline to …
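For illustration, a minimal sketch of the component approach described above, assuming the model is loaded with `transformers` and handed to ragas through LangChain's `HuggingFacePipeline` and ragas's `LangchainLLMWrapper`. The wrapper target and loading details are assumptions, since the comment above is truncated:

```python
# Sketch only: wraps a locally loaded Hugging Face model so ragas can use it
# as a pluggable LLM component. Model loading details are assumptions.
from transformers import pipeline
from langchain_community.llms import HuggingFacePipeline
from ragas.llms import LangchainLLMWrapper

# Load Bespoke-MiniCheck-7B locally; trust_remote_code may be needed for
# this model's custom code, and enough GPU memory is assumed.
hf_pipe = pipeline(
    "text-generation",
    model="bespokelabs/Bespoke-MiniCheck-7B",
    device_map="auto",
    trust_remote_code=True,
)

# Expose the pipeline through LangChain, then wrap it for ragas.
langchain_llm = HuggingFacePipeline(pipeline=hf_pipe)
ragas_llm = LangchainLLMWrapper(langchain_llm)
```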
This PR adds a faithfulness metric based on the Bespoke-MiniCheck-7B model.
The metric can be used either by calling the model through the Bespoke Labs API or by running the model locally, as sketched below.
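A hypothetical usage sketch of what this could look like from the user's side; `BespokeMiniCheckFaithfulness` and its constructor arguments are placeholder names, not the PR's confirmed API:

```python
# Placeholder names throughout; only `Dataset.from_dict` and `evaluate`
# are real datasets/ragas APIs.
from datasets import Dataset
from ragas import evaluate
# from ragas.metrics import BespokeMiniCheckFaithfulness  # placeholder import

dataset = Dataset.from_dict({
    "question": ["Where is the Eiffel Tower?"],
    "answer": ["The Eiffel Tower is in Paris."],
    "contexts": [["The Eiffel Tower is located in Paris, France."]],
})

# Remote: score claims via the Bespoke Labs API (hypothetical argument).
# metric = BespokeMiniCheckFaithfulness(api_key="...")

# Local: reuse a locally loaded model (hypothetical argument; see the
# wrapper sketch earlier in the thread).
# metric = BespokeMiniCheckFaithfulness(llm=ragas_llm)

# result = evaluate(dataset, metrics=[metric])
```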
I verified that the metric works in a Colab notebook: https://colab.research.google.com/drive/1OcL8-LkeKp-_7-_8_l7ysO8O6_AIz6jd#scrollTo=Jbg0gon7uXII.