proposal: rag evaluation results presentation #7462

davidsbatista · 2024-04-04T07:52:34Z

Proposal for a presentation of RAG pipeline evaluation results

proposals/text/7462-rag-evaluation.md

julian-risch

We're almost there. 👍 I am not sure about the idea of passing other incomparative_summary(self, other: "RAGPipelineEvaluation"): but we can iterate on it later. I wouldn't block the proposal because of that. Better to start with the implementation.
One decision that should be made when implementation of this proposal starts is whether we want to rely on pandas or not. We should check whether we want the Document class to rely on pandas or not for that. We could definitely implement the new eval features without pandas and have one data export option that makes it easy for advanced users to use pandas if they want to.
One change request: could you map the user stories directly to the methods please? For example, that mapping should explain when the user uses find_thresholds for one of the stories from the issue.
I left some minor comments in the proposal too.

proposals/text/7462-rag-evaluation.md

Co-authored-by: Madeesh Kannan <[email protected]>

davidsbatista · 2024-04-08T14:46:32Z

We could definitely implement the new eval features without pandas and have one data export option that makes it easy for advanced users to use pandas if they want to
@julian-risch I agree with you, I would make clear that we don't want pandas, and have JSON that can easily be transformed into a pandas

shadeMe

Just a couple of minor changes before it's good to merge (from my side) 🎉

proposals/text/7462-rag-evaluation.md

Co-authored-by: Madeesh Kannan <[email protected]>

davidsbatista · 2024-04-08T15:56:48Z

@julian-risch @mrm1001 do you want to add, suggest anything else? if not I will merge it

davidsbatista added 2 commits April 2, 2024 22:25

adding files

4eebc29

adding proposal in md

ecb0165

davidsbatista requested review from a team as code owners April 4, 2024 07:52

davidsbatista requested review from dfokina and shadeMe and removed request for a team April 4, 2024 07:52

github-actions bot added proposal 2.x Related to Haystack v2.0 labels Apr 4, 2024

renaming proposal number

7120798

davidsbatista requested review from julian-risch and mrm1001 April 4, 2024 07:53

davidsbatista added 2 commits April 4, 2024 09:55

removing stuff

8d47675

cleaning up

93229d0

shadeMe added the ignore-for-release-notes PRs with this flag won't be included in the release notes. label Apr 4, 2024

davidsbatista added 2 commits April 4, 2024 16:31

Merge branch 'main' into rag-dataset-eval

67fdac0

Merge branch 'main' into rag-dataset-eval

ba2fd8f

shadeMe reviewed Apr 5, 2024

View reviewed changes

adding PR number and issue

32de96b