Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Auto-merging Retrieval #7863

Closed
2 tasks done
davidsbatista opened this issue Jun 14, 2024 · 1 comment
Closed
2 tasks done

Auto-merging Retrieval #7863

davidsbatista opened this issue Jun 14, 2024 · 1 comment
Assignees
Labels
2.x Related to Haystack v2.0 P2 Medium priority, add to the next sprint if no P1 available topic:retriever

Comments

@davidsbatista
Copy link
Contributor

davidsbatista commented Jun 14, 2024

Auto-merging retrieval aims to combine (or merge) information from multiple sources or segments of text to create a more comprehensive and contextually relevant response to a query.

This approach is particularly useful when no single document or segment fully answers the query but rather the answer lies in combining information from multiple sources.

It allows smaller chunks to be merged into bigger parent chunks. It does this via the following steps:

  1. Define a hierarchy of smaller chunks linked to parent chunks.
  2. If the set of smaller chunks linking to a parent chunk exceeds some threshold retrieve the parent chunk.

The method will finally retrieve the parent chunk for better context.

  • splitting documents into nodes of different block sizes, including parent and child nodes
  • implement a retriever based on hierarchical chunking merging
@julian-risch
Copy link
Member

closed by deepset-ai/haystack-experimental#56

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2.x Related to Haystack v2.0 P2 Medium priority, add to the next sprint if no P1 available topic:retriever
Projects
None yet
Development

No branches or pull requests

3 participants