Improve RAG capabilities of DocumentStore
#8508
Labels
P2
Medium priority, add to the next sprint if no P1 available
DocumentStore
Is your feature request related to a problem? Please describe.
If you are building a RAG pipeline, then the indexing pipeline is of course an essential part of it.
You usually don't run the indexing once but rather have it as an ongoing process which synchronizes data from files to indexed documents. For this, one needs the following capabilities:
The current implementation of the DocumentStore protocol is a bit too simple in that regard. For production-ready use cases you need the above methods so that you can actually build and maintain a RAG application. Currently I have to implement this manually outside of the document store protocol, and therefore outside of Haystack, which is painful and leaves room for a better developer experience.
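To illustrate the kind of glue code this currently means, here is a minimal sketch of a file-to-index sync written outside the store. Everything in it is an assumption made for illustration: `Doc`, `TinyStore`, `content_hash` and `sync` are hypothetical names, the store is an in-memory stand-in rather than a real Haystack document store, and documents are keyed by file path with a content hash kept in their metadata.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class Doc:
    """Hypothetical stand-in for an indexed document."""
    id: str
    content: str
    meta: dict = field(default_factory=dict)


class TinyStore:
    """Hypothetical in-memory store that can only write, delete and list."""

    def __init__(self):
        self._docs = {}

    def write_documents(self, docs):
        for doc in docs:
            self._docs[doc.id] = doc  # overwrite when the id already exists

    def delete_documents(self, ids):
        for doc_id in ids:
            self._docs.pop(doc_id, None)

    def all_documents(self):
        return list(self._docs.values())


def content_hash(text):
    """Fingerprint used to detect changed file content."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def sync(store, files):
    """Bring the store in line with `files`, a mapping of path -> text.

    Returns three lists of paths: (added, updated, deleted).
    """
    indexed = {doc.meta["path"]: doc for doc in store.all_documents()}
    added, updated, to_write = [], [], []

    for path, text in files.items():
        digest = content_hash(text)
        old = indexed.get(path)
        if old is None:
            added.append(path)
        elif old.meta["hash"] != digest:
            updated.append(path)
        else:
            continue  # unchanged file, nothing to re-index
        to_write.append(Doc(id=path, content=text, meta={"path": path, "hash": digest}))

    # Anything indexed but no longer present in the source is stale.
    deleted = [path for path in indexed if path not in files]

    store.write_documents(to_write)
    store.delete_documents(deleted)
    return added, updated, deleted
```

Calling `sync(store, files)` repeatedly with the current state of the source files keeps the index aligned with them. The point of this issue is that bookkeeping like this has to live in user code today instead of behind the document store protocol.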
Describe the solution you'd like
Extend the DocumentStore protocol and add implementations for the existing document stores.
Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.
Additional context
Talk to your favorite deepset team if you want to get some more input on running RAG pipelines in production :-)