Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
unstructured: fix metadata order mixed up (#336)
* Optional meta field for UnstructuredFileConverter with proper tests * black lint * Adding multiple files and meta list test case * Black formatting test * Fixing metadata page number bug. Deep copy of dict * Folder of files test * Update integrations/unstructured/src/haystack_integrations/components/converters/unstructured/converter.py Co-authored-by: Stefano Fiorucci <[email protected]> * Update integrations/unstructured/src/haystack_integrations/components/converters/unstructured/converter.py Co-authored-by: Stefano Fiorucci <[email protected]> * Update integrations/unstructured/src/haystack_integrations/components/converters/unstructured/converter.py Co-authored-by: Stefano Fiorucci <[email protected]> * Renaming "name" meta to "file_path" and deepcopy fix * Fix Ruff Complaining * Removing unique file logic using set that does not preserve file orders. Raise error if glob and metadata list because unsafe * Better test to make sure metadata order are preserved. * Make a failing test if metadata list and directory * filepaths as lists * Update integrations/unstructured/src/haystack_integrations/components/converters/unstructured/converter.py Co-authored-by: Stefano Fiorucci <[email protected]> * update meta docstrings --------- Co-authored-by: Stefano Fiorucci <[email protected]>
- Loading branch information