-
Notifications
You must be signed in to change notification settings - Fork 35
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to read a parquet file from multiple directories at once #96
Comments
I am not quite sure I understand what you mean. Can you do the same with just |
like this: spark.index.create.indexByAll().parquet(path1,path2,path3) |
One of my Parquet tables has multiple paths, but not a partition type. Now to create index for a table, I need to pass in all the paths,I've already implemented this,thanks |
I don't quite understand your directory structure. Usually, a parquet table is represented as a directory with files or partition sub-directories. If there is some sort of parent directory for all of those paths, you can try providing that instead. Right now, it is not possible to pass multiple paths, maybe it makes sense to add this functionality. |
Ok, I have understood. Thank you for your reply |
Hi,I want to read a parquet file in multiple directories at once, but I don't see the interface. Do I need to do this myself? Or is there an alternative
The text was updated successfully, but these errors were encountered: