Replies: 5 comments 11 replies
-
This seems like a slam dunk to me, very small complexity cost for substantial utility.
When I think of this, I think of broader support for multi-dimension partitioning. I believe supporting that here makes sense as part of broader support of it. For example Having a single asset belong to multiple partitions makes less sense to me. If this was what you meant, do you have a motivating example? |
Beta Was this translation helpful? Give feedback.
-
It looks like the ship has sailed on this, but to me |
Beta Was this translation helpful? Give feedback.
-
Just to talk through it a bit more, if you have
At some level, that is not really any different than
except for losing the information that What if we pushed this information down in to
|
Beta Was this translation helpful? Give feedback.
-
Tests now passing: https://dagster.phacility.com/D4526 |
Beta Was this translation helpful? Give feedback.
-
With this RFC, how will Right now, I effectively treat If I have a solid that takes a partitioned dataframe This is especially true when there are multiple partitioned assets in a pipeline. Say there are two assets |
Beta Was this translation helpful? Give feedback.
-
Motivation
Conceptually, when we say that “a pipeline is partitioned”, we really mean that it produces or consumes partitioned assets.
More concretely, anyone responsible for maintaining an asset would find it useful to be able to answer these questions about it:
Proposed Changes
Add a partition attribute to AssetMaterialization:
Should we allow a single AssetMaterialization to include multiple partitions?
Dagit
Some cool views we could build:
Asset-Partition Matrix
Plot a value over time like we do with current run-based longitudinal views
Beta Was this translation helpful? Give feedback.
All reactions