-
Notifications
You must be signed in to change notification settings - Fork 272
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement ApplyToInternal::compute_output_shape
and JSONSelection::shape
returning shape::Shape
#6458
base: next
Are you sure you want to change the base?
Conversation
✅ Docs Preview ReadyNo new or changed pages found. |
CI performance tests
|
```ebnf | ||
JSONSelection ::= PathSelection | NamedSelection* | ||
SubSelection ::= "{" NamedSelection* "}" | ||
SubSelection ::= "{" OutputTypeAnnotation? NamedSelection* "}" | ||
OutputTypeAnnotation ::= "<" Identifier ">" | ||
NamedSelection ::= NamedPathSelection | PathWithSubSelection | NamedFieldSelection | NamedGroupSelection | ||
NamedPathSelection ::= Alias PathSelection | ||
NamedPathSelection ::= (Alias | "...") PathSelection | ||
NamedFieldSelection ::= Alias? Key SubSelection? | ||
NamedGroupSelection ::= Alias SubSelection |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Here's the only new JSONSelection syntax we need: { <Type> ... }
annotations and ...
as a way of denoting inline NamedPathSelection
items.
} else if let Ok((remainder, _dots)) = | ||
tuple((spaces_or_comments, ranged_span("...")))(input) | ||
{ | ||
// TODO Enforce path has a static output shape with known object keys. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Here's a remaining TODO
.
// TODO If the path is inlined with ... and has no explicit | ||
// SubSelection, we should use the computed Shape of the | ||
// path to determine the output fields. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Another TODO
I'm planning to get to before taking this PR out of draft.
} else if *inline { | ||
// If the PathSelection is inlined with ... but does not | ||
// have a subselection, we can use the computed Shape of | ||
// the path to synthesize the output selections. | ||
// TODO Implement this. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Another TODO
for the list.
NamedSelection ::= NamedPathSelection | PathWithSubSelection | NamedFieldSelection | NamedGroupSelection | ||
NamedPathSelection ::= Alias PathSelection | ||
NamedPathSelection ::= (Alias | "...") PathSelection |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
does this allow for future keywords like if
or match
directly after the ...
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If the PathSelection
started with an if
token, that would be a potential concern for making if
a keyword in the future, but if we stick with the the idea of a parenthesized test (i.e. ... if (test) {}
), the (
character should prevent parsing as a PathSelection
, causing the parser to backtrack and consider other ways of parsing ... if (test) {}
(which we can define).
Same goes for ... match (value)
, as long as we use parentheses, or as long as we're willing to make the match
field illegal immediately after ...
(a breaking change but not unthinkable).
@@ -69,6 +73,44 @@ impl JSONSelection { | |||
|
|||
(value, errors.into_iter().collect()) | |||
} | |||
|
|||
pub fn shape(&self) -> Shape { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
what's the reason for having shape code in this file? it feels different enough to warrant a separate file and maybe a separate trait, even if it's structured basically the same
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It made the code a little easier to write to have apply_to_path
nearby, but there's probably no harm in moving it to a new file now.
|
||
for pair in args { | ||
if let LitExpr::Array(pair) = pair.as_ref() { | ||
if pair.len() == 2 { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
should we return an error if the length isn't exactly 2?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do you have an example of error handling? since this doesn't return a Result
, what should a caller do with a Shape::error_with_range
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I imagine we'll add a way to collect/traverse/visit any ShapeCase::Error
variants within a given Shape
, to handle them programmatically or surface them in the debugger, but for now the main point of errors is to fail validation against expected GraphQL schema types.
Since you can embed ShapeCase::Error
within a larger shape, shape errors are similar to the GraphQL errors-as-data pattern, where you're able to get back some partial shape information along with contextual errors.
909e897
to
e5eedde
Compare
15b6b8c
to
1fde6d8
Compare
e5eedde
to
16d0306
Compare
1fde6d8
to
ebb3e95
Compare
16d0306
to
dfeb558
Compare
ebb3e95
to
70486c1
Compare
This syntax allows us to model the GraphQL behavior of automatically mapping selection sets over lists of objects, even when we can't determine (yet) whether the input has an array shape or not (e.g. any undeclared named shape reference). Once you provide enough input shape information to resolve the array ambiguity, you won't see as many `.*` segments in named shape subpaths (and you also won't see as many named shape references in general, because you've probably provided at least some of the shape definitions for those names).
TODO Make sure we audit any validations preventing key collisions, because we now have a more graceful way of handling the collisions.
The special `{ <Type> }` syntax for selection sets still works as a shorthand for `{ __typename: "Type" }`, and may be more useful for static analysis because it enforces `Type` is an identifier. We may even want to forbid setting `__typename` to anything other than a (union of) string literal shapes, or forbid setting `__typename` directly altogether, in favor of the `<Type>` syntax, since it provides the flexibility we're comfortable with.
This allows us to postpone PR #6456 for further consideration.
70486c1
to
2658a33
Compare
This draft PR uses our newly published
shape
crate to implement theApplyToInternal::compute_output_shape
trait method, which returns ashape::Shape
, which is an immutable and reference-counted Rust data structure that describes a set of constraints on JSON data. The "shape" terminology is a close synonym for "type" but I'm using it to emphasize the immutable, reference-free, value-typed nature of the JSON domain.As an important part of this
::compute_output_shape
system, all->
methods now define amethod_shape
function that allows them to reject unexpected input shapes and compute their own output shape in terms of a given input shape and variable shapes. In other words, the type signatures of->
methods are expressed using Rust code which ultimately returns ashape::Shape
, which then gets further processed by thecompute_output_shape
logic, allowing->
methods to blend seamlessly into the shape processing system.This new ability to compute a static output
Shape
for a given JSONSelection string allows us to relax the constraint that inlinePathSelection
items must have a static{...}
subselection at the end. The static subselection was previously our only means of statically determining the output fields of thePathSelection
, but now we can statically/programmatically examine the outputShape
, which doesn't distinguish between static subselections and->
method results (as long as they have the same computed object shape). We are also now requiring a...
token before inlinePathSelection
items without static subselections, and encouraging the...
token for any inlinePathSelection
, even if it has a subselection (though enforcing it in that case would be a breaking change).Finally, this PR proposes a way of setting the
__typename
in situations where abstract GraphQL types (interfaces and unions) are involved: by writing a{ <Type> ... }
annotation at the beginning of any subselection, developers can indicate to both theShape
system and the runtime system that the__typename
of this output object should have the literal shape"Type"
, and the object should include__typename: "Type"
at runtime. The underlyingShape
representation actually does use a__typename
field to model this information, but that's a choice made by the JSONSelection code, not something embedded in theShape
library.I wish I could have broken this PR up more, though I was able to split out #6455, #6456, and #6457.
I don't believe this PR should introduce any logical runtime differences of behavior in the connectors system, since it's basically adding the
compute_output_shape
method and then only using it in tests. Once I address a fewTODO
s, the output shapes will play a role in checking the validity of...
-inlinedPathSelection
items, but the...
syntax is new with this PR, so changing it is still easy.