Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cedar Schema Annotations #48

Merged
merged 11 commits into from
Nov 19, 2024
139 changes: 139 additions & 0 deletions text/0048-schema-annotations.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,139 @@
# Annotations for Schemas

## Related issues and PRs

- Reference Issues:
- Implementation PR(s):

## Timeline

- Started: 2024-02-05

## Summary

Like Cedar policies, users may want to associate arbitrary, machine readable metadata with Schema objects.
We solved this problem in Cedar Policies by allowing for *annotations*: arbitrary key/value pairs that are attachable to policies.
This could be extended to Cedar Schemas, allowing users to attach attributes an entity type, common type, and action definitions.


## Basic example

1. Doc comments
```
@doc("This entity defines our central user type")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You have a number here as if this is the first of several examples, but it's the only one.

For doc comments, it makes more sense to me to define an opinionated literate format. This is what Rust, Java, etc. all do, and it's nicer to read and perhaps more natural to write (per your Alternative "Doc strings as comments"). But we could still also have arbitrary annotations to handle other use-cases (which it would be nice to see, here).

entity User {
owner : User,
team : String
}
```

```
@doc("Stop users from accessing a high security documents unless they are in the same building")
forbid(principal, action, resource) when {
resource.security_level == "HIGH"
unless {
resource.location == principal.location
};
```
john-h-kastner-aws marked this conversation as resolved.
Show resolved Hide resolved

## Motivation

Users should be allowed to associate machine readable metadata with objects in a Schema.
While we could create special syntax for associating particular kinds of metadata, we cannot
predict all of the metadata uses that users will have.
Thus providing a flexible system that users can adapt to their needs is preferrable.
This proposal re-uses the same syntax from Cedar Policies, creating a unified syntax.


## Detailed design

### Semantics
Attributes have **no** impact on validation decisions.
Attributes are arbitrary key/value pairs where:
* 'key' is a valid Cedar identifier
* 'value' is a Cedar string


The Cedar spec takes no opinion or stance on the interpretation of annotations.
The interpretation is entirely up to users of Cedar.

### Human Readable Syntax
Attributes in Cedar Schemas will mirror the syntax used for attributes in a policy: informally that's `@<key>("value")`.
Formally the following rule is added to the Cedar grammar:
```
Annotation := '@' IDENT '(' STR ')'
Annotations := Annotation {Annotations}
```
With an arbitrary number of them being able to prepend to a top level declaration in a schema.

Thus the full schema syntax becomes:
```
Schema := {Namespace}
Namespace := ('namespace' Path '{' {Decl} '}') | {Decl}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Based on some recent discussion, we might want to support annotations on the namespace. I don't see a reason not to.

Decl := Entity | Action | TypeDecl
Entity := Annotations 'entity' Idents ['in' EntOrTyps] [['='] RecType] ';'
Action := Annotations 'action' Names ['in' (Name | '[' [Names] ']')] [AppliesTo] [ActAttrs]';'
john-h-kastner-aws marked this conversation as resolved.
Show resolved Hide resolved
TypeDecl := Annotations 'type' IDENT '=' Type ';'
Type := PRIMTYPE | IDENT | SetType | RecType
EntType := Path
SetType := 'Set' '<' Type '>'
RecType := '{' [AttrDecls] '}'
AttrDecls := Name ['?'] ':' Type [',' | ',' AttrDecls]
AppliesTo := 'appliesTo' '{' AppDecls '}'
ActAttrs := 'attributes' '{' AttrDecls '}'
AppDecls := ('principal' | 'resource') ':' EntOrTyps [',' | ',' AppDecls]
| 'context' ':' RecType [',' | ',' AppDecls]
Path := IDENT {'::' IDENT}
EntTypes := Path {',' Path}
EntOrTyps := EntType | '[' [EntTypes] ']'
Name := IDENT | STR
Names := Name {',' Name}
Idents := IDENT {',' IDENT}
Annotation := '@' IDENT '(' STR ')
Annotations := Annotation {Annotations}

IDENT := ['_''a'-'z''A'-'Z']['_''a'-'z''A'-'Z''0'-'9']* - PRIMTYPE
STR := Fully-escaped Unicode surrounded by '"'s
PRIMTYPE := 'Long' | 'String' | 'Bool'
WHITESPC := Unicode whitespace
COMMENT := '//' ~NEWLINE* NEWLINE
```

### JSON Syntax
None of the three top-level constructs (EntityTypes, Actions, CommonTypes) in schemas allow for arbitrary key/value pairs.
This means a new key can be safely added while preserving backwards compatibility.
This proposal reserves the `annotations` key at the top level of each of those constructs, which contains an Object, containing each annotation key as an Object key, associated with the annotation value.
The only oddness here is Common Types, whose toplevel is a regular type. While this should still be backwards compatible, it will look a little odd to have annotations in some types and not in others.


## Drawbacks

1. Complexity: adds more complexity to schema
2. Oddness around syntax for Common Types in JSON form
3. By not taking a stance on annotation meanings, it makes it harder for a standard to form around them (ex: for doc strings)
4. Multi-line docstrings are technically valid but awkward.

## Alternatives

### Take a stance
Reverse our decision around annotations and start taking stances on what annotations mean.
aaronjeline marked this conversation as resolved.
Show resolved Hide resolved
This lets us standardize certain annotations, like `doc`.
This probably can't happen unless we also do this for policies, which we've said we don't want to do.
### Doc Strings as comments
Instead of annotations, we could add "doc-strings" as a first class feature.
Could look like this:
```
/# Stop users from accessing a high security document unless:
/# A) The principal and user are at the same location
/# B) The principal has a job level greater than 4
forbid(principal, action, resource) when {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If the main use case motivating this RFC is documentation, then I would prefer doc strings over annotations.

In general, I don't find the docs use case too compelling. Is there some other example you have in mind?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No, the docs case was what was asked for by someone in the slack, and I was seeing if people preferred the more general approach

Copy link
Contributor

@john-h-kastner-aws john-h-kastner-aws Feb 8, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

With the custom schema syntax I do think we'll see more requests for annotations in the schema eventually. It would feel natural to use the same annotation mechanism in policies and schema.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No reason we can't do both annotations and doc comments

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In Rust, doc comments are actually just sugar for Rust's equivalent of annotations. (Specifically, sugar for the #[doc] attribute.) We could consider doing something similar. What that means for this RFC is that we could support just doc comments now, and later if we introduce general annotations we could have doc comments desugar into a special reserved annotation. Or, we could support general annotations now, and later we could introduce doc comments as a sugar on top. We don't have to worry about closing off one or the other path.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For a doc comment syntax, I think //# is preferable to /# since backwards compatible with valid line comments, and similar to other syntaxes for overloading comments. An example is //! , the esbuild legal comment syntax to preserve in output files .

//! Copyright Cedar Contributors
//! SPDX-License-Identifier: Apache-2.0

resource.security_level == "HIGH"
unless {
(resource.location == principal.location) || (principal.job_level > 4 )
};
```
This has nice and easy multi-line syntax, but is special cased and not as general.

## Unresolved questions

* If this RFC is accepted, do we want to patch [RFC24 (Schema Syntax](https://github.com/cedar-policy/rfcs/issues/24") to contain the full new grammar so it's in one place?
aaronjeline marked this conversation as resolved.
Show resolved Hide resolved