Skip to content
This repository has been archived by the owner on Oct 28, 2024. It is now read-only.

[Draft] Clarify serialization and discoverability #40

Closed
wants to merge 13 commits into from

Conversation

roll
Copy link
Member

@roll roll commented Feb 21, 2024


Overview

It's the first attempt to clarify the specifications serialization and discoverability. Also it deals with often requested YAML

@roll roll changed the title Clarify serialization Clarify serialization formats Feb 21, 2024
Copy link

cloudflare-workers-and-pages bot commented Feb 21, 2024

Deploying datapackage with  Cloudflare Pages  Cloudflare Pages

Latest commit: 610a788
Status: ✅  Deploy successful!
Preview URL: https://9094971e.datapackage.pages.dev
Branch Preview URL: https://292-clarify-serialization.datapackage.pages.dev

View logs

@roll roll changed the title Clarify serialization formats Clarify serialization and discoverability Feb 21, 2024
@nichtich
Copy link

A data producer MAY use any suitable serialization format. A data consumer MUST support JSON serialization format and > MAY support other serialization formats like YAML or TOML.

The above states that JSON is the only serialization format that MUST be used for publishing

The data producer is the publishing agent, so either it MUST use JSON or it MAY use JSON. I guess you meant to say something like

A data producer MUST publish JSON but it MAY use any suitable serialization format to create JSON from.

@roll
Copy link
Member Author

roll commented Feb 23, 2024

@nichtich
The point was that one can serialize a descriptor using any format, but only JSON is guaranteed to be understood by consumers. It reflects that we can't forbid e.g. frictionless-py or Open Data Editor generate YAML descriptors but we can state that it's not acceptable for external data publishing

@roll
Copy link
Member Author

roll commented Mar 11, 2024

Hi @peterdesmet,

WDYT? Does it make sense?

@nichtich
Copy link

Sorry, but this reads like "please use JSON and name the file datapackage.json but you can also use anything else" so nothing is guaranteed. The fact that JSON can internally be serialized in YAML, TOML etc. should better be a sidenote but not part of the specification.

content/docs/specifications/data-package.md Outdated Show resolved Hide resolved
content/docs/specifications/data-package.md Outdated Show resolved Hide resolved
content/docs/specifications/data-resource.md Outdated Show resolved Hide resolved
content/docs/specifications/table-dialect.md Outdated Show resolved Hide resolved
content/docs/specifications/table-schema.md Outdated Show resolved Hide resolved
@peterdesmet
Copy link
Member

The extra text in this PR makes sense. But it also includes relaxing the format and the datapackage.json name. I think it is fine to relax the format for internal use, but the text should be clearer on "publishing" (the "The above states..." is unclear to me). Personally, when provided/published as a JSON file, I think it should remain being called datapackage.json.

@roll
Copy link
Member Author

roll commented Mar 14, 2024

Thanks @peterdesmet!
I've merged your suggestions at it is

@roll roll changed the title Clarify serialization and discoverability [Draft] Clarify serialization and discoverability Mar 28, 2024
@roll
Copy link
Member Author

roll commented Mar 28, 2024

I'll put this under the draft label to take some extra time working on it as, it will be probably better if we move the definitions in one place (like glossary/terminology spec) instead of duplicating it in all the specs

@roll
Copy link
Member Author

roll commented Apr 3, 2024

CLOSED in favour of #50

@roll roll closed this Apr 3, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
3 participants