Dataset YAML schema #29

Open
danjjl opened this issue Oct 28, 2024 · 1 comment
Labels
datasets documentation Improvements or additions to documentation

danjjl commented Oct 28, 2024

Like algorithms, datasets should be listed as dataset.yaml entries.
A schema for these entries still needs to be written. It can be based on the algorithm schema.

The schema should also include statistics on each dataset, e.g.:

  • Number of subjects
  • Number of seizures
  • Avg recording duration
  • EEG Montage
  • Total hours of recording
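
As a rough sketch of how the statistics listed above could be represented and sanity-checked, here is one possible shape in Python. The class name, the field names, the units (hours) and the checks are all assumptions made for illustration; none of them come from an agreed schema, and the values are placeholders.

```python
from dataclasses import dataclass, asdict


@dataclass
class DatasetStats:
    """Hypothetical statistics block for one dataset entry."""
    n_subjects: int
    n_seizures: int
    avg_recording_duration_h: float  # assumed unit: hours
    eeg_montage: str
    total_recording_h: float         # assumed unit: hours


def validate(stats: DatasetStats) -> list[str]:
    """Return a list of human-readable problems (empty if none were found)."""
    problems = []
    if stats.n_subjects <= 0:
        problems.append("n_subjects must be positive")
    if stats.n_seizures < 0:
        problems.append("n_seizures must not be negative")
    if stats.avg_recording_duration_h <= 0:
        problems.append("avg_recording_duration_h must be positive")
    if stats.total_recording_h < stats.avg_recording_duration_h:
        problems.append("total_recording_h is shorter than the average recording")
    if not stats.eeg_montage.strip():
        problems.append("eeg_montage must not be empty")
    return problems


# Placeholder values for illustration only, not taken from any real dataset.
example = DatasetStats(
    n_subjects=10,
    n_seizures=25,
    avg_recording_duration_h=2.0,
    eeg_montage="10-20",
    total_recording_h=40.0,
)

if __name__ == "__main__":
    print(asdict(example))    # plain dict, ready to be dumped to YAML or JSON
    print(validate(example))  # list of problems found in the entry
```

Keeping the statistics in a flat, typed structure like this would make it straightforward to derive a JSON Schema later, whichever base format is chosen.
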
@danjjl danjjl added documentation Improvements or additions to documentation datasets labels Oct 28, 2024

danjjl commented Oct 31, 2024

Several options seem possible:

  • DatasetJSON (as defined by CDISC) -- no json.schema
  • Based on CIFF -- no definition of the number of subjects, recording variables, ...
  • Based on BIDS dataset_description.json -- very little information in the file
  • Schema.org Dataset definition -- no json.schema and lacks variables, but it might still be the best option since it seems to be the most widely accepted
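
If the Schema.org route were taken, one conceivable way to carry the dataset statistics is the `variableMeasured` property of `Dataset`, filled with `PropertyValue` items. The sketch below only illustrates that idea: the helper `to_schema_org`, the dataset name and the statistic names are hypothetical, and whether this mapping is acceptable for the project is an open question.

```python
import json


def to_schema_org(name: str, stats: dict) -> dict:
    """Build a Schema.org Dataset record (JSON-LD) from a name and a stats mapping.

    Each statistic becomes one PropertyValue inside variableMeasured.
    """
    return {
        "@context": "https://schema.org",
        "@type": "Dataset",
        "name": name,
        "variableMeasured": [
            {"@type": "PropertyValue", "name": key, "value": value}
            for key, value in stats.items()
        ],
    }


# Placeholder name and values, for illustration only.
record = to_schema_org(
    "example-dataset",
    {"numberOfSubjects": 10, "numberOfSeizures": 25},
)

if __name__ == "__main__":
    print(json.dumps(record, indent=2))
```

Because the statistic names are free text here, a project-specific json.schema would still be needed on top to make them mandatory and consistently named.
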

@danjjl danjjl mentioned this issue Nov 14, 2024