Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

experimental cellxgene-schema CLI must update gene references to Ensembl 113 release #1127

Open
3 tasks
brianraymor opened this issue Nov 22, 2024 · 0 comments
Open
3 tasks

Comments

@brianraymor
Copy link
Contributor

Note

  • Add gene reference for Caenorhabditis elegans
  • Update gene reference for Danio rerio
  • Update gene reference for Drosophila melanogaster

Design

Required Gene Annotations

ENSEMBL identifiers are required for genes and External RNA Controls Consortium (ERCC) identifiers for RNA Spike-In Control Mixes to ensure that all datasets measure the same features and can therefore be integrated.

The following gene annotation dependencies are pinned for this version of the schema. For multi-organism experiments, cells from any Metazoan organism are allowed as long as orthologs from the following organism annotations are used.

Organism Source Required version Download
"NCBITaxon:6239"
for Caenorhabditis elegans
ENSEMBL (Caenorhabditis elegans) WBcel235 (GCA_000002985.3)
Ensembl 113
Caenorhabditis_elegans.WBcel235.113.gtf
NCBITaxon:7955
for Danio rerio
ENSEMBL (Zebrafish) GRCz11 (GCA_000002035.4)
Ensembl 113
Danio_rerio.GRCz11.113.gtf
"NCBITaxon:7227"
for Drosophila melanogaster
ENSEMBL (Fruit fly) BDGP6.46 (GCA_000001215.4)
Ensembl 113
Drosophila_melanogaster.BDGP6.46.113.gtf
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant