A repository dedicated to pipelines used to turn basic genomic data into formatted Ensembl core databases. Also allow users to dump core databases into various formats.
File formats handled : FastA, GFF3, JSON (following BRC4 specifications).
Check out installation section for further information on how to install the project.
Check out the usage section for further information of requirements to run ensembl-genomio pipelines.
- Genome loader: Creates an Ensembl core database from a set of flat files.
- Genome dumper: Dumps flat files from an Ensembl core database.
- Additional seq prepare: BRC/Ensembl metazoa pipeline. Preparation of genome data loading files for new sequence(s) to existing species databases.
- Genome Prepare: BRC/Ensembl metazoa pipeline. Retrieve data for genome(s), obtained from INSDC and RefSeq, validate and prepare GFF3, FASTA, JSON files for each genome accession.
Software as part of Ensembl GenomIO is distributed under the Apache-2.0 License.