Skip to content

Commit

Permalink
Tweak spacing in project yml
Browse files Browse the repository at this point in the history
  • Loading branch information
StevenCannon-USDA committed Oct 13, 2023
1 parent 9590c91 commit d6fbf19
Show file tree
Hide file tree
Showing 2 changed files with 107 additions and 42 deletions.
73 changes: 50 additions & 23 deletions _data/projects/Valliyodan_Brown_2021/project_details.yml
Original file line number Diff line number Diff line change
@@ -1,33 +1,60 @@
---
-
DataStoreID: "Valliyodan_Brown_2021"
short_title: "Cultivar resequencing of 481 diverse accessions"
SoyBaseID: "SoyBase.B2014.02"
title: "Large Scale Sequencing of Germplasm to Develop Genomic Resources for Soybean Improvement"
description: "This project presents the results of resequencing and genotyping 481 of diverse soybean accessions."
BioProjectID: "PRJNA289660, PRJNA384190"
- DataStoreID: Valliyodan_Brown_2021
short_title: Cultivar resequencing of 481 diverse accessions
SoyBaseID: SoyBase.B2014.02
title: Large Scale Sequencing of Germplasm to Develop Genomic Resources for
Soybean Improvement
description: This project presents the results of resequencing and genotyping
481 of diverse soybean accessions.
BioProjectID: PRJNA289660, PRJNA384190
publications:
- citation: "Valliyodan B, Brown AV, Wang J, Patil G, Liu Y, Otyama PI, Nelson RT, Vuong T, Song Q, Musket TA, Wagner R, Marri P, Reddy S, Sessions A, Wu X, Grant D, Bayer PE, Roorkiwal M, Varshney RK, Liu X, Edwards D, Xu D, Joshi T, Cannon SB, Nguyen HT. Genetic variation among 481 diverse soybean accessions, inferred from genomic re-sequencing. Sci Data. 2021 Feb 8;8(1):50."
doi: "10.1038/s41597-021-00834-w"
abstract: "A significant effort to sequence a large number of soybean germplasm lines in the U.S. through the project titled \"Large Scale Sequencing of Germplasm to Develop Genomic Resources for Soybean Improvement\", generated genome sequence information for the most diverse soybean lines from the USDA-GRIN germplasm collection. This sequencing project was supported by the United Soybean Board and three private companies: Bayer CropScience, DOW AgroSciences, LLC, and Monsanto. Dr. Henry Nguyen, Division of Plant Sciences, College of Agriculture, Food and Natural Resources, University of Missouri is the Principal Investigator of this project and Dr. Babu Valliyodan, Nguyen's Laboratory at the University of Missouri is coordinating this research. This is the first large-scale public-private partnership of this magnitude in the soybean genetic research arena. The results of this project will lay the groundwork for future soybean genetics and breeding research. As part of this project, the southern U.S. soybean cultivar Lee (PI 548656) was selected for sequencing to create a second reference genome in soybean. This will complement the first reference genome, Williams82, that was chosen to represent the northern U.S. germplasm. The data generated by this project will benefit the soybean community and allow both public and private soybean breeders and researchers to use these resources to improve soybean varieties for U.S. farmers."
- citation: Valliyodan B, Brown AV, Wang J, Patil G, Liu Y, Otyama PI, Nelson RT,
Vuong T, Song Q, Musket TA, Wagner R, Marri P, Reddy S, Sessions A, Wu
X, Grant D, Bayer PE, Roorkiwal M, Varshney RK, Liu X, Edwards D, Xu D,
Joshi T, Cannon SB, Nguyen HT. Genetic variation among 481 diverse
soybean accessions, inferred from genomic re-sequencing. Sci Data. 2021
Feb 8;8(1):50.
doi: 10.1038/s41597-021-00834-w
abstract: "A significant effort to sequence a large number of soybean germplasm
lines in the U.S. through the project titled \"Large Scale Sequencing of
Germplasm to Develop Genomic Resources for Soybean Improvement\",
generated genome sequence information for the most diverse soybean lines
from the USDA-GRIN germplasm collection. This sequencing project was
supported by the United Soybean Board and three private companies: Bayer
CropScience, DOW AgroSciences, LLC, and Monsanto. Dr. Henry Nguyen,
Division of Plant Sciences, College of Agriculture, Food and Natural
Resources, University of Missouri is the Principal Investigator of this
project and Dr. Babu Valliyodan, Nguyen's Laboratory at the University
of Missouri is coordinating this research. This is the first large-scale
public-private partnership of this magnitude in the soybean genetic
research arena. The results of this project will lay the groundwork for
future soybean genetics and breeding research. As part of this project,
the southern U.S. soybean cultivar Lee (PI 548656) was selected for
sequencing to create a second reference genome in soybean. This will
complement the first reference genome, Williams82, that was chosen to
represent the northern U.S. germplasm. The data generated by this
project will benefit the soybean community and allow both public and
private soybean breeders and researchers to use these resources to
improve soybean varieties for U.S. farmers."
data_links:
- category:
- name: information
instances:
- description: "The list of soybean lines that have been resequenced (spreadsheet)"
url: "https://www.soybase.org/data/v2/Glycine/max/diversity/Wm82.gnm2.div.Valliyodan_Brown_2021/Wm82.gnm2.div.Valliyodan_Brown_2021USB481_accession_List.xlsx"
- name: information
instances:
- description: The list of soybean lines that have been resequenced (spreadsheet)
url: https://www.soybase.org/data/v2/Glycine/max/diversity/Wm82.gnm2.div.Valliyodan_Brown_2021/Wm82.gnm2.div.Valliyodan_Brown_2021USB481_accession_List.xlsx
- category:
- name: data
instances:
- description: "Genetic variant matrix (vcf format) and associated data for 481 accessions (Data Store)"
url: "https://soybase.org/data/v2/Glycine/max/diversity/Wm82.gnm2.div.Valliyodan_Brown_2021/"
- description: "Genetic variant matrix (vcf format) and associated data for 481 accessions (National Ag Library)"
url: "https://data.nal.usda.gov/dataset/data-genetic-variation-among-481-diverse-soybean-accessions"
- description: Genetic variant matrix (vcf format) and associated data for 481
accessions (Data Store)
url: https://soybase.org/data/v2/Glycine/max/diversity/Wm82.gnm2.div.Valliyodan_Brown_2021/
- description: Genetic variant matrix (vcf format) and associated data for 481
accessions (National Ag Library)
url: https://data.nal.usda.gov/dataset/data-genetic-variation-among-481-diverse-soybean-accessions
- category:
- name: sequence reads
instances:
- description: "SRA Files germplasm set 1 (SRP062245)"
url: "https://www.ncbi.nlm.nih.gov/sra/?term=SRP062245"
- description: "SRA Files germplasm set 2 (SRP105183)"
url: "https://www.ncbi.nlm.nih.gov/sra/?term=SRP105183"

- description: SRA Files germplasm set 1 (SRP062245)
url: https://www.ncbi.nlm.nih.gov/sra/?term=SRP062245
- description: SRA Files germplasm set 2 (SRP105183)
url: https://www.ncbi.nlm.nih.gov/sra/?term=SRP105183
76 changes: 57 additions & 19 deletions _data/projects/Valliyodan_Cannon_2016/project_details.yml
Original file line number Diff line number Diff line change
@@ -1,27 +1,65 @@
---
-
DataStoreID: "Valliyodan_Cannon_2016"
short_title: "Lee and PI 483463 Reference Genomes"
SoyBaseID: "SoyBase.B2018.01"
title: "High-Quality Genome Assemblies for G. max cultivar 'Lee' and G. soja accession PI 483463"
description: "High-quality genome assemblies for soybean (Glycine max) and wild soybean (Glycine soja). These provide complements to the primary reference assembly for Glycine max cv. Williams 82 (Wm82.a2). The G. max assembly is for cultivar Lee, which has been used a parent in many southern U.S. breeding projects. The G. soja assembly is for accession PI 483463. This line was chosen for its high genotypic dissimilarity with respect to cultivated soybean. It originates from Shanxi Province, in north-central China."
BioProjectID: "PRJNA407817, PRJNA407822"
- DataStoreID: Valliyodan_Cannon_2016
short_title: Lee and PI 483463 Reference Genomes
SoyBaseID: SoyBase.B2018.01
title: High-Quality Genome Assemblies for G. max cultivar 'Lee' and G. soja
accession PI 483463
description: High-quality genome assemblies for soybean (Glycine max) and wild
soybean (Glycine soja). These provide complements to the primary reference
assembly for Glycine max cv. Williams 82 (Wm82.a2). The G. max assembly is
for cultivar Lee, which has been used a parent in many southern U.S.
breeding projects. The G. soja assembly is for accession PI 483463. This
line was chosen for its high genotypic dissimilarity with respect to
cultivated soybean. It originates from Shanxi Province, in north-central
China.
BioProjectID: PRJNA407817, PRJNA407822
publications:
- citation: "Valliyodan B, Cannon SB, Bayer PE, Shu S, Brown AV, Ren L, Jenkins J, Chung CY, Chan TF, Daum CG, Plott C, Hastie A, Baruch K, Barry KW, Huang W, Patil G, Varshney RK, Hu H, Batley J, Yuan Y, Song Q, Stupar RM, Goodstein DM, Stacey G, Lam HM, Jackson SA, Schmutz J, Grimwood J, Edwards D, Nguyen HT. Construction and comparison of three reference-quality genome assemblies for soybean. Plant J. 2019 Dec;100(5):1066-1082."
doi: "10.1111/tpj.14500"
abstract: "We report reference-quality genome assemblies and annotations for two accessions of soybean (Glycine max) and for one accession of Glycine soja, the closest wild relative of G. max. The G. max assemblies provided are for widely used US cultivars: the northern line Williams 82 (Wm82) and the southern line Lee. The Wm82 assembly improves the prior published assembly, and the Lee and G. soja assemblies are new for these accessions. Comparisons among the three accessions show generally high structural conservation, but nucleotide difference of 1.7 single-nucleotide polymorphisms (snps) per kb between Wm82 and Lee, and 4.7 snps per kb between these lines and G. soja. snp distributions and comparisons with genotypes of the Lee and Wm82 parents highlight patterns of introgression and haplotype structure. Comparisons against the US germplasm collection show placement of the sequenced accessions relative to global soybean diversity. Analysis of a pan-gene collection shows generally high conservation, with variation occurring primarily in genomically clustered gene families. We found approximately 40-42 inversions per chromosome between either Lee or Wm82v4 and G. soja, and approximately 32 inversions per chromosome between Wm82 and Lee. We also investigated five domestication loci. For each locus, we found two different alleles with functional differences between G. soja and the two domesticated accessions. The genome assemblies for multiple cultivated accessions and for the closest wild ancestor of soybean provides a valuable set of resources for identifying causal variants that underlie traits for the domestication and improvement of soybean, serving as a basis for future research and crop improvement efforts for this important crop species."
- citation: Valliyodan B, Cannon SB, Bayer PE, Shu S, Brown AV, Ren L, Jenkins J,
Chung CY, Chan TF, Daum CG, Plott C, Hastie A, Baruch K, Barry KW, Huang
W, Patil G, Varshney RK, Hu H, Batley J, Yuan Y, Song Q, Stupar RM,
Goodstein DM, Stacey G, Lam HM, Jackson SA, Schmutz J, Grimwood J,
Edwards D, Nguyen HT. Construction and comparison of three
reference-quality genome assemblies for soybean. Plant J. 2019
Dec;100(5):1066-1082.
doi: 10.1111/tpj.14500
abstract: "We report reference-quality genome assemblies and annotations for two
accessions of soybean (Glycine max) and for one accession of Glycine
soja, the closest wild relative of G. max. The G. max assemblies
provided are for widely used US cultivars: the northern line Williams 82
(Wm82) and the southern line Lee. The Wm82 assembly improves the prior
published assembly, and the Lee and G. soja assemblies are new for these
accessions. Comparisons among the three accessions show generally high
structural conservation, but nucleotide difference of 1.7
single-nucleotide polymorphisms (snps) per kb between Wm82 and Lee, and
4.7 snps per kb between these lines and G. soja. snp distributions and
comparisons with genotypes of the Lee and Wm82 parents highlight
patterns of introgression and haplotype structure. Comparisons against
the US germplasm collection show placement of the sequenced accessions
relative to global soybean diversity. Analysis of a pan-gene collection
shows generally high conservation, with variation occurring primarily in
genomically clustered gene families. We found approximately 40-42
inversions per chromosome between either Lee or Wm82v4 and G. soja, and
approximately 32 inversions per chromosome between Wm82 and Lee. We also
investigated five domestication loci. For each locus, we found two
different alleles with functional differences between G. soja and the
two domesticated accessions. The genome assemblies for multiple
cultivated accessions and for the closest wild ancestor of soybean
provides a valuable set of resources for identifying causal variants
that underlie traits for the domestication and improvement of soybean,
serving as a basis for future research and crop improvement efforts for
this important crop species."
data_links:
- category:
- name: data
- name: data
instances:
- description: "Genome assembly for Glycine max cv. Lee"
url: "https://soybase.org/data/v2/Glycine/max/genomes/Lee.gnm1.BXNC/"
- description: "Genome assembly for Glycine max PI 483463"
url: "https://soybase.org/data/v2/Glycine/soja/genomes/PI483463.gnm1.YJWS/"
- description: Genome assembly for Glycine max cv. Lee
url: https://soybase.org/data/v2/Glycine/max/genomes/Lee.gnm1.BXNC/
- description: Genome assembly for Glycine max PI 483463
url: https://soybase.org/data/v2/Glycine/soja/genomes/PI483463.gnm1.YJWS/
- category:
- name: browser
instances:
- description: "Genome assembly for Glycine max PI 483463"
url: "https://soybase.org/gb2/gbrowse/glyma.Lee.gnm1/"
- description: "Genome assembly for Glycine max PI 483463"
url: "https://soybase.org/gb2/gbrowse/glyso.PI483463.gnm1/"
- description: Genome assembly for Glycine max PI 483463
url: https://soybase.org/gb2/gbrowse/glyma.Lee.gnm1/
- description: Genome assembly for Glycine max PI 483463
url: https://soybase.org/gb2/gbrowse/glyso.PI483463.gnm1/

0 comments on commit d6fbf19

Please sign in to comment.