You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
CDC selects samples for their dashboard (and models) that have a complete S gene. We need to begin tracking the S gene so we know how many of our samples are making it to the CDC dashboard, so we are meeting our deliverables. We need to incorporate this into our wdl and makes sure we are getting usable outputs.
Solution
Notes from data meeting
we probably want to use the consensus sequence to calculate the percent coverage; samtools coverage doesn't allow a min depth, so the percent coverage would be inflated.
Nextclade (dev version) - nextclade.csv file provides percent coverage per gene available (with some required parsing).
to get coverage across amplicons, use the alignment consensus sequence and coordinates of the amplicon regions.
weird consensus calling with medaka
we will wait to see what Sam comes up with regarding the analysis he is performing using past data.
Upstream effects
None(?)
Downstream effects
Updating BigQuery Data Transfers to account for the new column headers of the results summary file.
The text was updated successfully, but these errors were encountered:
molly-hetheringtonrauth
changed the title
[REQUIREMENT] S gene coverage depth for data tracking and for QC notebook
[FEATURE] S gene coverage
Sep 18, 2024
Time estimates for subsections:
WDL changes and testing - 24
Bigquery schema updates - 18
BigQuery table and Tableau dashboard - 12
Any other possible breaks - 16
Feature Request
CDC selects samples for their dashboard (and models) that have a complete S gene. We need to begin tracking the S gene so we know how many of our samples are making it to the CDC dashboard, so we are meeting our deliverables. We need to incorporate this into our wdl and makes sure we are getting usable outputs.
Solution
Notes from data meeting
samtools coverage
doesn't allow a min depth, so the percent coverage would be inflated.nextclade.csv
file provides percent coverage per gene available (with some required parsing).Upstream effects
None(?)
Downstream effects
The text was updated successfully, but these errors were encountered: