-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
update dev #46
update dev #46
Conversation
Both unit tests and full tests successful locally. The final output table for a full run looks as below:
|
@muffato Once we figure out the correct |
|
(adding @BethYates so that she can comment on the output of the pipeline and how data are retrieved / computed) |
I think this has most of the information I expect to be able to get from the genome afterparty pipeline. There is also some information I wasn't expecting to be able to source from here but can certainly use for the genome note. The only thing I'm not seeing (other than BlobTool Kit data obviously) is the transcript mappability value, is this something this pipeline will produce? |
@muffato and I had a discussion on the topic of mappability and it is not going to be a number but rather a track on the contact maps. Yes that will be done for the genome using a different subworkflow #23. That would be genome mappability though not transcript mappability. Transcript mappability requires RNAseq data from what I understand and we don't have enough of those yet to implement those in our pipeline, but eventually the same/similar workflow as above can be used to generate that. Does that make sense? |
bin/summary_table.py
Outdated
for chrom in [[mol["assigned_molecule_location_type"], mol["length"], mol["gc_percent"]] for mol in seq if "gc_percent" in mol and mol["assembly_unit"] == "non-nuclear"]: | ||
writer.writerow(chrom) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This could undergo a small rewrite like the other for+for+if line
Co-authored-by: Matthieu Muffato <[email protected]>
Co-authored-by: Matthieu Muffato <[email protected]>
Co-authored-by: Matthieu Muffato <[email protected]>
Co-authored-by: Matthieu Muffato <[email protected]>
Moved back to a single table module which accepts optional inputs. Testing with all inputs and without merqury works. Cannot find an example for without busco. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The script is great, thanks !
Co-authored-by: Matthieu Muffato <[email protected]>
Co-authored-by: Matthieu Muffato <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
For the record, create_table.nf
was renamed to createtable.nf
to adhere to the naming conventions.
bedtools_bamtobed
config settings #48, N50 from GoaT not always available #49, Updatecreate_table
to deal with optional inputs #50, Removebedtools_sort
#51PR checklist
nf-core lint
).nextflow run . -profile test,docker --outdir <OUTDIR>
).docs/usage.md
is updated.docs/output.md
is updated.CHANGELOG.md
is updated.README.md
is updated (including new tool citations and authors/contributors).