Restricting str-profile generation to contigs/regions provided in .bed or similar #50

Open
am8265 opened this issue Dec 15, 2021 · 1 comment

Comments

@am8265

am8265 commented Dec 15, 2021

Hello developers,

Would it be possible to add an option that restricts STR-profile generation to specific contigs/regions,
supplied in a .bed or similar file, when running this command:
ExpansionHunterDenovo profile

Hopefully this would reduce the runtime spent on bad or incomplete contigs.

Thanks!

@egor-dolzhenko
Contributor

Thanks for the suggestion! The main hurdle with implementing this is that reads originating in expanded repeats on one contig often get misaligned to other contigs. To avoid jumping around the BAM file to recover these reads, EHdn simply streams through the entire file. This usually ends up being much faster than a targeted analysis of a large number of regions.

However, if you are interested in profiling just a small number of regions, we could consider implementing a separate targeted workflow.
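
To make the hurdle above concrete, here is a minimal Python sketch. It is not EHdn code: the `Region` and `Alignment` records, the `in_repeat` flag, and the toy coordinates are all hypothetical, chosen only to show how a naive BED filter would miss a repeat read that was misaligned to another contig, while a full pass over every alignment still counts it.

```python
from typing import Iterable, List, NamedTuple, Optional


class Region(NamedTuple):
    contig: str
    start: int  # 0-based, inclusive (BED convention)
    end: int    # 0-based, exclusive


class Alignment(NamedTuple):
    read_name: str
    contig: str
    pos: int         # 0-based alignment start
    in_repeat: bool  # hypothetical flag: read consists of repeat sequence


def parse_bed(lines: Iterable[str]) -> List[Region]:
    """Parse the first three BED columns, skipping comments and header lines."""
    regions = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith(("#", "track", "browser")):
            continue
        contig, start, end = line.split()[:3]
        regions.append(Region(contig, int(start), int(end)))
    return regions


def in_regions(aln: Alignment, regions: List[Region]) -> bool:
    """True if the alignment start falls inside any of the regions."""
    return any(
        r.contig == aln.contig and r.start <= aln.pos < r.end for r in regions
    )


def count_repeat_reads(
    alignments: Iterable[Alignment], regions: Optional[List[Region]] = None
) -> int:
    """Count repeat reads in one pass, optionally restricted to regions."""
    return sum(
        1
        for aln in alignments
        if aln.in_repeat and (regions is None or in_regions(aln, regions))
    )


# Toy data: three repeat reads from the same locus, one of them misaligned.
regions = parse_bed(["chr4\t3000000\t3100000"])
alignments = [
    Alignment("r1", "chr4", 3074900, True),
    Alignment("r2", "chr4", 3074950, True),
    Alignment("r3", "chr12", 6900000, True),   # misaligned to another contig
    Alignment("r4", "chr4", 3050000, False),   # ordinary read
]

full_pass = count_repeat_reads(alignments)            # sees every repeat read
restricted = count_repeat_reads(alignments, regions)  # drops the misaligned one
```

In this toy case the restricted count is lower than the full-pass count because `r3` lies outside the BED interval. Recovering such reads in a targeted mode would require extra lookups elsewhere in the file.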
