-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The difference between two genotyper algorithms #50
Comments
Hello, we recommend the use of the cluster genotyper when there are not many repeats to genotype. The cluster genotyper will compare all STR sequences with each other and cluster them, whereas the size genotyper splits reads based on the STR sequence size in each read to maximize the difference between alleles to the difference within them. The cluster genotyper uses more information but is significantly slower, so for WGS applications we recommend using size. The size genotyper is described in the TRGT manuscript, whereas the cluster genotyper algorithm paper is still a work in progress. Hope this helps! Happy to clarify further. |
Thank you so much!
and we cannot find any supporting reads in IGV. my code is |
Hello, Without looking at the reads, I'd guess, since this region is repetitive, that, when looking for flanking bases, some reads aligned to the repetitive sequence donwstream of this repeat. Maybe try to increase the |
Thank you for the suggestion. I tested the cluster algorithm, and found that this false positive event was absent. It seems that the cluster algorithm improves the repeat expansion detection a lot and is not such time-consuming as I expected. |
@ywzhang071394, thank you for letting us know. Would you be open to sharing a waterfall plot of this repeat with us? We could do it by email if you prefer. |
Hi, Sorry I need to reopen this issue, since I want to debug the size algorithm. I tried to increase --flank-len parameter to 500, but nothing was improved. Could you help comment on this? |
Hi,
Thank you for the nice tool! Could you give a introduction on the two genotyper algorithms "size" and "cluster".
I looked through your paper and github page but did not find any related info.
Thanks
The text was updated successfully, but these errors were encountered: