Large expansions without reads covering the entire repeat region #44
Comments
Hi Ben, Thanks for a great question. That's right, TRGT only uses reads that span the entire repeat region. We are definitely planning to add support for reads that partially overlap the repeat in future versions of TRGT. Do you by any chance have a sample with a known pathogenic repeat expansion exceeding HiFi read length? Originally we planned to add support for very long repeats much earlier, but it then turned out that all the very long expansions we had access to were detectable with the current TRGT approach. Perhaps long repeat expansions tend to be highly mosaic, which would allow us to fully capture the expanded alleles within 15 kb+ reads? (This of course applies to known pathogenic repeats and not to other very long repeats in the human genome.) Best wishes,
Hi Egor, thanks for the quick response. I was hoping that you'd add this functionality. I am going to shelve my validation data for now, but will be happy to pick this up when you make modifications to the algorithm. Thanks again!
Hi Ben, I see, thanks. Does your hybrid capture protocol involve PCR amplification? In my experience, PCR can lead to complete or nearly complete dropout of the expanded alleles. If you’d like, we could create a one-off version of TRGT that uses flanking reads to help evaluate your data. Let’s connect by email if this is something you’d like to explore. Best wishes,
Hi Egor, yes, like probably every hybrid capture protocol this one includes a few cycles of PCR. Under-representation of expanded alleles has to be expected, you are right. That is why we are looking at cranking up sensitivity as much as possible. We will have to monitor the effect on precision. Regards,
Hi,
Thank you for this great tool and for diligently answering questions here. If I understand correctly, TRGT only considers reads in the analysis that span the entire repeat region. It therefore fails to report any haplotype where the repeat expansion exceeds the read length - is that correct?
Are there any plans to also report evidence about these large repeats in the future? You could at least give a lower bound on their size.
Regards,
Ben
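To make the "lower limit" idea concrete, here is a minimal sketch of how reads that do not span the repeat could still bound the allele size from below. This is not how TRGT works internally; `ReadAlignment`, `spans_repeat`, `repeat_length_lower_bound`, and the `min_flank` parameter are all hypothetical names invented for illustration, and the read model (reference span plus a count of repeat bases seen in the read) is a deliberate simplification.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class ReadAlignment:
    """Hypothetical, simplified view of one aligned read."""
    ref_start: int     # 0-based inclusive reference start
    ref_end: int       # 0-based exclusive reference end
    repeat_bases: int  # read bases that consist of repeat sequence


def spans_repeat(read: ReadAlignment, repeat_start: int, repeat_end: int,
                 min_flank: int = 50) -> bool:
    """True if the read covers the repeat plus min_flank bases on both sides."""
    return (read.ref_start <= repeat_start - min_flank
            and read.ref_end >= repeat_end + min_flank)


def repeat_length_lower_bound(reads: Iterable[ReadAlignment],
                              repeat_start: int, repeat_end: int,
                              min_flank: int = 50) -> Optional[int]:
    """Largest repeat content seen in any non-spanning read, in bases.

    A read that enters the repeat but does not exit it on the other side
    shows that some allele holds at least `repeat_bases` of repeat sequence.
    Returns None if no non-spanning read overlaps the repeat.
    """
    best: Optional[int] = None
    for read in reads:
        if spans_repeat(read, repeat_start, repeat_end, min_flank):
            continue  # spanning reads give an exact size, not just a bound
        overlaps = read.ref_start < repeat_end and read.ref_end > repeat_start
        if not overlaps:
            continue
        if best is None or read.repeat_bases > best:
            best = read.repeat_bases
    return best


# Toy data: repeat annotated at reference positions [1000, 1300).
reads = [
    ReadAlignment(0, 2000, 300),      # spans the repeat: ignored here
    ReadAlignment(200, 1300, 9000),   # left flank, ends inside the repeat
    ReadAlignment(1100, 1300, 12000), # lies entirely within the repeat
    ReadAlignment(5000, 6000, 0),     # unrelated read
]
bound = repeat_length_lower_bound(reads, 1000, 1300)
```

In this toy example the bound comes from the read that lies entirely inside the repeat. In real data, assigning such reads to the correct locus and haplotype is the hard part and is not addressed by this sketch.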