-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Construction a panTElib #8
Comments
Hi yfchen, Thank you for your interest in HiTE.
Yes, I believe replacing EDTA with HiTE should easily allow you to create a panHiTE. In fact, panHiTE is already on our to-do list. For now, you can achieve your goal by directly using HiTE in place of EDTA.
Theoretically, using a complete full-length TE library allows for the annotation of both full-length and fragmented (non-full-length) TEs in the genome. Since fragmented TEs originate from full-length TEs, they share high homology and can be easily annotated using RepeatMasker.
I do not recommend adding unknown TEs into HiTE unless their reliability can be ensured. For a more comprehensive and accurate annotation, I suggest combining the Repbase and HiTE libraries. In our next update, we will add a Best, |
Thank you very much for your prompt and detailed response. I am excited to hear about the upcoming updates and the potential to create a panHiTE. I appreciate your suggestions and will look forward to utilizing the --curated_lib parameter in the next version. |
Hi @porkfan, I wanted to let you know that we've just added a Best regards, Kang |
Hello @porkfan, I hope this message finds you well. Thank you for your continued support of HiTE. As we plan to develop panHiTE, we aim to address the key challenges users face and would value your insights on this. Could you please share what specific problems you would like panHiTE to solve or what types of output files you would find most useful? Best regards, |
Of course! I have added you on WeChat, and we can have more in-depth discussions there. I'm really looking forward to your future work. |
Dear Professor Hukang,
I am delighted to see that your work has developed a pipeline capable of accurately identifying full-length TEs. I have also been troubled by the overly fragmented TEs annotated by previous software like EDTA2. However, I believe EDTA2 has its advantages, such as their panEDTA pipeline, which allows the construction of a TE library at the pangenome level. This enables the use of a single library to annotate multiple genomes, facilitating comparison and analysis.
I would like to know if it is possible to use the panEDTA pipeline to cluster libraries constructed by HiTE for multiple genomes, thereby generating a panTElib. Perhaps you could consider adding this functionality in future updates? Additionally, I noticed that your article and the peer review comments highlight the high annotation accuracy of your pipeline. However, focusing only on full-length TEs may overlook many non-full-length TEs that are still abundant in the genome. Could I combine your annotation results with the more comprehensive results from EDTA2 to achieve a more complete and accurate annotation?
Thank you again for your work. Best wishes!
yfchen
The text was updated successfully, but these errors were encountered: