Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allocation into lineages for metazoan LTR-RTs #51

Open
alexandrosbousios opened this issue Jul 26, 2023 · 4 comments
Open

Allocation into lineages for metazoan LTR-RTs #51

alexandrosbousios opened this issue Jul 26, 2023 · 4 comments

Comments

@alexandrosbousios
Copy link

Hi Ren-Gang,

This issue may partially overlap with previous questions, but I think it will help if it shows up separately here.

Is there any progress/updates on allocating animal LTR-RTs into lineages (SIRE, Ale, Tekay etc.) as you successfully do in plants, or this is yet not possible?

Related to this, what is the purpose of selecting -db rexdb-metazoa instead of rexdb-plants? I suppose that it is helping towards a better allocation into Copia, Ty3, or unknown LTR-RTs, correct?

Could you also please clarify (and maybe add a note in the main page of what is rexdb-tir and rexdb-pnas? Apologies if this information is somewhere but I've missed it.

Also a request: could you add an output file in TEsorter that the user can easily select the fasta files of the full-length elements (i.e. the original input file) that are SIRE, or ATHILA etc.? That will be very handy if someone is interested in further analyzing a specific lineage.

Thanks,
Alex

@zhangrengang
Copy link
Owner

Hi Alex,
The lineage-level classification relies on the database. There is no update of databases at present, but GyDB may provide some details for animal. It is possible to create such a database for animal, but I am not familiar to this.

-db rexdb-metazoa provides a metazoa subset of REXdb (similarly, rexdb-plant is a plant subset of REXdb). It may be more specific for animal.

rexdb-pnas can be referred to https://github.com/zhangrengang/TEsorter#citations (the prefix rexdb may be confused and I will revise the name in future). rexdb-tir is a DNA/TIR-element subset of REXdb. It is for test purpose and now is not available in the last version.

The request may be implemented with get_record.py in the package. For example:

cat rice6.9.5.liban.rexdb.dom.tsv | grep -P "\-RT\t" | grep SIRE | get_record.py -i rice6.9.5.liban.rexdb.dom.faa -o rice6.9.5.liban.rexdb.dom.SIRE-RT.faa -t fasta

@alexandrosbousios
Copy link
Author

Hi Ren-Gang,

Thanks for the clarifications; take-on message is that it is still not possible to allocate animal LTR-RTs into lineages yet. Hopefully, some animal TE labs will take the plunge in a way that Neumann et al (2019) and others before them did for plants.

Your script is very helpful and I've missed it, but my request was for retrieving the sequence of the full-length elements, not their genes!

Best,
Alex

@zhangrengang
Copy link
Owner

Hi,Alex. You may just extract the sequence id of the full-length elements (i.e. from rice6.9.5.liban.rexdb.cls.tsv), and then use get_record.py to extract the sequences from the input. It should be a similar process.

@alexandrosbousios
Copy link
Author

Thanks Ren-Gang!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants