Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Duplicate exons, overlapping CDSs in output #26

Open
nikostr opened this issue Oct 25, 2024 · 0 comments
Open

Duplicate exons, overlapping CDSs in output #26

nikostr opened this issue Oct 25, 2024 · 0 comments

Comments

@nikostr
Copy link

nikostr commented Oct 25, 2024

One piece of my output looks as follows:

ptg000006l	LiftOn	gene	86505371	86507566	.	-	.	ID=g31624;source=Liftoff
ptg000006l	LiftOn	transcript	86505371	86507566	.	-	.	ID=g31624.t1;Parent=g31624;mutation=frameshift,stop_codon_gain;protein_identity=0.718;dna_identity=0.652;status=LiftOn_chaining_algorithm
ptg000006l	LiftOn	exon	86505371	86505373	.	-	.	ID=exon_138261;Parent=g31624.t1
ptg000006l	LiftOn	exon	86505371	86505373	.	-	.	ID=exon_138261;Parent=g31624.t1
ptg000006l	LiftOn	exon	86506533	86507566	.	-	.	ID=exon_138262;Parent=g31624.t1
ptg000006l	LiftOn	exon	86506650	86507566	.	-	.	ID=exon_138263;Parent=g31624.t1
ptg000006l	LiftOn	CDS	86507413	86507566	1636	-	1	Parent=g31624.t1
ptg000006l	LiftOn	CDS	86506650	86507566	.	-	0	Parent=g31624.t1

This does not look right (duplicate exons, overlapping CDSs), and it causes gffread to crash when attempting to extract protein sequences. I ran the following command:

lifton \
    -g $GFF \
    -o sample.lifton.gff3 \
    -copies \
    --threads 10 \
    $TARGET \
    $REF

using LiftOn v1.0.5. Unfortunately I can't currently share my input data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant