race conditions #14

kdm9 · 2024-07-09T16:14:55Z

Hello all,

There seem to be quite a few race conditions if one runs lifton in parallel. A project I'm working on requires running lifton from several dozen source annotations to several hundred references, and so I use snakemake to parallelise runs across a cluster. However (at least) the following race conditions appear:

If the output files are something like output/$SOURCE/$TARGET_NAME.gff, there's a race condition as lifton writes to output/$SOURCE/lifton_output regardless of which genome is being annotated, which corrupts the intermediate files.
It seems like at certain stages the gffutils sqlite database is written to, even if it already exists before creating (e.g. with ANALYSE). This causes race conditions and crashes as only one process can write to a sqlite db at once (normally).

With liftoff, one could work around these same issues because liftoff accepted a temp/intermediate directory name (so you could use e.g. output/$SOURCE/$TARGET_NAME/ instead of output/$SOURCE/lifton_output, making each job's directory unique). Liftoff also did not modify the gff database if it already existed, so if you pre-computed all needed gff_dbs before running any liftoff, then you were guaranteed not to have race conditions on the sqlite db.

I'd encourage you to adopt these workarounds in lifton.

best,
Kevin

The text was updated successfully, but these errors were encountered:

kdm9 · 2024-07-31T07:25:15Z

@Kuanhao-Chao any update on this issue?

Kuanhao-Chao · 2024-08-02T03:11:44Z

Hi @kdm9, I am currently on an internship and won't have time to fix this issue in August. I will get it back to you in September. Thanks for reporting this issue. It is indeed important to allow users to run LiftOn in parallel. Best, Kuan-Hao

…

On Wed, Jul 31, 2024 at 12:25 AM Dr. K. D. Murray ***@***.***> wrote: @Kuanhao-Chao <https://github.com/Kuanhao-Chao> any update on this issue? — Reply to this email directly, view it on GitHub <#14 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AGG4TAC4NCXH2MHLMG3TR2TZPCGPBAVCNFSM6AAAAABKTGEQ62VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDENJZHA2TAMJYGQ> . You are receiving this because you were mentioned.Message ID: ***@***.***>

kdm9 · 2024-08-02T07:34:50Z

OK, great, thanks in advance and enjoy your internship!

Kuanhao-Chao self-assigned this Jul 9, 2024

Kuanhao-Chao added bug Something isn't working feature_request labels Aug 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

race conditions #14

race conditions #14

kdm9 commented Jul 9, 2024

kdm9 commented Jul 31, 2024

Kuanhao-Chao commented Aug 2, 2024 via email

kdm9 commented Aug 2, 2024

race conditions #14

race conditions #14

Comments

kdm9 commented Jul 9, 2024

kdm9 commented Jul 31, 2024

Kuanhao-Chao commented Aug 2, 2024 via email

kdm9 commented Aug 2, 2024