Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DPL-840 Populate missing sample fields in MLWH and consequently in CoTrack #805

Open
neilsycamore opened this issue Jul 18, 2023 · 0 comments
Assignees
Labels
GSU Delivers work for the GSU unit Heron

Comments

@neilsycamore
Copy link

neilsycamore commented Jul 18, 2023

Request:
Would it be possible to have missing data corrected (with strong interest on having corrected the sample_data_received field) for the 9 samples? The identifiers of the samples affected are in the attached CSV
RT784027_identifiers.csv

Background:
As part of a Heron R&D test, the Heron/GLSP lab processed a series of non-consolidated negative plates with a few positives in them. They also processed a "normal" consolidated positive plate (BB-00044085) intended for the regular Heron pipeline work. The positives in both these plates were the same and therefore share the same root ID.

We have identified that some of the fields associated to the 9 samples from source plate BB-00044085 (like the plate_map_received, box_manifest_uploaded, box_scaned and sample_data_received) does not seem to have been populated on CoTrack. A co-worker from the GSU data engineering team has forwarded me a screenshot (attached below) showing a series of errors in the PSD pipeline processing the plate maps that the data engineer team put into RabbitMQ. We suspect that potentially the missing fields have not been set in MLWH and consequently missing on CoTrack as the samples had already been processed.
RT784027_exceptions

when(

        (f.col(""cherry_pick_completed"").isNotNull() | f.col(""cherry_pick_failed"").isNotNull()),

        f.greatest(""box_scanned"", ""plate_map_received"", ""box_manifest_uploaded"")).otherwise(



        f.when(f.col(""box_scanned"").isNotNull() & f.col(""plate_map_received"").isNotNull() & f.col(

            ""box_manifest_uploaded"").isNotNull(),

               f.greatest(""box_scanned"", ""plate_map_received"", ""box_manifest_uploaded"")).otherwise(

            f.lit(None)))

Main Contacts
Camilo G

Related:
Request raised via RT784027

@neilsycamore neilsycamore added Heron GSU Delivers work for the GSU unit labels Jul 18, 2023
@sdjmchattie sdjmchattie self-assigned this Jul 19, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
GSU Delivers work for the GSU unit Heron
Projects
None yet
Development

No branches or pull requests

2 participants