Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DPL-048-2: malformed root_sample_ids without duplicates in MLWH, unpicked #536

Closed
3 tasks
Jonnie-Bevan opened this issue Mar 17, 2022 · 1 comment
Closed
3 tasks
Assignees
Labels
Data integrity data fix Enhancement New feature or request

Comments

@Jonnie-Bevan
Copy link

Jonnie-Bevan commented Mar 17, 2022

User story
Part of the wider DPL-048-1 issue which spawned in turn from DPL-048. Related are DPL-0483-3, DPL-0483-4 and DPL-0483-5.

This story concerns the 4,699 malformed root_sample_ids in the MLWH lighthouse_sample table. These came from the MK lighthouse lab in August 2021 and have an extra substring (something like '_RNA123456789') concatenated on the end of the correct ID.

Fix
Since these samples were never picked they are not in SequenceScape, Event Warehouse or the MLWH sample table, so the only places that these need to be fixed are in the MongoDB sample table and MLWH lighthouse_sample table.

These can be fixed by using a script to correct the ID (remove everything after underscore) and then updating the relevant row with the correct ID.

Who are the primary contacts for this story
Jonnie B
Alan K

Acceptance criteria

  • data are fixed in MLWH lighthouse_sample table
  • data are fixed in MongoDB sample table
  • have double-checked that this data does not exist in SS/Ev. Warehouse/MLWH Sample table.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Data integrity data fix Enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants