Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New Encounters ETL - Superset #11

Open
csc5k opened this issue Nov 1, 2017 · 9 comments
Open

New Encounters ETL - Superset #11

csc5k opened this issue Nov 1, 2017 · 9 comments
Assignees

Comments

@csc5k
Copy link

csc5k commented Nov 1, 2017

Creating a ticket to track the new extract of encounters we're loading into Superset. Each row in the extract should be a unique encounter. The extract should include the following columns:

  • encounter_id - Encounter ID
  • encounter_date - Encounter Date
  • zeir_id - ZEIR ID
  • Gender (male/female) - do we want to do male 1 and female 1?
  • date of birth
  • Mother id
  • Child HIV exposure
  • facility_id
  • facility name
  • district id
  • district name
  • province_name
  • province_id
  • provider id
  • provider name
  • child weight
  • 1/NULL for each of the following vaccines/encounter types:
    • BCG 1
    • OPV 0
    • OPV 1
    • PCV 1
    • Penta 1
    • Rota 1
    • OPV 2
    • PCV 2
    • Penta 2
    • Rota 2
    • OPV 3
    • PCV 3
    • Penta 3
    • Measles 1
    • MR 1
    • OPV 4
    • Measles 2
    • MR 2
    • BCG 2
    • vitamin A - not a vaccine, but an encounter
    • mebendezol (anti-worm) - not a vaccine, but an encounter
    • child weighed? - not a vaccine, but an encounter

At a later date, we will look to expand the encounter info to determine whether a vaccine was given on time or late

@csc5k csc5k changed the title Updates to Encounters ETL New Encounters ETL - Superset Nov 2, 2017
@mberg
Copy link

mberg commented Nov 2, 2017

Clay - can we add a slug and label for each one.

Also let's group some of these. Ie) child_weighed - true/false and child_weight - 8.4 kg

@AndreLesa
Copy link

Let's add the province name in there too.

@mberg
Copy link

mberg commented Nov 2, 2017 via email

@csc5k
Copy link
Author

csc5k commented Nov 2, 2017 via email

@csc5k
Copy link
Author

csc5k commented Nov 2, 2017

See the attached spreadsheet for a list of data elements incl slug/column header, readable name, data type, and detail.

Currently we're including both ID and name of things like provider, district, facility, etc. - are the IDs necessary or should we just focus on name?

https://docs.google.com/a/ona.io/spreadsheets/d/1qwNzQRsABqB7orYblruTqNUBo8a221pVYas1s5cYxH4/edit?usp=sharing

ZEIR Encounter ETL Spec.xlsx

@AndreLesa
Copy link

AndreLesa commented Nov 2, 2017 via email

@csc5k
Copy link
Author

csc5k commented Nov 2, 2017

OK cool. To be clear, I removed the following (updated version also linked and attached)

facility ID
district ID
province ID
provider ID

https://docs.google.com/a/ona.io/spreadsheets/d/1qwNzQRsABqB7orYblruTqNUBo8a221pVYas1s5cYxH4/edit?usp=sharing

ZEIR Encounter ETL Spec (1).xlsx

@csc5k csc5k mentioned this issue Nov 3, 2017
@csc5k
Copy link
Author

csc5k commented Nov 3, 2017

Just pinging @mberg @AndreLesa to see if they have further feedback on the specification for the encounters ETL.

Note that in order to calculate the Dropout rate, we're going to create a separate ETL that transmits aggregate data for each date. This ETL is documented here: #12

This is due to the way line graphs in Superset aggregate numbers. I can explain in more detail if there are questions.

@csc5k
Copy link
Author

csc5k commented Nov 16, 2017

Reviewed output with @AndreLesa and @manutarus . We're going to update "child HIV exposure" column to return a YES/NO/NULL instead of POSITIVE/NEGATIVE/NULL

The column in Postgres will have the header "child_hiv_exposed"

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants