Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

zcta hospitalizations issue #2

Open
alinasmahl1 opened this issue Aug 18, 2021 · 3 comments
Open

zcta hospitalizations issue #2

alinasmahl1 opened this issue Aug 18, 2021 · 3 comments

Comments

@alinasmahl1
Copy link

@rl627
Some ZCTA's are missing monthly hospitalization data-- we believe this is because , "The rate of hospitalized cases per 100,000 people is suppressed for a specific geography when the count of hospitalized deaths is between 1 and 4 due to imprecise and unreliable estimates." In order to calculate cumulative rates, we will impute monthly missing.

we also need to update the readme to describe the hosp_per100k column, and include the imputation rule description.

@ran-codes
Copy link
Member

ran-codes commented Aug 19, 2021

The longitudinal rates data was incorrectly aggregated. This has been corrected (https://github.com/Drexel-UHC/covid_inequities_project/blob/main/Outcomes/totals/byZCTA_bchc.csv) ! See the distribution of hosp rates byZCTA for the three cities below. This updated data should now have correct data and no missing ZCTA for NYC. Let me know if things look okay to you.

image

@alinasmahl1
Copy link
Author

This looks great! What did you end up imputing the missing values to ? Want to make sure I"m giving the NYU folks all the detail on the data.

@ran-codes
Copy link
Member

psuedocode:

  1. for each month/ZCTA, calculate hospitalization_count from rate*population_denom. If that month/ZCTA was missing a rate (suppression) impute a hospitalization_count of 4 (going with 4 just to prevent underestimation but we can always change that to another number between 1-4).
  2. for each ZCTA, calculate cumulative hospitalization_count by summing all the monthly hospitalization_count
  3. for each ZCTA, the cumulative rate was calculated by (cumulative hospitalization_count)/pop_denom

I'll update this in the readMe's ETC sometime tomorrow.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants