Numeric Values #14

Open
laiw12 opened this issue Oct 31, 2018 · 1 comment

laiw12 commented Oct 31, 2018

Hey, we are thinking of using your system in our project. It seems that only categorical attributes are processed by the PrivBayes algorithm, while numeric values such as integers and floats do not go through it. Is that correct?

@haoyueping
Collaborator

The numeric values are first grouped into categories before running the PrivBayes algorithm.

For example, if column "age" has 6 values [12, 15, 24, 54, 57, 32], then "age_grouped" can be [10s, 10s, 20s, 50s, 50s, 30s].

In DataSynthesizer, the numeric values are grouped automatically by their histograms. Please let me know if you have any questions.
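To make the grouping idea concrete, here is a minimal sketch in plain Python. It is not DataSynthesizer's actual code: the function names `group_by_decade` and `group_by_histogram`, the equal-width binning, and the choice of 5 bins are all assumptions made for illustration. The first function reproduces the "age" example above, and the second shows one simple way numeric values could be mapped to histogram bins before a categorical algorithm such as PrivBayes sees them.

```python
def group_by_decade(values):
    """Map each number to a decade label such as '10s' (hypothetical helper)."""
    return [f"{(v // 10) * 10}s" for v in values]


def group_by_histogram(values, num_bins):
    """Assign each value to one of num_bins equal-width bins (hypothetical helper).

    Returns the bin index of each value and the list of bin edges.
    Equal-width binning is an assumption, not necessarily what the library does.
    """
    lo, hi = min(values), max(values)
    width = (hi - lo) / num_bins
    if width == 0:
        # All values are identical, so everything falls into the first bin.
        return [0 for _ in values], [lo, hi]
    edges = [lo + i * width for i in range(num_bins + 1)]
    # The maximum value would land on the last edge, so clamp it into the last bin.
    indices = [min(int((v - lo) / width), num_bins - 1) for v in values]
    return indices, edges


ages = [12, 15, 24, 54, 57, 32]
age_grouped = group_by_decade(ages)
bin_indices, bin_edges = group_by_histogram(ages, num_bins=5)

print(age_grouped)
print(bin_indices)
print(bin_edges)
```

After this step each numeric value is represented by a category (a decade label or a bin index), so it can be treated like any other categorical attribute.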
