Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use NLP to detect nationality statistics #35

Closed
jayvdb opened this issue Dec 6, 2017 · 3 comments
Closed

Use NLP to detect nationality statistics #35

jayvdb opened this issue Dec 6, 2017 · 3 comments

Comments

@jayvdb
Copy link
Member

jayvdb commented Dec 6, 2017

Use the participants display name to determine nationality.

Discard data which is unsuitable or confidence level is low, but clearly display what percentage of the participants display names were unsuitable or was unusable for the statistics.

wrt unsuitable data, if #32 doesnt land first, skip any display name which contains numerals and doesnt contain an uppercase letter.

To warm you up a bit, see here

Come onto the coala Zulip to discuss how to solve this.

@andrewda
Copy link
Member

andrewda commented Dec 6, 2017

This is really cool! If #8/#32 lands first, we'll also be able to use a user's location from GitHub.

@invisible-defects
Copy link
Member

Sounds really interesting! Have a couple ideas on this, will share them in Zulip soon.

@jayvdb jayvdb changed the title Use NLP to generate nationality statistics Use NLP to detect nationality statistics Dec 7, 2017
@jayvdb
Copy link
Member Author

jayvdb commented Apr 18, 2018

This task was revised slightly, and wont be done using NLP.

@jayvdb jayvdb closed this as completed Apr 18, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Development

No branches or pull requests

4 participants