Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hash function in minHash.c may not be the best choice #1

Open
elmtree8 opened this issue Jun 8, 2017 · 0 comments
Open

Hash function in minHash.c may not be the best choice #1

elmtree8 opened this issue Jun 8, 2017 · 0 comments

Comments

@elmtree8
Copy link
Collaborator

elmtree8 commented Jun 8, 2017

I implemented the first function from this page which works well historically, didn't cause any collisions on my small sample, and produced hashes that gave good results for min_hash_sim.py. However, reading this page makes me wonder if we can work on finding a better one in the future.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant