Primitives specialization for Hasher #40

ogxd · 2023-12-06T16:00:20Z

Context

The Hasher trait exposes methods to hash primitives. Currently, we hash primitives by considering them all as slices of bytes. Hashing can be much performance if the type is known in advance (eg load primitive directly in SIMD vector).

Triggered by the following discussion: rust-lang/hashbrown#487

Goals

Hashing a primitive type is faster.
Hashing a primitive still follows the algorithm principles (and thus remains stable and passes SMHasher)

Todo

Add benchmark to test hashing on primitive, like u32
Implement all methods (write_u32, write_u64, ...)
- Make it work for ARM
- Make it work for x86
Publish benchmark results before/after on ARM/X86]

The text was updated successfully, but these errors were encountered:

ogxd · 2023-12-10T00:14:07Z

It seems the default write (non-specialized) wasn't that bad even on the smallest primitive types. On MacBook M1 pro, using write_u32 yields a +13% performance (which is still substantial).

Another interesting thing is that the hashset benchmark was biased in some cases. black_boxing the keys prevents compiler optimizations that made this bench biased.

ogxd · 2023-12-10T00:17:28Z

Current progress involves hashes that are stable in the context of the Hasher, however hashes for an u32 hashed via Hasher::write_u32 are not stable with hashes using the gxhash(&[u8], ...) method. I think this is acceptable because those are two very different contexts. SMHasher should still pass for both contexes.

ogxd · 2023-12-10T00:20:13Z

Fixed a SIGSEGV when passed [u8] is a null slice (not just an empty slice)

ogxd · 2023-12-10T09:57:02Z

Merging and releasing 2.3.0

On both my ARM and X86 platforms, I get about -13% of hashing time for small inputs (u8, u16, u32, u64, u128 and signed counterparts). On my ARM PC, gxhash Hasher is now faster than ahash for such inputs. My on X86 PC, gxhash remain a bit slower for these inputs (about 10% slower). I have a doubt in ahash Hasher passing SMHasher quality test for such inputs.

ogxd self-assigned this Dec 6, 2023

ogxd mentioned this issue Dec 6, 2023

Hybrid state gxhash #34

Closed

3 tasks

ogxd added the performance 🚀 label Dec 7, 2023

ogxd linked a pull request Dec 10, 2023 that will close this issue

Add Hasher write_x primitives #42

Merged

ogxd closed this as completed in #42 Dec 10, 2023

ogxd mentioned this issue Dec 12, 2023

cargo bench --no-fail-fast -F=avx2 failed #43

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Primitives specialization for Hasher #40

Primitives specialization for Hasher #40

ogxd commented Dec 6, 2023 •

edited

Loading

ogxd commented Dec 10, 2023

ogxd commented Dec 10, 2023

ogxd commented Dec 10, 2023

ogxd commented Dec 10, 2023 •

edited

Loading

Primitives specialization for Hasher #40

Primitives specialization for Hasher #40

Comments

ogxd commented Dec 6, 2023 • edited Loading

Context

Goals

Todo

ogxd commented Dec 10, 2023

ogxd commented Dec 10, 2023

ogxd commented Dec 10, 2023

ogxd commented Dec 10, 2023 • edited Loading

ogxd commented Dec 6, 2023 •

edited

Loading

ogxd commented Dec 10, 2023 •

edited

Loading