the default hash function for floats/doubles is not very good. #538

nicebyte · 2024-08-27T05:24:10Z

the default hash for floats works by casting the value to size_t:

	template <> struct hash<float>
		{ size_t operator()(float val) const { return static_cast<size_t>(val); } };

this is quite bad because anything that is between two integers gets hashed to the same value. my hash table essentially turned into a linear array because most of the values happened to be in the [0; 1] range.

the same applies to hashes for other floating point types (double, long double etc.).

when fixing this, keep in mind that floating point numbers have two representations for 0 (0 and -0) - those two bit patterns should hash to the same value because they represent the same number.

The text was updated successfully, but these errors were encountered:

SirNate0 · 2024-09-23T18:01:24Z

Counterpoint. -0 and 0 are not the same value: They have different behavior when you, for example, do 1.0/-0.0 vs 1.0/+0.0 (giving negative vs positive infinity). But since -0.0==+0.0 (in IEEE 754 at least), it does seem reasonable to make them hash to the same value. Though since NAN == NAN is false, but hash(NAN) == hash(NAN) would be true, there will always be at least that case* where (hash(x) == hash(y)) != (x == y), so it doesn't seem too ridiculous to have hash(-0.0) != hash(+0.0). In other words, replacing the static_cast with a bit_cast doesn't seem too unreasonable.

* Actually many cases when you consider there are many possible bit values for NAN.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

the default hash function for floats/doubles is not very good. #538

the default hash function for floats/doubles is not very good. #538

nicebyte commented Aug 27, 2024

SirNate0 commented Sep 23, 2024

the default hash function for floats/doubles is not very good. #538

the default hash function for floats/doubles is not very good. #538

Comments

nicebyte commented Aug 27, 2024

SirNate0 commented Sep 23, 2024