more accurate sqrt function #129

pascalkuthe · 2024-05-11T11:40:16Z

I noticed that the square root implementation in num-complex uses conversion trough polar coordinates to compute complex squre roots. Usually the algorithm from https://dl.acm.org/doi/abs/10.1145/363717.363780 is used to compute the complex square root instead.

The algorithm uses hypot/norm and square root to compute csqrt. This approach should be faster since less transcendental calls are needed. Hypot and sqrt also tend to be faster compared to exp/ln/atan/cos/sin. Both hypot and sqart also have much higher precision. Most implementations guarantee that these two functions return the correctly rounded infinite accuracy result.

For prior art you can look at the glibc and musl implementation (https://git.musl-libc.org/cgit/musl/tree/src/complex/csqrt.c). The glibc implementation is a lot more complicated/hard to read because they ensure that underflow floating point exceptions are triggered correctly. I don't think that is something num-complex needs to do. There is also some accuracy loss for subnormal numbers that I didn't handle yet. I left a comment about it, but it is very minor.

cuviper

Thanks! I didn't know about this algorithm.

cuviper · 2024-05-13T23:17:12Z

src/lib.rs

+        //     Complex64::new(2.4421097261308304e-162, 1.0115549693666347e-162)
+        // );
+
+        if self.re.is_zero() && self.im.is_zero() {


Can you add a source for all these special cases? e.g.
https://en.cppreference.com/w/c/numeric/complex/csqrt
(and make sure all those are covered)

I added more test to test_nan to make sure all of these are covered by theses and added a comment

cuviper · 2024-05-13T23:20:08Z

src/lib.rs

+        }
+        if self.re.is_nan() {
+            // nan + nan i
+            return Self::new(self.re, (self.im - self.im) / (self.im - self.im));


We have a direct NaN -- not sure if this should also copysign though.

Suggested change

return Self::new(self.re, (self.im - self.im) / (self.im - self.im));

return Self::new(self.re, T::nan());

I wasn't 100% sure about the sign of nan here and 100% matched the code just to be sure but it seems T::nan() is indeed sufficient and the sign doesn't change.

cuviper · 2024-05-13T23:22:55Z

src/lib.rs

+            // √(inf +/- x i)    = inf +/-  0 i
+            // √(-inf +/- NaN i) = NaN +/- inf i
+            // √(-inf +/- x i)   = 0 +/- inf i
+


Maybe add a variable to make this clearer:

Suggested change

#[allow(clippy::eq_op)]

let zero_or_nan = self.im - self.im;

that is indeed more readable, I also added a comments. good point

cuviper · 2024-05-13T23:25:19Z

src/lib.rs

+        if scale {
+            self = self / four;
+        }
+        if self.re.is_sign_negative() {


We could also use a citation and link in a comment for the algorithm you mentioned.

I added a citation to the algorithm and the musl libc implementation as well as provide some additional background in a comement

pascalkuthe · 2024-05-21T06:51:58Z

Thanks for the fast review! Apologies for not getting back to this yet.

I had to unexpectetly travel for work so I will not be able to get back to this before the weekend.

pascalkuthe · 2024-05-28T12:24:16Z

This should be ready for another round of review, apologies again for the delay

ralphtandetzky

LGTM.

pascalkuthe · 2024-12-08T20:49:15Z

@cuviper still quite interested in this.

Anything I can do to push this over the finish line?

cuviper · 2024-12-11T00:47:43Z

How about tests that demonstrate the improved accuracy?

pascalkuthe · 2024-12-14T23:31:51Z

@cuviper testing for accuracy is tricky. I used numpy (which generates the same results as glibc/musl) to generate some reference numbers for some cases that round differently with the current implementation (but the same with the new one).

This is a bit of a chicken and egg problem with showing that the rounding is truely more accurate but since this is a well known algorithm (and very well known implementations) I think it's a fair approach.

I choose some arbitrary examples to demonstrate accuracy. The current implementation rounds incorrectly for a ton of numbers so it's not practical to test exhaustively

cuviper · 2024-12-14T23:45:10Z

Does the accuracy show itself by comparing the result squared to the original value? You may be able to use simple inputs without having to hard-code expected results. Hopefully that round trip is better now than it was before, at least for some inputs. (and hopefully not any worse in general, but I agree we can't really be exhaustive about it)

pascalkuthe · 2024-12-15T01:39:54Z

I did find a few where the roundtrip was better by an ULP or two. Some of the accuracy gets lost (particularly since it's multiple operations for complex) during the multiplication so its not perfect test (makes it a bit harder to find cases) but a good addition, thanks!

During some trial and error I also never had any case where accuracy was worse.

It was quite a while since I worked on this but I think I also semi exhaustively (somewhat sampled) tested against glibc at some point IIRC. With this algorithm I am pretty certain that existing implementations get this right so perfectly matching those is the test that gave me the most confidence in the implementation

cuviper reviewed May 13, 2024

View reviewed changes

more accurate sqrt function

ed22935

pascalkuthe force-pushed the sqrt branch from 88f491f to ed22935 Compare May 28, 2024 12:21

ralphtandetzky approved these changes Jul 21, 2024

View reviewed changes

add accuracy testcase

6178de6

add roundtrip tests

c5b8101

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

more accurate sqrt function #129

more accurate sqrt function #129

pascalkuthe commented May 11, 2024

cuviper left a comment

cuviper May 13, 2024

pascalkuthe May 28, 2024

cuviper May 13, 2024

pascalkuthe May 28, 2024

cuviper May 13, 2024

pascalkuthe May 28, 2024

cuviper May 13, 2024

pascalkuthe May 28, 2024

pascalkuthe commented May 21, 2024

pascalkuthe commented May 28, 2024

ralphtandetzky left a comment

pascalkuthe commented Dec 8, 2024 •

edited

Loading

cuviper commented Dec 11, 2024

pascalkuthe commented Dec 14, 2024

cuviper commented Dec 14, 2024

pascalkuthe commented Dec 15, 2024 •

edited

Loading

	return Self::new(self.re, (self.im - self.im) / (self.im - self.im));
	return Self::new(self.re, T::nan());

more accurate sqrt function #129

Are you sure you want to change the base?

more accurate sqrt function #129

Conversation

pascalkuthe commented May 11, 2024

cuviper left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pascalkuthe commented May 21, 2024

pascalkuthe commented May 28, 2024

ralphtandetzky left a comment

Choose a reason for hiding this comment

pascalkuthe commented Dec 8, 2024 • edited Loading

cuviper commented Dec 11, 2024

pascalkuthe commented Dec 14, 2024

cuviper commented Dec 14, 2024

pascalkuthe commented Dec 15, 2024 • edited Loading

pascalkuthe commented Dec 8, 2024 •

edited

Loading

pascalkuthe commented Dec 15, 2024 •

edited

Loading