Kahan 2x2 determinant computation reduces numerical imprecision. #239

TotoGaz · 2021-07-29T00:28:42Z

Related to issue GEOS-DEV/GEOS#1440: some tests testTensorOpsInverseTwoArgs and testTensorOpsInverseOneArg are failing due to large numerical imprecision.

Using the Kahan method to compute 2x2 determinants brings more precision.
3x3 case still untouched.

Warning, I sometimes get compilation warnings like

../coreComponents/LvArray/unitTests/../src/fixedSizeSquareMatrixOpsImpl.hpp(100): warning: calling a constexpr __host__ function("fma") from a __host__ __device__ function("determinant") is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this.

which I find surprising since this fma function is supposed to be device friendly.
fma also seems pretty standard so one should not need any fallback.
So I need to double-check.

Any opinion?

corbett5

If we want to use fma we should put it in LvArray::math and wrap it such that it uses std::fma on host and CUDA's fma on device.

Also this modification makes calculating the determinant significantly more costly. I'm not sure you can answer this question but maybe the kind of matrices that pop up in the unit tests aren't representative of the normal use case.

corbett5 · 2021-07-29T16:43:10Z

src/fixedSizeSquareMatrixOpsImpl.hpp

-    return matrix[ 0 ][ 0 ] * matrix[ 1 ][ 1 ] - matrix[ 0 ][ 1 ] * matrix[ 1 ][ 0 ];
+
+    // Aliases for matrix [[a, b],[c, d]] for improved readability
+    auto const & a = matrix[0][0];


I think matrix[ 0 ][ 0 ] is more readable.

OK, I will change this.

Also this modification makes calculating the determinant significantly more costly. I'm not sure you can answer this question but maybe the kind of matrices that pop up in the unit tests aren't representative of the normal use case.

It is 3 flop vs 2 flop. Look at the second variant I listed in https://godbolt.org/z/4bGdjYbd5

I don't think this is a big deal...for 3d it may be a bit more costly...but not much. I don't think we will often run into this sort of round off problem, but then again, if it doesn't cost much to "fix", then I kind of like this approach.

corbett5 · 2021-07-29T16:45:02Z

src/fixedSizeSquareMatrixOpsImpl.hpp

+
+    // Using the more precise Kahan method to compute the 2x2 determinant.
+    auto const w = b * c;
+    auto const e = fma( -b, c, w );


Is the fma required to obtain adequate precision? I'd prefer to do auto const e = -b * c + w and let the compiler do it's thing.

Sure, I'll check too.

https://godbolt.org/z/ao9GMhPsq

I think you need the fma

TotoGaz · 2021-07-29T17:39:46Z

Also this modification makes calculating the determinant significantly more costly. I'm not sure you can answer this question but maybe the kind of matrices that pop up in the unit tests aren't representative of the normal use case.

I could spot 3x3 dets in the code (FEM stuff), but only 2x2 in the unit tests.
I will double check.

EDIT: no 2x2 dets in the code, only in tests.

I've built an integer dummy fallback in order to prevent integers to be cast to double (I had issues).

Kahan 2x2 determinant computation reduces numerical imprecision.

b2a656b

TotoGaz requested a review from corbett5 July 29, 2021 00:28

TotoGaz mentioned this pull request Jul 29, 2021

Unit tests whith GPU not passing inside nvidia-docker env GEOS-DEV/GEOS#1440

Closed

corbett5 approved these changes Jul 29, 2021

View reviewed changes

Defining the fma function in LvArray::math.

96b1434

I've built an integer dummy fallback in order to prevent integers to be cast to double (I had issues).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kahan 2x2 determinant computation reduces numerical imprecision. #239

Kahan 2x2 determinant computation reduces numerical imprecision. #239

TotoGaz commented Jul 29, 2021

corbett5 left a comment

corbett5 Jul 29, 2021

TotoGaz Jul 29, 2021

rrsettgast Jul 29, 2021

corbett5 Jul 29, 2021

TotoGaz Jul 29, 2021

rrsettgast Jul 29, 2021 •

edited

Loading

TotoGaz commented Jul 29, 2021 •

edited

Loading

Kahan 2x2 determinant computation reduces numerical imprecision. #239

Are you sure you want to change the base?

Kahan 2x2 determinant computation reduces numerical imprecision. #239

Conversation

TotoGaz commented Jul 29, 2021

corbett5 left a comment

Choose a reason for hiding this comment

corbett5 Jul 29, 2021

Choose a reason for hiding this comment

TotoGaz Jul 29, 2021

Choose a reason for hiding this comment

rrsettgast Jul 29, 2021

Choose a reason for hiding this comment

corbett5 Jul 29, 2021

Choose a reason for hiding this comment

TotoGaz Jul 29, 2021

Choose a reason for hiding this comment

rrsettgast Jul 29, 2021 • edited Loading

Choose a reason for hiding this comment

TotoGaz commented Jul 29, 2021 • edited Loading

rrsettgast Jul 29, 2021 •

edited

Loading

TotoGaz commented Jul 29, 2021 •

edited

Loading