Optimized transformations by transposing and then multiplying in place #424

YoshuaNava · 2020-12-02T12:17:57Z

After merging #419 I put some extra time in finding out if we could further optimize the transformation of descriptors.

Based on that I implemented some snippets:

To benchmark the features transformation: https://godbolt.org/z/ehWfKG
To benchmark the descriptors rotation: https://godbolt.org/z/7xrnjW

I found out that for matrices like the ones we are processing (MxN, with M sort of small [1,10], but N quite large), it's better to transpose first, and then apply on the left. The Eigen parser seems to faster code when we do this.

Compared to what we had before:

The proportion of time spent between the operations is still similar 30-70% current, but the absolute time taken by the functions to process the data is shorter.

My thought on why this happens is that when we transpose we might be loading the matrix in L2/L3 cache (even though we don't enforce Eigen) in-time evaluation, and when the compiler sees applyOnTheRight, it optimizes for an in-place operation on a matrix that is dominantly column-based.

Something curious I found is that with compiler explorer you can try out different compilers, and the code runs just a bit faster with icc.

pomerlef · 2020-12-02T14:35:30Z

YoshuaNava · 2020-12-02T14:50:04Z

@pomerlef I'm trying to reproduce the issue locally and on internal CI but I haven't been able to. On which platform are the jenkins tests failing?

pomerlef · 2020-12-02T14:52:59Z

all

pomerlef · 2020-12-02T15:06:18Z

It's seems to be related to 2D transformations. There are all binary test:

Error Message
/home/jenkins/workspace/pointmatcher/label/ubuntu-xenial/utest/ui/Transformations.cpp:39
Value of: transformedFeature.isApprox(transformedCloud.features.col(i), kEpsilonNumericalError)
  Actual: false
Expected: true
Stacktrace
/home/jenkins/workspace/pointmatcher/label/ubuntu-xenial/utest/ui/Transformations.cpp:39
Value of: transformedFeature.isApprox(transformedCloud.features.col(i), kEpsilonNumericalError)
  Actual: false
Expected: true

YoshuaNava · 2020-12-03T11:15:25Z

I quickly looked into the issue, and the problem appeared because the results are slightly different with the new method for transforming features. I'm not sure whether this is because:

The new method is similarly accurate to the one from the test introduced in Implemented an in-place method for transforming DataPoints objects #419, but a different implementation leads to slightly different results, with accuracy bounded by floating point representations, or
The new method is less accuracte.

This issue only pops up on my PC, with a fork of libpointmatcher, when I set the epsilon for error checking to 1e-13. On libpointmatcher from this repo (upstream), it happens with 1e-8.

I'm looking into this.

pomerlef · 2021-07-06T18:05:59Z

@YoshuaNava could you have a look to resolve conflicts?

pomerlef · 2023-03-17T20:08:56Z

@YoshuaNava could you verify the conflict?

Optimized transformations by transposing and then multiplying in place

289a906

Better comments

1eb4850

Adjust transformations unit test epsilon

520f67f

pomerlef added 2 commits February 12, 2021 08:45

Merge branch 'master' into feature/opt_transformations_transpose

25dc18c

Merge branch 'master' into feature/opt_transformations_transpose

4724a41

boxanm changed the base branch from master to develop November 4, 2023 14:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimized transformations by transposing and then multiplying in place #424

Optimized transformations by transposing and then multiplying in place #424

YoshuaNava commented Dec 2, 2020 •

edited

Loading

pomerlef commented Dec 2, 2020

YoshuaNava commented Dec 2, 2020 •

edited

Loading

pomerlef commented Dec 2, 2020

pomerlef commented Dec 2, 2020

YoshuaNava commented Dec 3, 2020 •

edited

Loading

pomerlef commented Jul 6, 2021

pomerlef commented Mar 17, 2023

Optimized transformations by transposing and then multiplying in place #424

Are you sure you want to change the base?

Optimized transformations by transposing and then multiplying in place #424

Conversation

YoshuaNava commented Dec 2, 2020 • edited Loading

pomerlef commented Dec 2, 2020

YoshuaNava commented Dec 2, 2020 • edited Loading

pomerlef commented Dec 2, 2020

pomerlef commented Dec 2, 2020

YoshuaNava commented Dec 3, 2020 • edited Loading

pomerlef commented Jul 6, 2021

pomerlef commented Mar 17, 2023

YoshuaNava commented Dec 2, 2020 •

edited

Loading

YoshuaNava commented Dec 2, 2020 •

edited

Loading

YoshuaNava commented Dec 3, 2020 •

edited

Loading