Optimize square root instruction #221

edubart · 2024-04-02T11:25:24Z

Context

Currently fsqrt.d is one of the slowest instructions that can be called from userspace; for instance, it is roughly 3x slower than other floating-point instructions. We could improve it. For dapps running untrusted RISC-V code, this instruction could be abused to make dapp validation intentionally slower.

Measurements:

fsqrt.d                                        60.309 MIPS    42852 ucycles
fmul.d                                        152.402 MIPS     6964 ucycles
mul                                           494.966 MIPS     6434 ucycles

I added the speed of other instructions for reference. Also, fsqrt.d seems to take a large number of microarchitecture cycles.

EDIT: It seems fdiv.d runs a loop of 128 iterations in the uarch due to our 128-bit implementation; we could also optimize that.

Possible solutions

Our current implementation uses Newton's method to find the square root, with many iterations. It seems "Berkeley SoftFloat" gets away without for loops by using a fast inverse square root, possibly inspired by Quake's famous fast inverse square root. We could investigate how this is done; removing for loops would be the ideal case for running in the microarchitecture.
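To make the idea concrete, here is a minimal sketch of a loop-free square root: a bit-trick estimate of 1/sqrt(x), followed by a fixed, unrolled number of Newton-Raphson refinements. This is not Berkeley SoftFloat's code and not our implementation. The function names are hypothetical, the magic constant is the one commonly quoted for the 64-bit variant of the Quake trick, and the sketch uses host floating point purely for illustration. A real soft-float version would presumably use integer arithmetic only and would also need correct rounding and handling of zero, negative, subnormal, infinite and NaN inputs.

```python
import struct

# Assumption: commonly quoted magic constant for the 64-bit variant of the
# "fast inverse square root" trick. Not taken from Berkeley SoftFloat.
MAGIC_F64 = 0x5FE6EB50C7B537A9


def rsqrt_estimate(x: float) -> float:
    """Rough 1/sqrt(x) guess from the bit pattern of a positive, normal double."""
    bits = struct.unpack("<Q", struct.pack("<d", x))[0]
    guess_bits = MAGIC_F64 - (bits >> 1)
    return struct.unpack("<d", struct.pack("<Q", guess_bits))[0]


def newton_step(x: float, y: float) -> float:
    """One Newton-Raphson refinement of y ~ 1/sqrt(x)."""
    return y * (1.5 - 0.5 * x * y * y)


def sqrt_fixed_steps(x: float) -> float:
    """Approximate sqrt(x) with four unrolled refinements and no loop.

    Only meant for positive, normal, finite inputs; the result is close to
    sqrt(x) but is not guaranteed to be correctly rounded.
    """
    y = rsqrt_estimate(x)
    y = newton_step(x, y)
    y = newton_step(x, y)
    y = newton_step(x, y)
    y = newton_step(x, y)
    return x * y  # sqrt(x) = x * (1 / sqrt(x))
```

The initial guess is only accurate to a few percent, but each refinement roughly squares the relative error, so a small fixed number of steps is enough. That fixed step count is what would let the uarch avoid a data-dependent loop.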

edubart added the enhancement and optimization labels Apr 2, 2024

edubart self-assigned this Apr 2, 2024

edubart added this to Machine Emulator SDK Apr 2, 2024

github-project-automation bot moved this to Todo in Machine Emulator SDK Apr 2, 2024

edubart removed their assignment Sep 4, 2024
