You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Merge first PR Cufft primitive #204 that uses CUFFT as backend for FFTs and does the spectral extensions and folding in cuda kernels
Merge all kernels in one to have parallel ffts in one kernel
Handle reality case with C2R transforms
For the multi GPU one
Discuss load balancing
There is not way to distribute a healpix array in a clean way .. distributing must not have conditions on local slice index otherwise it will not scale
Distributing probably requires having a seperate padding function, that transforms a healpix array into a padded one with the shape of the corresponding FTM
Padded healpix array remains padded at all times to make sure that it is always distributable
FLM are squares so they can be easily distributed but only in 1 dimension since the other is always (number of rings) is always odd
Implement padding operator
The text was updated successfully, but these errors were encountered:
Quick roadmap on the multi GPU healpix FFT
For the single GPU one
For the multi GPU one
The text was updated successfully, but these errors were encountered: