In the incomplete Cholesky factorization routine, a significant amount of time is spent in the getindex() and setindex!() routines. This seems to be related to computing the entries of L. We need to investigate a more GPU-friendly way of doing this.
Candidate algorithms:
- We could just use NVIDIA's incomplete Cholesky factorization routine (with zero fill-in), but its performance could differ substantially from ours because the two implementations are quite different.
- We could investigate a GPU-friendly way of performing the computation ourselves, focusing especially on small matrices.
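As a starting point for the second option, here is a minimal sketch of a right-looking, zero fill-in incomplete Cholesky on a small dense matrix. It is written in plain Python only to make the data flow easy to read; it is not our current routine, and the name `ichol0_dense` and the list-of-lists storage are assumptions made for illustration. The point is that, for a fixed column `k`, the column scaling and the trailing update touch independent entries, so on a GPU each entry could be assigned to its own thread instead of going through per-element indexing calls.

```python
import math


def ichol0_dense(A):
    """Zero fill-in incomplete Cholesky of a small symmetric positive definite matrix.

    A is a list of lists (hypothetical storage chosen for this sketch).
    Returns the lower-triangular factor L. Off-diagonal entries that are
    zero in A stay zero in L, i.e. no fill-in is allowed.
    """
    n = len(A)
    # Copy the lower triangle of A; the upper triangle of L is zero.
    L = [[float(A[i][j]) if j <= i else 0.0 for j in range(n)] for i in range(n)]

    for k in range(n):
        if L[k][k] <= 0.0:
            raise ValueError("non-positive pivot in column %d" % k)
        L[k][k] = math.sqrt(L[k][k])

        # Scale column k. Every row i is independent of the others.
        for i in range(k + 1, n):
            if A[i][k] != 0.0:
                L[i][k] /= L[k][k]

        # Trailing update. Every (i, j) pair is independent of the others.
        # Entries outside the sparsity pattern of A are skipped (no fill-in);
        # the diagonal is always updated.
        for j in range(k + 1, n):
            for i in range(j, n):
                if i == j or A[i][j] != 0.0:
                    L[i][j] -= L[i][k] * L[j][k]

    return L
```

For example, with `A = [[4, 2, 2], [2, 5, 0], [2, 0, 5]]` an exact Cholesky factorization would produce a nonzero entry at position (3, 2) of L, whereas the sketch above leaves it at zero because the corresponding entry of A is zero. On a tridiagonal matrix, where exact Cholesky creates no fill-in, the result coincides with the exact factor.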