diff --git a/previews/PR44/post/2024-05-27-cuda_5.4/index.html b/previews/PR44/post/2024-05-27-cuda_5.4/index.html index 7afcae4..e77643e 100644 --- a/previews/PR44/post/2024-05-27-cuda_5.4/index.html +++ b/previews/PR44/post/2024-05-27-cuda_5.4/index.html @@ -295,7 +295,7 @@
To keep this post from becoming even longer, a quick rundown of other changes:
These changes should make it easier to port applications to the GPU by incrementally moving parts of the codebase to the GPU without having to worry about the performance of accessing memory from the CPU. The only requirement is to use unified memory, e.g., by calling cu
with unified=true
, or setting a global CUDA.jl preference to use unified memory by default. However, as unified memory comes with a slight cost, and results in synchronous allocation behavior, it is still recommended to switch back to regular device memory when your application has been fully ported to the GPU.
These changes should make it easier to port applications to the GPU by incrementally moving parts of the codebase to the GPU without having to worry about the performance of accessing memory from the CPU. The only requirement is to use unified memory, e.g., by calling cu
with unified=true
, or setting the CUDA.jl preference default_memory
to use unified memory by default. However, as unified memory comes with a slight cost, and results in synchronous allocation behavior, it is still recommended to switch back to regular device memory when your application has been fully ported to the GPU.
To keep this post from becoming even longer, a quick rundown of other changes: