Removing cl_khr_fp16 #264
Comments
It was inserted in ac3adc9 to solve #131. @vorj @nsakabe-fixstars Can you remember and explain the reason?
No, problematic types are
I really think so, but NVIDIA doesn't.
@vorj Thanks. Your previous comment
must be rechecked on the current Titan V (CUDA 10). Furthermore, if the problem only involves the copy operation, we could, for example, treat fp16 as int16. I remember there were some reasons that prevented such solutions, but I don't remember what they actually were. In any case, I don't think the investigation into ways to avoid it was thorough. At that time, passing the tests took priority over a sufficient investigation.
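To illustrate the "treat fp16 as int16" idea for copies, here is a minimal sketch in plain Python. It is not this project's OpenCL code. The helper names (`halves_to_bits`, `copy_as_int16`, `bits_to_halves`) are hypothetical, and the loop only stands in for a device-side copy kernel. The point it shows is that a copy never has to interpret the value as a floating-point number, so in principle it only needs 16-bit integer loads and stores.

```python
import struct
from array import array


def halves_to_bits(values):
    """Encode floats as IEEE 754 binary16 and return the raw 16-bit patterns."""
    raw = struct.pack(f"<{len(values)}e", *values)
    return array("H", struct.unpack(f"<{len(values)}H", raw))


def bits_to_halves(bits):
    """Reinterpret raw 16-bit patterns as binary16 values."""
    raw = struct.pack(f"<{len(bits)}H", *bits)
    return list(struct.unpack(f"<{len(bits)}e", raw))


def copy_as_int16(src_bits):
    """Element-wise copy that only touches 16-bit integers.

    Hypothetical stand-in for a copy kernel whose element type is a
    16-bit integer instead of half.
    """
    dst = array("H", [0] * len(src_bits))
    for i, word in enumerate(src_bits):
        dst[i] = word
    return dst


# All of these values are exactly representable in binary16.
src = halves_to_bits([1.0, -2.5, 0.0, 65504.0])
dst = copy_as_int16(src)
restored = bits_to_halves(dst)
```

Because the copy is bit-exact, the values read back from `dst` are identical to the originals. Whether the same trick applies to the project's generated kernels depends on the reasons mentioned above that are no longer remembered.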
Please see the secondary machine error log for #260.
It seems that NVIDIA doesn't allow copying
#269 fixed this issue, so I am closing it.
As of #224 and #261, the current solution for "half" needs cl_khr_fp16.
However, in theory it should not need the extension, so this requirement could be removed.