Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Fix local work size for conv kernel yxfb_yxio_b16 with fp16 (openvino…
…toolkit#11679) convolution_gpu_yxfb_yxio_b16 for fp16 has hardcoded reqd_work_group_size to (16, 1, 1). On devices where CL_DEVICE_MAX_WORK_GROUP_SIZE is 512 GetOptimalLocalWorkGroupSizes picks (16, 2, 1) for LWS. That causes issues during clEnqueueNDRangeKernel since LWS doesn't match with reqd_work_group_size in the kernel.
- Loading branch information