Support for int8 matrix multiplication #287
hchoi-moveworks started this conversation in General
Replies: 1 comment
- This work is ongoing. Performance is good, but not yet on par with CUTLASS (roughly 10-20% slower). We are working on a full pipeline to support quantization out of the box.
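For readers unfamiliar with what int8 matmul support entails, here is a minimal sketch in NumPy of the idea being discussed: symmetrically quantize float operands to int8, multiply with int32 accumulation, and rescale the result. This is an illustration of the general technique only, not this project's implementation; the function names and the per-tensor quantization scheme are assumptions for the example.

```python
import numpy as np

def quantize(x, num_bits=8):
    """Symmetric per-tensor quantization of a float matrix to int8 (illustrative)."""
    scale = np.abs(x).max() / (2 ** (num_bits - 1) - 1)  # map max |x| to 127
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

def int8_matmul(a, b):
    """Quantize both operands, multiply in int32 to avoid overflow, then dequantize."""
    qa, sa = quantize(a)
    qb, sb = quantize(b)
    acc = qa.astype(np.int32) @ qb.astype(np.int32)  # int32 accumulation
    return acc.astype(np.float32) * (sa * sb)        # rescale back to float

np.random.seed(0)
a = np.random.randn(64, 128).astype(np.float32)
b = np.random.randn(128, 32).astype(np.float32)
approx = int8_matmul(a, b)
exact = a @ b
# The quantized result tracks the float result within quantization error.
```

Production libraries such as CUTLASS run the int8 multiply on tensor-core hardware instructions rather than in software, which is where the performance comparison in this thread comes from.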
- Would there be support for optimizing models that leverage int8 matmul?