This repository has been archived by the owner on Aug 30, 2024. It is now read-only.
[BesTLA] Improve RTN quantization accuracy of int4 and int3 #563
| Job | Run time |
|---|---|
| | 2m 2s |
| | 1m 27s |
| | 1m 28s |
| | 1m 13s |
| | 11m 41s |
| Total | 17m 51s |