This repository has been archived by the owner on Aug 30, 2024. It is now read-only.
[BesTLA] Improve RTN quantization accuracy of int4 and int3 #568
Job | Run time |
---|---|
1m 43s | |
1m 18s | |
1m 27s | |
1m 28s | |
11m 46s | |
17m 42s |
Job | Run time |
---|---|
1m 43s | |
1m 18s | |
1m 27s | |
1m 28s | |
11m 46s | |
17m 42s |