
initialize zp, scale loaded from HF quantizer, applying quant_config #84

Closed
wants to merge 2 commits

Conversation

horheynm (Member) commented Jun 14, 2024

When base models are loaded using the HF quantizer, the weights are frozen but the zero-point (zp) and scale attributes do not exist. If the model is not already quantized, set the status to Initialize so that zp and scale are created.

Tests in HF quantizer:
neuralmagic/transformers#102
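
For illustration, a minimal sketch of what initializing missing quantization parameters on a single module could look like; the helper name and the weight_scale / weight_zero_point parameter names are assumptions for this sketch and do not reflect the actual PR diff:

```python
# Minimal sketch (not the PR's code): attach default scale / zero-point
# parameters to a module whose weights were loaded frozen by the HF quantizer.
import torch
from torch.nn import Module, Parameter


def init_missing_qparams(module: Module) -> None:
    # Parameter names are assumed for illustration only.
    device = module.weight.device
    if not hasattr(module, "weight_scale"):
        # Neutral scale of 1.0; an observer pass can refine it later.
        module.register_parameter(
            "weight_scale",
            Parameter(torch.ones(1, device=device), requires_grad=False),
        )
    if not hasattr(module, "weight_zero_point"):
        # Default zero-point of 0, as used for symmetric quantization.
        module.register_parameter(
            "weight_zero_point",
            Parameter(torch.zeros(1, dtype=torch.int8, device=device), requires_grad=False),
        )
```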

horheynm marked this pull request as a draft on June 14, 2024, 17:39
markurtz (Member) commented

@dsikka can you confirm that this will be handled by your observer refactor and we'll be able to close this out?

dsikka (Contributor) commented Oct 22, 2024

Is this for a dense model? Why are the zero-points and scales missing?
Please expand the PR description and provide an example model and code snippet demonstrating the issue @horheynm
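
For illustration only, the kind of check being requested might look like the sketch below; the model ID, the quantization_scheme marker, and the weight_scale / weight_zero_point attribute names are placeholders, not a confirmed reproduction:

```python
# Hypothetical sketch: load a model through transformers and report quantized
# modules that lack scale / zero-point attributes.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("org/quantized-model")  # placeholder model ID

for name, module in model.named_modules():
    # "quantization_scheme" is an assumed marker for quantized modules.
    if hasattr(module, "quantization_scheme"):
        missing = [a for a in ("weight_scale", "weight_zero_point") if not hasattr(module, a)]
        if missing:
            print(f"{name} is missing: {missing}")
```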
