Allow Shortcutting Min-max Observer #887

kylesayrs · 2024-11-01T18:10:05Z

Purpose

Speedup runtime of algorithms which require many observations with no averaging such as [GPTQ] Iterative Parameter Updating #863

Changes

Renamed MovingAverageMinMaxObserver -> MinMaxObserver since moving average is not required to use it
Shortcut averaging logic by checking self.averaging_constant == 1.0
Update docstrings, ect.

Testing

Ran examples/quantization_w4a16/llama3_example.py to completion

Signed-off-by: Kyle Sayers <[email protected]>

github-actions · 2024-11-01T18:10:15Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Signed-off-by: Kyle Sayers <[email protected]>

dsikka · 2024-11-22T19:06:13Z

src/llmcompressor/modifiers/quantization/calibration.py

        observer = Observer.load_from_registry(
-            observer, quantization_args=quantization_args
+            quantization_args.observer, quantization_args=quantization_args


I think we should be consistent in how we're fetching the observer - either use the get_observer method or remove it and do it how you're doing it here.

Personally in favor of removing the get_observer now that observer refactor work is done

#939

kylesayrs added 2 commits November 1, 2024 18:02

change defaults and name

70f4069

Signed-off-by: Kyle Sayers <[email protected]>

update docstring, typehints

56214ae

Signed-off-by: Kyle Sayers <[email protected]>

kylesayrs added 2 commits November 1, 2024 18:11

change defaulting averaging_constant

6049f4f

Signed-off-by: Kyle Sayers <[email protected]>

update docstring

687b3e9

Signed-off-by: Kyle Sayers <[email protected]>

kylesayrs marked this pull request as ready for review November 4, 2024 21:46

kylesayrs self-assigned this Nov 4, 2024

kylesayrs added 2 commits November 19, 2024 13:47

Merge branch 'main' into kylesayrs/min-max-defaults

5c9c553

Merge branch 'main' into kylesayrs/min-max-defaults

1df0705

dsikka approved these changes Nov 21, 2024

View reviewed changes

Merge branch 'main' into kylesayrs/min-max-defaults

d1ecec9

dsikka reviewed Nov 22, 2024

View reviewed changes

Merge branch 'main' into kylesayrs/min-max-defaults

e2c3f89

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow Shortcutting Min-max Observer #887

Allow Shortcutting Min-max Observer #887

kylesayrs commented Nov 1, 2024 •

edited

Loading

github-actions bot commented Nov 1, 2024

dsikka Nov 22, 2024

kylesayrs Nov 27, 2024

Allow Shortcutting Min-max Observer #887

Are you sure you want to change the base?

Allow Shortcutting Min-max Observer #887

Conversation

kylesayrs commented Nov 1, 2024 • edited Loading

Purpose

Changes

Testing

github-actions bot commented Nov 1, 2024

dsikka Nov 22, 2024

Choose a reason for hiding this comment

kylesayrs Nov 27, 2024

Choose a reason for hiding this comment

kylesayrs commented Nov 1, 2024 •

edited

Loading