[ONNX] Fix sporadic results in BC #3081
Conversation
547 job - ONNX
549 ptq - passed
```diff
-    ra = fns.where(qval < level_high, qval / (qval - level_high) * right_border, left_border)
+    with warnings.catch_warnings():
+        # If `qval` is 0, `rb` will equal `right_border`, and we don't want to show an
+        # unnecessary division-by-zero warning. The same applies to (qval - level_high).
+        warnings.simplefilter("ignore")
+        ra_then_result = qval / (qval - level_high) * right_border
+        rb_then_result = (qval - level_high) / qval * left_border
+        ra = fns.where(qval < level_high, ra_then_result, left_border)
+        rb = fns.where(qval > 0.0, rb_then_result, right_border)
```
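For context, a minimal standalone sketch of the pattern above, with plain `numpy` standing in for `fns` and illustrative values (not the actual NNCF code):

```python
import warnings

import numpy as np

qval = np.array([0.0, 64.0, 255.0])
level_high = 255.0
left_border, right_border = -1.0, 1.0

with warnings.catch_warnings():
    # np.where evaluates both branches eagerly, so the divisions below run for
    # every element, including the ones the mask later discards; without the
    # suppression, qval == 0 and qval == level_high emit RuntimeWarnings.
    warnings.simplefilter("ignore")
    ra_then_result = qval / (qval - level_high) * right_border
    rb_then_result = (qval - level_high) / qval * left_border
    ra = np.where(qval < level_high, ra_then_result, left_border)
    rb = np.where(qval > 0.0, rb_then_result, right_border)

print(ra, rb)
```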
Could you please expand `def test_tune_range_zero_division_warning():`?
Done. The test passes on this PR and fails on develop.
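For illustration, a regression test along these lines might look like the following sketch. The `tune_range` signature, the `Tensor` import path, and the border values that trigger the zero divisions are all assumptions here, not the actual test from this PR:

```python
import warnings

import numpy as np

from nncf.quantization.fake_quantize import tune_range  # assumed import path
from nncf.tensor import Tensor  # assumed import path


def test_tune_range_zero_division_warning():
    # Borders chosen so that the internal `qval` plausibly hits 0 and
    # `level_high`, the two cases that used to emit RuntimeWarnings.
    left_border = Tensor(np.array([0.0, -1.0], dtype=np.float32))
    right_border = Tensor(np.array([1.0, 0.0], dtype=np.float32))
    with warnings.catch_warnings():
        warnings.simplefilter("error")  # turn any warning into a test failure
        tune_range(left_border, right_border, num_bits=8, unify_zp=False)
```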
@kshpv, feel free to merge this PR as you need.
nncf/quantization/fake_quantize.py (outdated)
```python
warnings.simplefilter("ignore")
rb_then_result = (qval - level_high) / qval * left_border
# Avoid division by zero
qval_nonzero = fns.where(qval == 0, fns.ones_like(qval), qval)
```
Please check the performance of the function after your changes. @nikita-savelyevv, it looks like we discussed this already. Could you remind us of the solution?
As I remember, ignoring the warning was the solution with the least impact on performance.
So, should I roll back or keep this one?
I'm inclined towards moving the line under the `catch_warnings` context manager.
Perf measurement:
- with the `warnings` context manager: ~17.648 sec
- without the context manager, but with the following: ~20.809 sec

```python
qval_nonzero = fns.where(qval == 0, fns.ones_like(qval), qval)
qval_not_high = fns.where(qval - level_high == 0, fns.ones_like(qval), qval - level_high)
```
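For reference, a micro-benchmark along these lines could produce such numbers (an illustrative sketch with plain `numpy` in place of `fns`; the array size, values, and repetition count are assumptions):

```python
import timeit
import warnings

import numpy as np

qval = np.random.uniform(0.0, 255.0, size=10_000_000).round()
level_high = 255.0
left_border, right_border = -1.0, 1.0

def with_warning_suppression():
    # Divide everywhere, silencing the divide-by-zero RuntimeWarning.
    with warnings.catch_warnings():
        warnings.simplefilter("ignore")
        rb_then = (qval - level_high) / qval * left_border
    return np.where(qval > 0.0, rb_then, right_border)

def with_masked_denominator():
    # Pre-mask the denominator so the division never hits zero.
    qval_nonzero = np.where(qval == 0, 1.0, qval)
    rb_then = (qval - level_high) / qval_nonzero * left_border
    return np.where(qval > 0.0, rb_then, right_border)

print("suppress:", timeit.timeit(with_warning_suppression, number=20))
print("mask:    ", timeit.timeit(with_masked_denominator, number=20))
```

The masked variant pays for an extra full-array `where` pass per denominator, which is consistent with the slowdown reported above.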
@kshpv, you can use just `1.0` instead of `fns.ones_like(qval)`:

```python
qval_nonzero = fns.where(qval == 0, 1.0, qval)
qval_not_high = fns.where(qval - level_high == 0, 1.0, qval - level_high)
```
I checked your proposed version, and it has the same performance as the one with `fns.ones_like(qval)` :(
It's more about avoiding the creation of extra tensor instances than about performance.
I suggest using the implementation with the best performance.
Rolled back.
Found an open issue describing the same problem in ONNX Runtime: microsoft/onnxruntime#21922
LGTM
Changes
This PR addresses an issue with `ONNXRuntime==1.19.2` where a tensor used as both an input and an output of a model shares the same memory. This causes unexpected behavior: updating the input tensor inadvertently modifies the statistics data due to the memory overlap.

The issue was confirmed by calling `np.shares_memory(input_data['image'], outputs['image'])`, which returned `True`, indicating that the input and output tensors share memory. After applying the proposed changes, the same check returns `False`, confirming that the memory sharing is resolved.

To fix this, the `ONNXEngine` logic has been updated to create a copy of any output tensor that is also used as a model input. This ensures that the input tensor and the statistics data remain independent, avoiding unintended side effects (see the sketch below).

Other changes:
- Merge `RawReducer` and `NoopReducer`
- Minor fixes (remove warnings + fix a bug in BC)
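A minimal NumPy sketch of the aliasing check and the copy-based fix (the `image` tensor name follows the description above; the dict comprehension illustrates the idea behind the `ONNXEngine` change, not its actual code):

```python
import numpy as np

# Reproduce the aliasing symptom: the runtime returns an output backed by
# the same buffer as the input, so mutating the input corrupts statistics.
input_data = {"image": np.zeros((1, 3, 4, 4), dtype=np.float32)}
outputs = {"image": input_data["image"]}  # aliased output, as observed

assert np.shares_memory(input_data["image"], outputs["image"])

# The fix: copy any output tensor whose name is also a model input,
# breaking the aliasing between inputs and collected outputs.
input_names = set(input_data)
outputs = {name: (t.copy() if name in input_names else t) for name, t in outputs.items()}

assert not np.shares_memory(input_data["image"], outputs["image"])
```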
Reason for changes
Regression
Related tickets
156025
Tests
PTQ run 549