
Change auto contiguous to always insert contiguous #2038

Closed · wants to merge 85 commits
Conversation

@kahmed10 (Collaborator) commented Aug 4, 2023

No description provided.

@migraphx-bot (Collaborator) commented Aug 4, 2023

| Test | Batch | Rate new (e75181) | Rate old (368d59) | Diff | Compare |
|---|---|---|---|---|---|
| torchvision-resnet50 | 64 | 2,830.77 | 2,831.78 | -0.04% | |
| torchvision-resnet50_fp16 | 64 | 6,501.60 | 6,493.90 | 0.12% | |
| torchvision-densenet121 | 32 | 2,095.86 | 2,095.43 | 0.02% | |
| torchvision-densenet121_fp16 | 32 | 3,657.68 | 3,663.28 | -0.15% | |
| torchvision-inceptionv3 | 32 | 1,595.17 | 1,594.90 | 0.02% | |
| torchvision-inceptionv3_fp16 | 32 | 2,561.50 | 2,560.92 | 0.02% | |
| cadene-inceptionv4 | 16 | 721.53 | 721.62 | -0.01% | |
| cadene-resnext64x4 | 16 | 693.84 | 690.65 | 0.46% | |
| slim-mobilenet | 64 | 8,437.92 | 8,320.61 | 1.41% | |
| slim-nasnetalarge | 64 | 189.10 | 231.48 | -18.31% | 🔴 |
| slim-resnet50v2 | 64 | 2,693.74 | 2,662.58 | 1.17% | |
| bert-mrpc-onnx | 8 | 812.17 | 813.41 | -0.15% | |
| bert-mrpc-tf | 1 | 376.71 | 386.87 | -2.62% | |
| pytorch-examples-wlang-gru | 1 | 350.52 | 301.62 | 16.21% | 🔆 |
| pytorch-examples-wlang-lstm | 1 | 321.02 | 313.73 | 2.32% | |
| torchvision-resnet50_1 | 1 | 601.66 | 605.31 | -0.60% | |
| torchvision-inceptionv3_1 | 1 | 340.44 | 345.94 | -1.59% | |
| cadene-dpn92_1 | 1 | 402.28 | 401.41 | 0.22% | |
| cadene-resnext101_1 | 1 | 328.20 | 327.72 | 0.15% | |
| slim-vgg16_1 | 1 | 380.60 | 458.12 | -16.92% | 🔴 |
| slim-mobilenet_1 | 1 | 2,039.68 | 2,054.76 | -0.73% | |
| slim-inceptionv4_1 | 1 | 194.87 | 214.67 | -9.22% | 🔴 |
| onnx-taau-downsample | 1 | 285.74 | 303.28 | -5.78% | 🔴 |
| dlrm-criteoterabyte | 1 | 23.49 | 21.58 | 8.87% | 🔆 |
| dlrm-criteoterabyte_fp16 | 1 | 41.04 | 40.64 | 0.97% | |
| agentmodel | 1 | 6,387.25 | 6,036.02 | 5.82% | 🔆 |
| unet_fp16 | 2 | 54.56 | 54.73 | -0.31% | |
| resnet50v1_fp16 | 1 | 929.40 | 928.82 | 0.06% | |
| bert_base_cased_fp16 | 64 | 924.06 | 923.61 | 0.05% | |
| bert_large_uncased_fp16 | 32 | 290.39 | 290.25 | 0.05% | |
| bert_large_fp16 | 1 | 167.69 | 171.90 | -2.45% | |
| distilgpt2_fp16 | 16 | 1,514.84 | 1,512.92 | 0.13% | |

This build is not recommended to merge 🔴

@migraphx-bot (Collaborator) commented Aug 4, 2023


✅ bert-mrpc-onnx: PASSED: MIGraphX meets tolerance
✅ bert-mrpc-tf: PASSED: MIGraphX meets tolerance
✅ pytorch-examples-wlang-gru: PASSED: MIGraphX meets tolerance
✅ pytorch-examples-wlang-lstm: PASSED: MIGraphX meets tolerance
✅ torchvision-resnet50_1: PASSED: MIGraphX meets tolerance
✅ torchvision-inceptionv3_1: PASSED: MIGraphX meets tolerance
✅ cadene-dpn92_1: PASSED: MIGraphX meets tolerance
✅ cadene-resnext101_1: PASSED: MIGraphX meets tolerance
✅ slim-vgg16_1: PASSED: MIGraphX meets tolerance
✅ slim-mobilenet_1: PASSED: MIGraphX meets tolerance
✅ slim-inceptionv4_1: PASSED: MIGraphX meets tolerance
✅ dlrm-criteoterabyte: PASSED: MIGraphX meets tolerance
✅ agentmodel: PASSED: MIGraphX meets tolerance
✅ unet: PASSED: MIGraphX meets tolerance
✅ resnet50v1: PASSED: MIGraphX meets tolerance
✅ bert_base_cased_fp16: PASSED: MIGraphX meets tolerance
✅ bert_large_uncased_fp16: PASSED: MIGraphX meets tolerance
✅ bert_large: PASSED: MIGraphX meets tolerance
🔴 distilgpt2_fp16: FAILED: MIGraphX is not within tolerance - check verbose output

if(not ins->module_inputs().empty())
    m.replace_instruction(ins, ins->get_operator(), new_args, ins->module_inputs());
else
    m.replace_instruction(ins, ins->get_operator(), new_args);

You shouldn't replace the input arguments, as that can cause multiple contiguous operators to be inserted when an argument is used in multiple places. This should just do:

if(contains({"layout", "contigous"}, ins->name()))
    continue;
// for last instruction that is NOT a return
if(ins->outputs().empty() and ins != last)
    continue;
shape s = ins->get_shape();
if(s.dynamic())
    continue;
if(s.type() == shape::tuple_type)
    continue;
if(s.standard() and ins->name() == "@literal")
    continue;
auto c = m.insert_instruction(std::next(ins), make_op("contiguous"), ins);
m.replace_instruction(ins, c);
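
For orientation, this is roughly where the suggested body would sit inside the pass, stitched together from this suggestion and the hunk quoted further down (the signature and the last/loop lines are quoted context, not new code):

void auto_contiguous::apply(module& m) const
{
    auto last = std::prev(m.end());
    for(auto ins : iterator_for(m))
    {
        // ... the skip checks and contiguous insertion shown above ...
    }
}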

return in;
}
return m.insert_instruction(ins, make_op("contiguous"), in);
});

We should still have this until reshape copy is implemented.

@@ -59,17 +60,24 @@ void auto_contiguous::apply(module& m) const
     auto last = std::prev(m.end());
     for(auto ins : iterator_for(m))
     {
-        if(ins->name() == "layout")
+        if(contains({"layout", "contiguous", "@return", "@param", "@outline"}, ins->name()))

We shouldn't be skipping @param; is there a test that fails for that?

I don't think @outline is used anywhere.
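
If @param and @outline are dropped, the skip list would presumably shrink to something like this (a sketch only; whether @return stays is the author's call):

if(contains({"layout", "contiguous", "@return"}, ins->name()))
    continue;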

@@ -161,6 +162,18 @@ static void remove_contiguous(const std::string& op_name, module& m, F f)
     }
 }
 
+static void remove_contiguous_noops(const std::string& op_name, module& m)

I would spell it nop with one o.

auto r = m2.add_instruction(migraphx::make_op("reshape", {{"dims", {2, 1, 12, 5}}}), ca);
m2.add_return({r});
// extra contiguous coming from reshape logic which has "requires_std_shape" attribute
auto cb = m2.add_instruction(migraphx::make_op("contiguous"), ca);

We shouldn't have two contiguous operators here. Maybe we should check whether the outputs are all contiguous operators before inserting one.
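
A minimal sketch of that check, reusing the loop variables from the snippet quoted earlier (the std::all_of call and lambda are illustrative, not code from this PR):

// skip the insertion when every existing consumer of ins is already a contiguous op
if(not ins->outputs().empty() and
   std::all_of(ins->outputs().begin(), ins->outputs().end(), [](auto out) {
       return out->name() == "contiguous";
   }))
    continue;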

    continue;
if(s.standard() and ins->name() == "@literal")
    continue;
if(s.scalar() and not contains(ins->name(), "broadcast"))

I would think this would be if(s.scalar() and s.standard()).
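
In context, that condition would read something like the following (placement mirrors the hunk quoted above; a sketch, not the final change):

// only skip scalars whose layout is already standard
if(s.scalar() and s.standard())
    continue;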

@pfultz2 pfultz2 mentioned this pull request Aug 21, 2023
@TedThemistokleous TedThemistokleous self-requested a review August 21, 2023 19:33
@kahmed10 kahmed10 closed this Aug 30, 2023

codecov bot commented Jan 11, 2024

Codecov Report

Attention: 3 lines in your changes are missing coverage. Please review.

Comparison is base (9941849) 91.43% compared to head (e751814) 91.30%.
Report is 4 commits behind head on develop.

| Files | Patch % | Lines |
|---|---|---|
| src/propagate_constant.cpp | 40.00% | 3 Missing ⚠️ |
Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #2038      +/-   ##
===========================================
- Coverage    91.43%   91.30%   -0.14%     
===========================================
  Files          461      461              
  Lines        17419    17451      +32     
===========================================
+ Hits         15927    15933       +6     
- Misses        1492     1518      +26     


@kahmed10 kahmed10 closed this Jan 15, 2024
@TedThemistokleous (Collaborator)

Why close this out? Guessing it either needs a rebase and a push forward, or it's no longer needed?

Development

Successfully merging this pull request may close these issues.

Non-standard reshape fail during reshape_lazy lowering
6 participants