Enable multi-device for efficientnet #29989

jla524 · 2024-04-02T06:29:59Z

What does this PR do?

Fixes #29786 (issue)

Tested on two systems, one with 2x RTX 3060 and one with 2x RTX 3090

$ pytest tests/models/efficientnet/test_modeling_efficientnet.py -k "parallel or offload"
=================================================================== test session starts ===================================================================
platform linux -- Python 3.10.13, pytest-7.4.4, pluggy-1.4.0
rootdir: /repos/transformers
configfile: pyproject.toml
plugins: hypothesis-6.98.10, xdist-3.5.0, timeout-2.3.1, anyio-4.3.0
collected 108 items / 101 deselected / 7 selected                                                                                                         

tests/models/efficientnet/test_modeling_efficientnet.py .......                                                                                     [100%]

<warnings redacted>
===================================================== 7 passed, 101 deselected, 12 warnings in 8.85s ======================================================

Who can review?

@amyeroberts

ArthurZucker · 2024-04-02T08:49:48Z

src/transformers/models/efficientnet/modeling_efficientnet.py

@@ -484,6 +484,7 @@ class EfficientNetPreTrainedModel(PreTrainedModel):
    config_class = EfficientNetConfig
    base_model_prefix = "efficientnet"
    main_input_name = "pixel_values"
+    _no_split_modules = []


Suggested change

_no_split_modules = []

_no_split_modules = ["EfficientNetBlock"]

any reason not to set a correct module to not split?

@ArthurZucker I don't think it's necessary for there to be a module defined - this is the case for some of our models already in the library e.g. Camembert.

It is strange not defining "EfficientNetBlock" is OK though, as the block uses a residual connection, which requires the two tensors to be on the same device (as then so too the weights).

Alright, it's not necessary, but for small GPU it helps. We'll see how it goes

feat: enable mult-idevice for efficientnet

feat: enable mult-idevice for efficientnet

5844b8d

ArthurZucker reviewed Apr 2, 2024

View reviewed changes

ArthurZucker approved these changes Apr 2, 2024

View reviewed changes

amyeroberts merged commit 03732de into huggingface:main Apr 3, 2024
17 checks passed

amyeroberts mentioned this pull request Apr 3, 2024

Community contribution: enabling device_map="auto" support for more vision and multimodal models #29786

Open

59 tasks

jla524 deleted the efficientnet_multidevice branch April 12, 2024 05:26

ArthurZucker pushed a commit that referenced this pull request Apr 22, 2024

Enable multi-device for efficientnet (#29989)

b9a19a6

feat: enable mult-idevice for efficientnet

itazap pushed a commit that referenced this pull request May 14, 2024

Enable multi-device for efficientnet (#29989)

2fcb56a

feat: enable mult-idevice for efficientnet

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable multi-device for efficientnet #29989

Enable multi-device for efficientnet #29989

jla524 commented Apr 2, 2024

ArthurZucker Apr 2, 2024

amyeroberts Apr 2, 2024

ArthurZucker Apr 2, 2024 •

edited

Loading

	_no_split_modules = []
	_no_split_modules = ["EfficientNetBlock"]

Enable multi-device for efficientnet #29989

Enable multi-device for efficientnet #29989

Conversation

jla524 commented Apr 2, 2024

What does this PR do?

Who can review?

ArthurZucker Apr 2, 2024

Choose a reason for hiding this comment

amyeroberts Apr 2, 2024

Choose a reason for hiding this comment

ArthurZucker Apr 2, 2024 • edited Loading

Choose a reason for hiding this comment

ArthurZucker Apr 2, 2024 •

edited

Loading