Incorrect type in output of `utils.pad_across_processes` when input is `torch.bool` #3218

mariusarvinte · 2024-11-04T22:47:56Z

System Info

- `Accelerate` version: 1.1.0
- Platform: Linux-6.8.0-45-generic-x86_64-with-glibc2.35
- `accelerate` bash location: .venv/bin/accelerate
- Python version: 3.11.10
- Numpy version: 2.1.1
- PyTorch version (GPU?): 2.4.1+cu121 (True)
- PyTorch XPU available: False
- PyTorch NPU available: False
- PyTorch MLU available: False
- PyTorch MUSA available: False
- System RAM: 755.50 GB
- GPU type: NVIDIA RTX 6000 Ada Generation
- `Accelerate` default config:
        Not found

Information

The official example scripts
My own modified scripts

Tasks

One of the scripts in the examples/ folder of Accelerate or an officially supported no_trainer script in the examples folder of the transformers repo (such as run_no_trainer_glue.py)
My own task or dataset (give details below)

Reproduction

Running the following code using accelerate launch example.py

import torch
from accelerate import Accelerator
from accelerate.utils import pad_across_processes

accelerator = Accelerator()

process_tensor = (torch.randn(2, 100*(accelerator.process_index + 1)) > 0).to(accelerator.device)
print(f"{process_tensor.shape = }, {process_tensor.dtype = }, {accelerator.process_index = }")

padded_tensor = pad_across_processes(process_tensor, dim=1)
print(f"{padded_tensor.shape = }, {padded_tensor.dtype = }, {accelerator.process_index = }")

On a machine with at least two GPUs will output (example for three GPUs):

process_tensor.shape = torch.Size([2, 300]), process_tensor.dtype = torch.bool, accelerator.process_index = 2
process_tensor.shape = torch.Size([2, 100]), process_tensor.dtype = torch.bool, accelerator.process_index = 0
process_tensor.shape = torch.Size([2, 200]), process_tensor.dtype = torch.bool, accelerator.process_index = 1

padded_tensor.shape = torch.Size([2, 300]), padded_tensor.dtype = torch.bool, accelerator.process_index = 2
padded_tensor.shape = torch.Size([2, 300]), padded_tensor.dtype = torch.int64, accelerator.process_index = 0
padded_tensor.shape = torch.Size([2, 300]), padded_tensor.dtype = torch.int64, accelerator.process_index = 1

The padded tensors have the incorrect data type of torch.int64 and there is cross-device mismatch, which will further make downstream (e.g., gather) ops freeze and hard to debug.

Expected behavior

The output tensor should have the same dtype on all devices, and it should be the same as the input dtype

The text was updated successfully, but these errors were encountered:

mariusarvinte · 2024-11-04T22:51:53Z

I'll have a PR for this soon

mariusarvinte linked a pull request Nov 5, 2024 that will close this issue

Ensure explicit output dtype for pad_across_processes #3219

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Incorrect type in output of `utils.pad_across_processes` when input is `torch.bool` #3218

Incorrect type in output of `utils.pad_across_processes` when input is `torch.bool` #3218

mariusarvinte commented Nov 4, 2024 •

edited

Loading

mariusarvinte commented Nov 4, 2024

Incorrect type in output of utils.pad_across_processes when input is torch.bool #3218

Incorrect type in output of utils.pad_across_processes when input is torch.bool #3218

Comments

mariusarvinte commented Nov 4, 2024 • edited Loading

System Info

Information

Tasks

Reproduction

Expected behavior

mariusarvinte commented Nov 4, 2024

Incorrect type in output of `utils.pad_across_processes` when input is `torch.bool` #3218

Incorrect type in output of `utils.pad_across_processes` when input is `torch.bool` #3218

mariusarvinte commented Nov 4, 2024 •

edited

Loading