can use multi-boxes prompt to fine-tune SAM? #775

wu2233 · 2024-09-18T08:56:14Z

 Hello, I hope to use multi-boxes prompts to fine-tune SAM (not for prediction). Assuming my training batch size is set to 2, that is, two images, and each image has 3 prompt boxes, so I created my prompt tensor with input_boxes = torch.randn(2,3,4).to('cuda'), but I encountered this error in the prompt_encoder.py:

sparse_embeddings = torch.cat([sparse_embeddings, box_embeddings], dim=1)
RuntimeError: Sizes of tensors must match except in dimension 1. Expected size 2 but got size 6 for tensor number 1 in the list.
I am not sure the shape of 'input boxes tensor' should be (3,4) or (2,3,4). The former is OK for the program, but the latter is throwing the error. I hope to get some help, thank you.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

can use multi-boxes prompt to fine-tune SAM? #775

can use multi-boxes prompt to fine-tune SAM? #775

wu2233 commented Sep 18, 2024

can use multi-boxes prompt to fine-tune SAM? #775

can use multi-boxes prompt to fine-tune SAM? #775

Comments

wu2233 commented Sep 18, 2024