Order of powers of x makes depthwise separable convolution behave poorly #8

danyk98 · 2022-05-26T23:11:59Z

Since x = cat([(x ** i) for i in range(1, self.q + 1)], dim=1) stacks the newly created powers of x along the channel dimension, depthwise separable convolution behaves poorly: each filter passes over a mixture of channels and powers. For example, with C_in=2 and q=3, the channel axis looks like: x₁, x₂, x₁², x₂², x₁³, x₂³.

With groups=2 in the depthwise convolution, the first filter would pass over x₁, x₂, x₁², and the second filter would pass over x₂², x₁³, x₂³, so neither filter sees all the powers of a single input channel.
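To make the grouping concrete, here is a minimal pure-Python sketch of how the channel axis is split. It uses string labels instead of tensors, and the helper names channel_labels and split_into_groups are made up for illustration; they are not part of fastonn. It assumes a grouped convolution partitions its input channels into contiguous, equally sized blocks.

```python
def channel_labels(in_channels, q):
    """Labels in the order produced by cat([x ** i for i in 1..q], dim=1):
    all channels at power 1, then all channels at power 2, and so on."""
    return [f"x{c}^{p}" for p in range(1, q + 1) for c in range(1, in_channels + 1)]


def split_into_groups(labels, groups):
    """Contiguous, equally sized channel blocks, one per convolution group."""
    size = len(labels) // groups
    return [labels[g * size:(g + 1) * size] for g in range(groups)]


labels = channel_labels(in_channels=2, q=3)
for g, block in enumerate(split_into_groups(labels, groups=2)):
    print(f"group {g}: {block}")
# group 0: ['x1^1', 'x2^1', 'x1^2']
# group 1: ['x2^2', 'x1^3', 'x2^3']
```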

I fixed this by rearranging the output of this line with fancy indexing:
permutation = [(i * self.in_channels) % (self.in_channels * self.q) + i // self.q for i in range(self.in_channels * self.q)]
x = cat([(x ** i) for i in range(1, self.q + 1)], dim=1)[:, permutation]

The channel axis now looks like x₁, x₁², x₁³, x₂, x₂², x₂³, so the filters pass over the powers of x₁ and x₂ individually.
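As a sanity check, the permutation can be compared against the intended mapping without any tensors. The sketch below is pure Python; issue_permutation copies the formula above with self.in_channels and self.q turned into plain arguments, and expected_permutation is a hypothetical reference written only for this check. The target position c * q + p (channel c, power index p, both zero-based) should read from source index p * in_channels + c.

```python
def issue_permutation(in_channels, q):
    """The index list from the fix above, with plain arguments."""
    total = in_channels * q
    return [(i * in_channels) % total + i // q for i in range(total)]


def expected_permutation(in_channels, q):
    """Reference mapping: for each channel c, list its powers in order.
    In the cat(...) output, power index p of channel c sits at p * in_channels + c."""
    return [p * in_channels + c for c in range(in_channels) for p in range(q)]


print(issue_permutation(2, 3))  # [0, 2, 4, 1, 3, 5]

# The two agree for every small combination tried here.
for c_in in range(1, 6):
    for q in range(1, 6):
        assert issue_permutation(c_in, q) == expected_permutation(c_in, q)
```

If indexing is not required, a possible alternative is to build the channel-major order directly with torch.stack([x ** i for i in range(1, self.q + 1)], dim=2).flatten(1, 2), which should yield the same layout; this is a suggestion and has not been tested against fastonn.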

fastonn/fastonn/SelfONN.py

Line 276 in 9591a31

x = cat([(x ** i) for i in range(1, self.q + 1)], dim=1)
