Question about computation reduction #3
@Planck35 By computation, do you mean the amount of computation (i.e. the number of floating-point operations)? If so, then no: the amount of computation stays roughly the same after pruning, but it should not increase.
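A small sketch (mine, not from this repo) of why the operation count is unchanged: a dense matrix-vector multiply performs the same number of multiply-adds whether or not some weights have been zeroed, because the loops do not skip zero entries.

```python
def matmul_flop_count(W, x):
    """Multiply W (m x n) by vector x (length n), counting multiplies performed."""
    m, n = len(W), len(x)
    y = [0.0] * m
    mults = 0
    for i in range(m):
        for j in range(n):
            y[i] += W[i][j] * x[j]  # executed even when W[i][j] == 0.0
            mults += 1
    return y, mults

W = [[1.0, 2.0], [3.0, 4.0]]
W_pruned = [[1.0, 0.0], [0.0, 4.0]]  # 50% of weights masked to zero
x = [1.0, 1.0]
_, flops_dense = matmul_flop_count(W, x)
_, flops_pruned = matmul_flop_count(W_pruned, x)
assert flops_dense == flops_pruned == 4  # same op count either way
```

Only a sparse kernel that actually skips (or never stores) the zeros would reduce the operation count.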
@larry0123du Hi, I used the code on my model, but the pruned model's size is exactly equal to the model's size before pruning. What is the reason for this?
The reason is that the weights are simply truncated to zero, but zero is still represented as a floating-point number. So in essence, as long as the sizes of the matrices are unchanged, your model does not change in size. In the original paper, Han et al. supplement pruning with a Huffman encoding scheme, which reduces the stored model size, if I remember right.
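This is easy to check with a quick sketch (mine, not part of this repo): masking weights to zero leaves the in-memory size untouched, while an entropy coder (here zlib as a stand-in for Huffman coding) does shrink the pruned weights, since long runs of zeros compress well.

```python
import zlib
import numpy as np

rng = np.random.default_rng(0)
w = rng.standard_normal((1024, 1024)).astype(np.float32)

mask = np.abs(w) > 1.0       # magnitude pruning: zero out small weights
w_pruned = (w * mask).astype(np.float32)

# Zeros are still stored as 32-bit floats, so the array occupies
# exactly the same number of bytes as before pruning.
assert w.nbytes == w_pruned.nbytes == 1024 * 1024 * 4

# Entropy coding is what actually recovers the savings:
# the mostly-zero pruned weights compress far better than the dense ones.
assert len(zlib.compress(w_pruned.tobytes())) < len(zlib.compress(w.tobytes()))
```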
@larry0123du OK, thanks!
@larry0123du Hello, if the weights are simply truncated to zero, will the inference speed increase?
You did a very nice implementation, but I want to ask about the weights that got masked to zero.
Does the total computation stay the same even though those weights' values are zero, or does the computation speed change?