
Description: Related work in Paper #13

Open
zwxdxcm opened this issue Nov 3, 2023 · 10 comments
zwxdxcm commented Nov 3, 2023

[image: screenshot of the description in question]

I am wondering whether this description is correct. Why is QAT not in the blue highlight?

Thanks!


zwxdxcm commented Nov 3, 2023

I mean, if you want to train a neural field from scratch, it is reasonable to implement end-to-end (E2E) compression, which should use QAT. For a pre-trained model, it would be better to use PTQ.
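For other readers: QAT simulates quantization in the forward pass during training so the network can adapt to it. A minimal sketch of the idea (NumPy, illustrative only, not this repo's code; `fake_quantize` is a hypothetical helper):

```python
import numpy as np

def fake_quantize(x, num_bits=8):
    """Simulate symmetric uniform quantization in the forward pass (QAT-style).

    In a real QAT setup (e.g. PyTorch), the non-differentiable rounding
    is handled with a straight-through estimator:
        x_q = x + (quantize(x) - x).detach()
    so gradients flow through `x` unchanged.
    """
    qmax = 2 ** (num_bits - 1) - 1        # e.g. 127 for 8 bits
    scale = np.max(np.abs(x)) / qmax      # per-tensor scale
    q = np.clip(np.round(x / scale), -qmax, qmax)
    return q * scale                      # dequantized ("fake") values

weights = np.array([0.50, -1.27, 0.003, 1.27])
print(fake_quantize(weights))
```

PTQ, by contrast, applies the same quantization only once after training, without the network ever seeing the quantization error.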

@daniel03c1
Owner

Sorry for the confusion. You are right; the description was written incorrectly. Thank you.


zwxdxcm commented Nov 3, 2023

> Sorry for the confusion. You are right, and we were incorrectly written. Thank you.

Thanks for your reply. Just to double-check: this work implements QAT, since it trains the network from scratch. Have you compared how much time is spent on the additional computations during training and inference?


zwxdxcm commented Nov 3, 2023

And I also have two other questions.

  1. This work's target is to reduce memory (runtime storage) rather than model storage, right? So what is depicted in the experiments section is memory?
  2. I am wondering how the mask process is designed. Is there any related work? I cannot understand it.
[images: screenshots of the masking formulation from the paper]

Thank you!


zwxdxcm commented Nov 3, 2023

Oh, I see: it seems like a threshold function. But why would you use the stop-gradient operator?

@daniel03c1
Owner

Thank you for your interest in our work.

  1. Regarding time, it only increases the training time, and the exact training times are reported in the supplementary material. During inference, the wavelet coefficients need to be converted only once, which takes constant time to unpack the sparse representations into spatial grids, so the inference time is equal to that of the original model without masking or wavelet transforms.
  2. This work only considers storage memory. It may need extra memory during training, but during inference it requires only as much memory as the original model.
  3. I believe this question is related to About detach() function #10. If you have further questions, please feel free to ask.
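In case it helps other readers, here is a rough sketch of how a stop-gradient (straight-through) binary mask is typically written. This is an illustration of the general technique, not the repository's actual implementation:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def straight_through_mask(logits):
    """Binary mask with a straight-through gradient.

    Forward: a hard 0/1 threshold on sigmoid(logits).
    Backward (in an autograd framework): gradients flow through the
    soft sigmoid, because the hard part is rerouted via stop-gradient:
        mask = soft + stop_gradient(hard - soft)
    In PyTorch this is usually written `soft + (hard - soft).detach()`
    (see issue #10 on the detach() function).
    """
    soft = sigmoid(logits)
    hard = (soft > 0.5).astype(logits.dtype)
    # Numerically, soft + (hard - soft) == hard in the forward pass;
    # the subtraction pattern only changes which path gradients take.
    return soft + (hard - soft)  # stop_gradient omitted: NumPy has no autograd

logits = np.array([-2.0, -0.1, 0.3, 4.0])
coeffs = np.array([10.0, 20.0, 30.0, 40.0])
print(straight_through_mask(logits) * coeffs)
```

Without the stop-gradient trick, the hard threshold has zero gradient almost everywhere, so the mask logits could never be learned.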


zwxdxcm commented Nov 3, 2023

OK. Thanks ~~


zwxdxcm commented Nov 3, 2023

From what I understand, the masking part acts more like a learnable frequency filter. Am I correct?

@daniel03c1
Owner

It is not necessarily a frequency filter. The masking method itself can filter whatever you apply it to. For example, if you apply the masking method to spatial grids, it filters spatial coefficients. If you apply it to frequency grids (after a DCT), you get a frequency filter.
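To illustrate the point with a rough sketch (a fixed threshold stands in for the learned mask, and NumPy's FFT stands in for the paper's DCT; none of this is the repository's code):

```python
import numpy as np

grid = np.array([3.0, 3.1, 2.9, 3.0, -3.0, -3.1, -2.9, -3.0])

# 1) Mask applied directly to the spatial grid: filters spatial coefficients.
spatial_mask = np.abs(grid) > 3.0
spatial_filtered = grid * spatial_mask

# 2) Same masking mechanism applied after a frequency transform:
#    now it acts as a frequency filter.
freq = np.fft.rfft(grid)
freq_mask = np.abs(freq) > 1.0                      # keep strong frequencies
spatial_from_freq = np.fft.irfft(freq * freq_mask,  # back to the spatial domain
                                 n=len(grid))

print(spatial_filtered)
print(np.round(spatial_from_freq, 2))
```

The mask mechanism is the same in both cases; what it filters depends entirely on the domain of the grid it is applied to.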


zwxdxcm commented Nov 6, 2023

Thanks for your reply !
