PyTorch implementation of "Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks"
Original paper: https://arxiv.org/abs/1908.05033
This repository follows Algorithm 1 in the paper.
As mentioned in the paper:
For clipping values l and u, we try the following two strategies:
moving average statistics and optimization by backward propagation.
Because the paper fine-tunes from a pre-trained model, it can estimate suitable clipping values with these strategies.
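For reference, the first strategy is commonly realized as an exponential moving average of the per-batch minimum and maximum. The sketch below is only an illustration of that idea under my own assumptions; it is not code from this repository or the paper, and the names `MovingAverageClipObserver` and `momentum` are hypothetical.

```python
import torch
import torch.nn as nn

class MovingAverageClipObserver(nn.Module):
    """Tracks clipping bounds l and u as moving averages of batch min/max."""

    def __init__(self, momentum=0.99):
        super().__init__()
        self.momentum = momentum
        self.register_buffer("lower", torch.tensor(0.0))       # l
        self.register_buffer("upper", torch.tensor(0.0))       # u
        self.register_buffer("initialized", torch.tensor(False))

    @torch.no_grad()
    def forward(self, x):
        batch_min, batch_max = x.min(), x.max()
        if not self.initialized:
            # first batch: take the observed range directly
            self.lower.copy_(batch_min)
            self.upper.copy_(batch_max)
            self.initialized.fill_(True)
        else:
            # exponential moving average of the observed range
            self.lower.mul_(self.momentum).add_((1 - self.momentum) * batch_min)
            self.upper.mul_(self.momentum).add_((1 - self.momentum) * batch_max)
        return self.lower, self.upper
```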
Instead of using those strategies, this repository uses the maximum value of int32 as the initial clipping value (see the sketch after the note below).
This should not affect the representable value range, because the parameters of a deep model should not be that large, and most edge devices support value ranges up to int32.
Note that this has not been tested yet.
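To make the initialization concrete, here is a minimal sketch of a DSQ-style quantizer whose clipping bounds l and u are learnable parameters initialized to ±(2^31 − 1). It follows the soft quantization function from the paper, but it is written under my own assumptions rather than taken from this repository; the class name `DSQQuantizerSketch` and the arguments `bit` and `alpha` are illustrative.

```python
import math
import torch
import torch.nn as nn

INT32_MAX = float(2**31 - 1)  # initial clipping value described in this README

class DSQQuantizerSketch(nn.Module):
    """Illustrative DSQ-style quantizer with learnable clipping bounds."""

    def __init__(self, bit=4, alpha=0.2):
        super().__init__()
        self.bit = bit
        self.alpha = alpha  # controls how closely tanh approximates the hard step
        # Clipping bounds l and u, trained by backpropagation; initialized to
        # +/- int32 max instead of statistics from a pre-trained model.
        self.lower = nn.Parameter(torch.tensor(-INT32_MAX))
        self.upper = nn.Parameter(torch.tensor(INT32_MAX))

    def forward(self, x):
        l, u = self.lower, self.upper
        delta = (u - l) / (2 ** self.bit - 1)            # width of one quantization bin
        x = torch.min(torch.max(x, l), u)                # clip so gradients reach l and u
        i = torch.floor((x - l) / delta).clamp(max=2 ** self.bit - 2)  # bin index
        m = l + (i + 0.5) * delta                        # bin centre m_i
        k = math.log(2.0 / self.alpha - 1.0) / delta     # tanh sharpness
        s = 1.0 / (1.0 - self.alpha)                     # scale so phi reaches +/-1 at bin edges
        phi = s * torch.tanh(k * (x - m))                # soft step inside the bin
        return l + delta * (i + (phi + 1) / 2)           # differentiable quantized value
```

A layer could, for example, pass its weight tensor through such a module before the convolution or matrix multiplication, so that the clipping bounds are updated together with the weights during training.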