Skip to content

Quantization

quantization

Reducing number of bits for model weights and activations

Method for model-deployment-edge

Types:

  1. Float baseline
  2. post-training-quantization → dynamic range, integer
  3. quantization-aware-training

Supported by tensorflow-lite


*References

On this page