Quantization
quantization
Reducing number of bits for model weights and activations
Method for model-deployment-edge
Types:
- Float baseline
- post-training-quantization → dynamic range, integer
- quantization-aware-training
Supported by tensorflow-lite
Reducing number of bits for model weights and activations
Method for model-deployment-edge
Types:
Supported by tensorflow-lite