quantize
Create quantized deep neural network
Description
quantizedNetwork = quantize(quantObj) creates a quantized neural network object using a calibrated dlquantizer object, specified as quantObj. The quantized neural network object, returned as quantizedNetwork, enables visibility of the quantized layers, weights, and biases of the network, as well as quantized inference behavior.
quantizedNetwork = quantize(quantObj,Name,Value) creates a quantized neural network object using a calibrated dlquantizer object, specified as quantObj, with additional options specified by one or more name-value pair arguments.
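The two syntaxes above can be illustrated with a minimal sketch of the calibrate-then-quantize workflow. It is not taken from this page: net and calData are placeholder variables that you must supply, the 'MATLAB' execution environment is an assumption about which setting your release accepts for quantize, and the 'ExponentScheme' option is shown only as a possible name-value argument, so check the Input Arguments for your release before relying on it.

```matlab
% Sketch only. Placeholders (not defined on this page):
%   net     - a pretrained network to quantize
%   calData - a datastore of representative inputs for calibration

% Create the quantizer object. The 'MATLAB' execution environment is an
% assumption; other environments may or may not be accepted by quantize.
quantObj = dlquantizer(net, 'ExecutionEnvironment', 'MATLAB');

% Calibrate: exercise the network on calData to collect the dynamic
% ranges of the weights, biases, and activations.
calResults = calibrate(quantObj, calData);

% First syntax: create the quantized network from the calibrated quantizer.
quantizedNetwork = quantize(quantObj);

% Second syntax: pass a name-value argument. 'ExponentScheme' is assumed
% to be available in your release; verify before use.
% quantizedNetwork = quantize(quantObj, 'ExponentScheme', 'Histogram');

% Inspect the quantized layers of the result.
qDetails = quantizationDetails(quantizedNetwork);
disp(qDetails.QuantizedLayerNames)
```

Calling quantize before calibrate is expected to fail, because the quantizer object has no calibration statistics to derive the scaled integer types from.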
Examples
Input Arguments
Output Arguments
Limitations
For C/C++ and CUDA code generation, the software generates code for a convolutional deep neural network by quantizing the weights, biases, and activations of the convolution layers to 8-bit scaled integer data types. The quantization is performed by providing the calibration result file produced by the calibrate function to the codegen (MATLAB Coder) command.

Code generation does not support quantized deep neural networks produced by the quantize function.
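
As a rough sketch of the code generation path described above, the calibrated dlquantizer object is saved to a MAT-file and that file is passed to codegen through the deep learning configuration, rather than passing the output of quantize. This example assumes GPU Coder with the cuDNN target. The file name quantObj.mat, the entry-point function predict_int8, and the 227-by-227-by-3 input size are hypothetical and used only for illustration.

```matlab
% Sketch only. Assumes quantObj is a dlquantizer object that has already
% been calibrated with the calibrate function.

% Save the calibrated quantizer as the calibration result file.
% The file name is hypothetical.
save('quantObj.mat', 'quantObj');

% Configure code generation for a cuDNN target with int8 inference.
cfg = coder.gpuConfig('mex');
cfg.TargetLang = 'C++';
cfg.DeepLearningConfig = coder.DeepLearningConfig('cudnn');
cfg.DeepLearningConfig.DataType = 'int8';
cfg.DeepLearningConfig.CalibrationResultFile = 'quantObj.mat';

% predict_int8 is a hypothetical entry-point function that loads the
% original (unquantized) network and calls predict on its input.
% The input size is an assumption; match it to your network.
codegen -config cfg predict_int8 -args {ones(227,227,3,'single')}
```

The entry-point function works with the original network, and the quantization itself is applied during code generation from the calibration result file.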