interface DeploymentQuantization {
    bits: number;
}

Properties

Properties

bits: number

The number of bits each model parameter should be quantized to. The quantization method is chosen based on this value.