com.microsoft - DequantizeBFP#

DequantizeBFP - 1#

Version

  • name: DequantizeBFP (GitHub)

  • domain: com.microsoft

  • since_version: 1

  • function:

  • support_level: SupportType.COMMON

  • shape inference: True

This version of the operator has been available since version 1 of domain com.microsoft.

Summary

Attributes

  • bfp_type - INT (required) : The type of BFP - must match with the BFPType enum

  • block_dim - INT : Each bounding box spans this dimension.Typically, the block dimension corresponds to the reduction dimension of the matrix multipication that consumes the output of this operator.For example, for a 2D matrix multiplication A@W, QuantizeBFP(A) would use block_dim 1 and QuantizeBFP(W) would use block_dim 0.The default is the last dimension.

  • dtype - INT : The datatype to dequantize to.

Inputs

  • x (heterogeneous) - T1:

  • shape (heterogeneous) - T2:

  • strides (heterogeneous) - T2:

Outputs

  • y (heterogeneous) - T3:

Type Constraints

  • T1 in ( tensor(uint8) ): Constrain the input to uint8.

  • T2 in ( tensor(int64) ): Constrain shape and strides to uint64.

  • T3 in ( tensor(bfloat16), tensor(float), tensor(float16) ): Constrain y to float and bfloat16.

Examples