com.microsoft - DequantizeBFP

DequantizeBFP - 1

Version

This version of the operator has been available since version 1 of domain com.microsoft.

Summary

Attributes

bfp_type - INT (required) : The type of BFP. Must match a value of the BFPType enum.
block_dim - INT : Each bounding box spans this dimension. Typically, the block dimension corresponds to the reduction dimension of the matrix multiplication that consumes the output of this operator. For example, for a 2D matrix multiplication A@W, QuantizeBFP(A) would use block_dim 1 and QuantizeBFP(W) would use block_dim 0. The default is the last dimension.
dtype - INT : The datatype to dequantize to.
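
The block_dim convention for A@W can be sketched in plain Python. This is only an illustration of how the axes line up, not the operator's implementation: resolve_block_dim is a hypothetical helper, the shapes are made up, and rejecting negative axes is an assumption rather than documented operator behavior.

```python
def resolve_block_dim(rank, block_dim=None):
    """Hypothetical helper: turn an optional block_dim into a concrete axis."""
    if block_dim is None:
        # Documented default: the last dimension.
        return rank - 1
    # Assumption: only non-negative, in-range axes are accepted here.
    if not 0 <= block_dim < rank:
        raise ValueError(f"block_dim {block_dim} is out of range for rank {rank}")
    return block_dim


# Illustrative shapes for a 2D matrix multiplication A @ W.
a_shape = (2, 3)  # A is (M, K)
w_shape = (3, 4)  # W is (K, N)

a_axis = resolve_block_dim(len(a_shape), 1)  # blocks of A run along K
w_axis = resolve_block_dim(len(w_shape), 0)  # blocks of W run along K

# Both axes index the shared reduction dimension K.
assert a_shape[a_axis] == w_shape[w_axis]

# Omitting block_dim falls back to the last dimension.
default_axis = resolve_block_dim(len(a_shape))
```

With these choices the blocks of both operands lie along the dimension that the multiplication sums over. For A the default already selects that axis, while W needs block_dim 0 set explicitly.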

Inputs

Outputs

Type Constraints

T1 in ( tensor(uint8) ): Constrain the input to uint8.
T2 in ( tensor(int64) ): Constrain shape and strides to int64.
T3 in ( tensor(bfloat16), tensor(float), tensor(float16) ): Constrain y to float, float16, and bfloat16.

Examples