com.microsoft - QuantizeWithOrder#

QuantizeWithOrder - 1#

Version

This version of the operator has been available since version 1 of domain com.microsoft.

Summary

Attributes

order_input - INT (required) : cublasLt order of input matrix. ORDER_COL = 0, ORDER_ROW = 1, ORDER_COL32 = 2, ORDER_COL4_4R2_8C = 3, ORDER_COL32_2R_4R4 = 4. Please refer https://docs.nvidia.com/cuda/cublas/index.html#cublasLtOrder_t for their meaning.
order_output - INT (required) : cublasLt order of output matrix.

Inputs

Outputs

Type Constraints

Examples