Numpy to ONNX: Create ONNX graphs with an API similar to numpy#
Many people have come across the task of converting a pipeline that includes a custom preprocessing step embedded in a sklearn.preprocessing.FunctionTransformer. sklearn-onnx implements many converters. Their task is to create an ONNX graph for every scikit-learn model included in a pipeline. Every converter is a new implementation of methods predict, predict_proba or transform with ONNX Operators, but that does not cover custom functions. Writing a converter can be quite verbose and requires knowing the ONNX Operators, which are similar to numpy but not identical.
The goal here is to make it easier for users to have their custom functions converted into ONNX. Everybody working with scikit-learn knows numpy, so it should be possible to write a function using numpy and have it automatically converted into ONNX. This tutorial focuses on the implementation of custom transformers for scikit-learn. Notebook Loss function in ONNX focuses on the implementation of loss functions to train machine learned models.
This API was first added to mlprodict in version 0.6.
Principle#
The user writes a function using numpy functions but, behind the scenes, the package uses an ONNX runtime to execute it. To do that, this package reimplements many numpy functions with ONNX Operators. It looks like numpy but it uses ONNX. The following example shows how to replace numpy by ONNX.
<<<
from typing import Any
import numpy as np
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_default, NDArray

# The numpy function
def log_1(x):
    return np.log(x + 1)

# The ONNX function
@onnxnumpy_default
def onnx_log_1(x: NDArray[Any, np.float32]) -> NDArray[Any, np.float32]:
    return npnx.log(x + np.float32(1))

x = np.random.rand(2, 3).astype(np.float32)
print('numpy')
print(log_1(x))
print('onnx')
print(onnx_log_1(x))
>>>
numpy
[[0.661 0.268 0.412]
[0.495 0.171 0.598]]
onnx
[[0.661 0.268 0.412]
[0.495 0.171 0.598]]
ONNX runtimes are usually stricter about types than numpy (see onnxruntime). A function is implemented for a given input type and there is no implicit cast. There are three important elements in this example:

- Decorator onnxnumpy_default: it parses the annotations, creates the ONNX graph and initializes a runtime with it.
- Annotation: every input and output type must be specified. They are NDArray; the shape can be left undefined but the element type must be specified.
- Types: 1 is different from np.float32(1); the right type must be used.
onnx_log_1 is not a function but an instance of class wrapper_onnxnumpy. This class implements method __call__ to behave like a function and holds an attribute of type OnnxNumpyCompiler. That class contains an ONNX graph and an instance of a runtime. The following lines list some useful attributes:

- onnx_log_1: wrapper_onnxnumpy
- onnx_log_1.compiled: OnnxNumpyCompiler
- onnx_log_1.compiled.onnx_: the ONNX graph
- onnx_log_1.compiled.rt_fct_.rt: the runtime, by default OnnxInference
The ONNX graph is stored in onnx_log_1.compiled.onnx_.
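A minimal way to inspect that graph, assuming onnx_ is a regular onnx ModelProto, is to render it as text:

<<<
# onnx_ is expected to be a standard onnx ModelProto;
# onnx.helper.printable_graph renders its graph field as text
from onnx.helper import printable_graph
print(printable_graph(onnx_log_1.compiled.onnx_.graph))
>>>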
There is a fundamental difference between numpy and ONNX: numpy allows inplace modifications. The simple instruction m[:, 0] = 1 modifies an entire column of an existing array. ONNX does not allow that; even if the same result can be achieved, it is stored in a new array. See section How to convert inplace modifications for more details. This API intends to be easier to use than the current ONNX API or the operator API introduced in skl2onnx.
Both of those APIs require knowing the ONNX operators and imply an extra step to convert code written with numpy into code written with ONNX operators. Even though the operators are similar, signatures may differ.
Available functions#
This tool does not implement every numpy function. This is a work in progress. The list of supported functions is available in module npy.numpy_onnx_impl.
Common operators +, -, /, *, **, %, [] are supported as well. They are implemented by class OnnxVar. This class also implements methods such as astype and properties such as shape, size, T.
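A short sketch combining these operators and property T (the function name onnx_normalize_t is made up for this illustration):

<<<
from typing import Any
import numpy as np
from mlprodict.npy import onnxnumpy_default, NDArray

@onnxnumpy_default
def onnx_normalize_t(x: NDArray[Any, np.float32]) -> NDArray[Any, np.float32]:
    # operators - and / and property T are provided by class OnnxVar
    return ((x - np.float32(1)) / np.float32(2)).T

x = np.random.rand(2, 3).astype(np.float32)
print(onnx_normalize_t(x))
>>>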
FunctionTransformer#
Now that ONNX was used to implement a custom function, this function needs to be used by a sklearn.preprocessing.FunctionTransformer. One instance is added to a pipeline trained on the Iris dataset.
<<<
from typing import Any
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import FunctionTransformer, StandardScaler
from sklearn.linear_model import LogisticRegression
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_default, NDArray
from mlprodict.onnx_conv import to_onnx
from mlprodict.onnxrt import OnnxInference

@onnxnumpy_default
def onnx_log_1(x: NDArray[Any, np.float32]) -> NDArray[(None, None), np.float32]:
    return npnx.log(x + np.float32(1))

data = load_iris()
X, y = data.data.astype(np.float32), data.target
X_train, X_test, y_train, y_test = train_test_split(X, y)

pipe = make_pipeline(
    FunctionTransformer(onnx_log_1),
    StandardScaler(),
    LogisticRegression())
pipe.fit(X_train, y_train)
print(pipe.predict_proba(X_test[:2]))

onx = to_onnx(pipe, X_train[:1], rewrite_ops=True,
              options={LogisticRegression: {'zipmap': False}})
oinf = OnnxInference(onx)
print(oinf.run({'X': X_test[:2]})['probabilities'])
>>>
[[9.569e-01 4.308e-02 1.977e-07]
[6.215e-04 3.500e-01 6.494e-01]]
[[9.569e-01 4.308e-02 1.977e-07]
[6.215e-04 3.500e-01 6.494e-01]]
ONNX is still stricter than numpy. Some elements must be adapted every time this API is used:

- The custom function signature uses float32; training and testing data are cast into float32.
- The return shape of onnx_log_1 was changed into NDArray[(None, None), np.float32]. Otherwise the converter for StandardScaler raises an exception (see Shape mismatch).
- Method to_onnx is called with parameter rewrite_ops=True. This parameter tells the function to overwrite the converter for FunctionTransformer with a new one which supports custom functions implemented with this API (see Missing converter).
Custom Predictor or Transformer#
Creating a custom predictor or transformer is not a common task but is still not too difficult with the scikit-learn API. It becomes a more difficult task when it comes to converting a pipeline involving this new model into ONNX. It means writing a custom converter or, more simply, implementing the inference function with ONNX operators. It is difficult because ONNX operators are close to numpy functions but not exactly the same, and testing an ONNX conversion requires a runtime. That means more lines of code just to test.
Custom Classifier#
The conversion of a classifier is more complex than that of a regressor or a transformer because a classifier implements two methods: predict for the labels and predict_proba for the probabilities. The next example implements a weird classifier based on two logistic regressions. It does not do anything with ONNX yet. It is taken from notebook Introduction to a numpy API for ONNX: CustomClassifier.
<<<
import numpy
from pandas import DataFrame
from sklearn.base import ClassifierMixin, BaseEstimator
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.datasets import make_classification

X, y = make_classification(200, n_classes=2, n_features=2, n_informative=2,
                           n_redundant=0, n_clusters_per_class=2,
                           hypercube=False)
X_train, X_test, y_train, y_test = train_test_split(X, y)

class TwoLogisticRegression(ClassifierMixin, BaseEstimator):

    def __init__(self):
        ClassifierMixin.__init__(self)
        BaseEstimator.__init__(self)

    def fit(self, X, y, sample_weights=None):
        if sample_weights is not None:
            raise NotImplementedError(
                "weighted sample not implemented in this example.")
        # Barycenters
        self.weights_ = numpy.array([(y == 0).sum(), (y == 1).sum()])
        p1 = X[y == 0].sum(axis=0) / self.weights_[0]
        p2 = X[y == 1].sum(axis=0) / self.weights_[1]
        self.centers_ = numpy.vstack([p1, p2])
        self.classes_ = numpy.array([0, 1])
        # A vector orthogonal to the segment between barycenters
        v = p2 - p1
        v /= numpy.linalg.norm(v)
        x = numpy.random.randn(X.shape[1])
        x -= x.dot(v) * v
        x /= numpy.linalg.norm(x)
        self.hyperplan_ = x.reshape((-1, 1))
        # sign: which side of the hyperplane each sample lies on
        sign = ((X - p1) @ self.hyperplan_ >= 0).astype(numpy.int64).ravel()
        # Trains one model per side
        self.lr0_ = LogisticRegression().fit(X[sign == 0], y[sign == 0])
        self.lr1_ = LogisticRegression().fit(X[sign == 1], y[sign == 1])
        return self

    def predict_proba(self, X):
        sign = self.predict_side(X).reshape((-1, 1))
        prob0 = self.lr0_.predict_proba(X)
        prob1 = self.lr1_.predict_proba(X)
        prob = prob1 * sign - prob0 * (sign - 1)
        return prob

    def predict(self, X):
        prob = self.predict_proba(X)
        return prob.argmax(axis=1)

    def predict_side(self, X):
        return ((X - self.centers_[0]) @ self.hyperplan_ >= 0).astype(
            numpy.int64).ravel()

model = TwoLogisticRegression()
model.fit(X_train, y_train)
print(model.predict(X_test[:5]), model.predict_proba(X_test[:5]))
>>>
[0 1 1 0 1] [[0.955 0.045]
[0.186 0.814]
[0.127 0.873]
[0.504 0.496]
[0.242 0.758]]
The next step is to convert this classifier into ONNX. Instead of writing a converter, the strategy is to implement methods predict and predict_proba with ONNX instead of numpy. That's where the numpy API for ONNX becomes handy, with some decorators to simplify many things, among them the types. Python does not really enforce argument types but ONNX does. As a result, if the predict method is used with both float32 and float64, two ONNX graphs are created and executed with a runtime. When method predict is called, the input type is detected and an ONNX graph is generated. If the second call uses the same type, the same graph is reused. Let's see how to do it.
<<<
import numpy
from pandas import DataFrame
from sklearn.base import ClassifierMixin, BaseEstimator
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.datasets import make_classification
from mlprodict.npy import onnxsklearn_class
from mlprodict.npy.onnx_variable import MultiOnnxVar
from mlprodict.onnx_conv import to_onnx
import mlprodict.npy.numpy_onnx_impl as nxnp
import mlprodict.npy.numpy_onnx_impl_skl as nxnpskl

X, y = make_classification(200, n_classes=2, n_features=2, n_informative=2,
                           n_redundant=0, n_clusters_per_class=2,
                           hypercube=False)
X_train, X_test, y_train, y_test = train_test_split(X, y)

@onnxsklearn_class('onnx_graph', op_version=14)  # opset=13, 14, ...
class TwoLogisticRegressionOnnx(ClassifierMixin, BaseEstimator):

    def __init__(self):
        ClassifierMixin.__init__(self)
        BaseEstimator.__init__(self)

    def fit(self, X, y, sample_weights=None):
        if sample_weights is not None:
            raise NotImplementedError(
                "weighted sample not implemented in this example.")
        # Barycenters
        self.weights_ = numpy.array([(y == 0).sum(), (y == 1).sum()])
        p1 = X[y == 0].sum(axis=0) / self.weights_[0]
        p2 = X[y == 1].sum(axis=0) / self.weights_[1]
        self.centers_ = numpy.vstack([p1, p2])
        self.classes_ = numpy.array([0, 1])
        # A vector orthogonal to the segment between barycenters
        v = p2 - p1
        v /= numpy.linalg.norm(v)
        x = numpy.random.randn(X.shape[1])
        x -= x.dot(v) * v
        x /= numpy.linalg.norm(x)
        self.hyperplan_ = x.reshape((-1, 1))
        # sign: which side of the hyperplane each sample lies on
        sign = ((X - p1) @ self.hyperplan_ >= 0).astype(numpy.int64).ravel()
        # Trains one model per side
        self.lr0_ = LogisticRegression().fit(X[sign == 0], y[sign == 0])
        self.lr1_ = LogisticRegression().fit(X[sign == 1], y[sign == 1])
        return self

    def onnx_graph(self, X):
        h = self.hyperplan_.astype(X.dtype)
        c = self.centers_.astype(X.dtype)
        sign = ((X - c[0]) @ h) >= numpy.array([0], dtype=X.dtype)
        cast = sign.astype(X.dtype).reshape((-1, 1))
        # Function logistic_regression is not a numpy function.
        # It calls the converter for a LogisticRegression
        # implemented in sklearn-onnx.
        prob0 = nxnpskl.logistic_regression(X, model=self.lr0_)[1]
        prob1 = nxnpskl.logistic_regression(X, model=self.lr1_)[1]
        prob = prob1 * cast - prob0 * (cast - numpy.array([1], dtype=X.dtype))
        label = nxnp.argmax(prob, axis=1)
        return MultiOnnxVar(label, prob)

model = TwoLogisticRegressionOnnx()
model.fit(X_train, y_train)
print(model.predict(X_test[:5]), model.predict_proba(X_test[:5]))

onx = to_onnx(model, X_test[:5], target_opset=14)  # opset=13, 14, ...
# print(onx)  # too long to be displayed
>>>
[1 0 0 0 1] [[0.044 0.956]
[0.999 0.001]
[0.929 0.071]
[0.978 0.022]
[0.104 0.896]]
The decorator @onnxsklearn_class('onnx_graph') (see onnxsklearn_class) declares method onnx_graph as the method which creates the ONNX graph. In a classifier case, it returns two outputs, label and probabilities, assembled within an instance of MultiOnnxVar. The decorator detects the class is a classifier (ClassifierMixin) and links the two outputs to the two methods predict and predict_proba, in that order. When one of them is called, the following steps happen:

- Detect the input type,
- Check whether an ONNX graph was already generated for this type,
- Generate the ONNX graph if it does not exist,
- Create a runtime instance if it does not exist,
- Return the output of the runtime.

The instruction to_onnx(model, X_test[:5], target_opset=?) creates an ONNX graph by calling method onnx_graph, registered as a converter in skl2onnx. It is equivalent to something like model.onnx_graph(X_test[:5]).to_algebra()[0].to_onnx({'X': X}).
The implementation of method onnx_graph relies on numpy functions implemented with ONNX operators from module npy.numpy_onnx_impl and on converters for scikit-learn models wrapped into functions from module npy.numpy_onnx_impl_skl.
Custom Transformer#
The syntax is the same. The decorator @onnxsklearn_class("onnx_transform", op_version=?) detects the class is a transformer and automatically adds method transform.
<<<
import numpy
from pandas import DataFrame
from sklearn.base import TransformerMixin, BaseEstimator
from sklearn.decomposition import PCA
from sklearn.model_selection import train_test_split
from sklearn.datasets import make_classification
from mlprodict.npy import onnxsklearn_class
from mlprodict.onnx_conv import to_onnx
import mlprodict.npy.numpy_onnx_impl as nxnp
import mlprodict.npy.numpy_onnx_impl_skl as nxnpskl

X, y = make_classification(200, n_classes=2, n_features=2, n_informative=2,
                           n_redundant=0, n_clusters_per_class=2,
                           hypercube=False)
X_train, X_test, y_train, y_test = train_test_split(X, y)

@onnxsklearn_class("onnx_transform", op_version=14)  # opset=13, 14, ...
class DecorrelateTransformerOnnx(TransformerMixin, BaseEstimator):

    def __init__(self, alpha=0.):
        BaseEstimator.__init__(self)
        TransformerMixin.__init__(self)
        self.alpha = alpha

    def fit(self, X, y=None, sample_weights=None):
        self.pca_ = PCA(X.shape[1])  # pylint: disable=W0201
        self.pca_.fit(X)
        return self

    def onnx_transform(self, X):
        if X.dtype is None:
            raise AssertionError("X.dtype cannot be None.")
        mean = self.pca_.mean_.astype(X.dtype)
        cmp = self.pca_.components_.T.astype(X.dtype)
        return (X - mean) @ cmp

model = DecorrelateTransformerOnnx()
model.fit(X_train)
print(model.transform(X_test[:5]))

onx = to_onnx(model, X_test[:5], target_opset=14)  # opset=13, 14, ...
print(onx)
>>>
[[ 0.262 -0.44 ]
[ 0.418 1.311]
[ 0.512 1.3 ]
[-0.599 -1.291]
[-1.25 -0.296]]
ir_version: 7
producer_name: "skl2onnx"
producer_version: "1.11.1"
domain: "ai.onnx"
model_version: 0
doc_string: ""
graph {
  node {
    input: "X"
    input: "init"
    output: "out_sub_0"
    name: "_sub__sub"
    op_type: "Sub"
    domain: ""
  }
  node {
    input: "out_sub_0"
    input: "init_1"
    output: "variable1"
    name: "_sub__matmul"
    op_type: "MatMul"
    domain: ""
  }
  node {
    input: "variable1"
    output: "variable"
    name: "_sub_variable"
    op_type: "Identity"
    domain: ""
  }
  name: "mlprodict_ONNX(DecorrelateTransformerOnnx)"
  initializer {
    dims: 2
    data_type: 11
    name: "init"
    raw_data: "\000.\2069\250\025\250\277\324\220\247s3\037\271?"
  }
  initializer {
    dims: 2
    dims: 2
    data_type: 11
    name: "init_1"
    raw_data: "D\231\227\273%\321\214?y\225\226b0\377\357\277y\225\226b0\377\357?D\231\227\273%\321\214?"
  }
  input {
    name: "X"
    type {
      tensor_type {
        elem_type: 11
        shape {
          dim {
          }
          dim {
            dim_value: 2
          }
        }
      }
    }
  }
  output {
    name: "variable"
    type {
      tensor_type {
        elem_type: 11
        shape {
          dim {
          }
          dim {
          }
        }
      }
    }
  }
}
opset_import {
  domain: ""
  version: 14
}
More options#
Use onnxruntime as ONNX runtime#
By default, the ONNX graph is executed by the Python runtime implemented in this module (see Python Runtime for ONNX). It is a mix of numpy and C++ implementations and does not require any new dependency. However, it is possible to use a different one like onnxruntime, which implements more ONNX Operators. The only change is an argument of wrapper onnxnumpy_np: @onnxnumpy_np(runtime='onnxruntime').
<<<
from typing import Any
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import FunctionTransformer, StandardScaler
from sklearn.linear_model import LogisticRegression
from onnxruntime import InferenceSession
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_np, NDArray
from mlprodict.onnx_conv import to_onnx

@onnxnumpy_np(runtime='onnxruntime')
def onnx_log_1(x: NDArray[Any, np.float32]) -> NDArray[(None, None), np.float32]:
    return npnx.log(x + np.float32(1))

data = load_iris()
X, y = data.data.astype(np.float32), data.target
X_train, X_test, y_train, y_test = train_test_split(X, y)

pipe = make_pipeline(
    FunctionTransformer(onnx_log_1),
    StandardScaler(),
    LogisticRegression())
pipe.fit(X_train, y_train)
print(pipe.predict_proba(X_test[:2]))

onx = to_onnx(pipe, X_train[:1], rewrite_ops=True,
              options={LogisticRegression: {'zipmap': False}})
oinf = InferenceSession(onx.SerializeToString())
print(oinf.run(None, {'X': X_test[:2]})[1])
>>>
[[9.722e-01 2.775e-02 2.494e-07]
[9.850e-01 1.501e-02 9.336e-08]]
[[9.722e-01 2.775e-02 2.494e-07]
[9.850e-01 1.501e-02 9.336e-08]]
Use a specific ONNX opset#
By default, the ONNX graph generated by the wrapper uses the latest ONNX opset, but it is possible to use an older one if the involved runtime does not implement the latest version. The desired opset must be specified in two places: the first time as an argument of onnxnumpy_np, the second time as an argument of to_onnx.
<<<
from typing import Any
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import FunctionTransformer, StandardScaler
from sklearn.linear_model import LogisticRegression
from onnxruntime import InferenceSession
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_np, NDArray
from mlprodict.onnx_conv import to_onnx

target_opset = 11

@onnxnumpy_np(op_version=target_opset)  # first place
def onnx_log_1(x: NDArray[Any, np.float32]) -> NDArray[(None, None), np.float32]:
    return npnx.log(x + np.float32(1))

data = load_iris()
X, y = data.data.astype(np.float32), data.target
X_train, X_test, y_train, y_test = train_test_split(X, y)

pipe = make_pipeline(
    FunctionTransformer(onnx_log_1),
    StandardScaler(),
    LogisticRegression())
pipe.fit(X_train, y_train)
print(pipe.predict_proba(X_test[:2]))

onx = to_onnx(pipe, X_train[:1], rewrite_ops=True,
              options={LogisticRegression: {'zipmap': False}},
              target_opset=target_opset)  # second place
oinf = InferenceSession(onx.SerializeToString())
print(oinf.run(None, {'X': X_test[:2]})[1])
>>>
[[9.752e-01 2.475e-02 1.282e-07]
[9.790e-01 2.096e-02 6.769e-07]]
[[9.752e-01 2.475e-02 1.282e-07]
[9.790e-01 2.096e-02 6.769e-07]]
Same implementation for float32 and float64#
Only one input type is allowed by default, but there is a way to define a function supporting more than one type with NDArrayType. When function onnx_log_1 is called, the input types are detected, an ONNX graph is generated and executed. The next time the same function is called, if the input types are the same as before, it reuses the same ONNX graph and the same runtime; otherwise, it generates a new ONNX graph taking this new type as input. The expression x.dtype returns the type of this input in order to cast the constant 1 into the right type before it is used by another operator.
<<<
import numpy as np
from onnxruntime import InferenceSession
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_np, NDArray
from mlprodict.npy.onnx_numpy_annotation import NDArrayType
from mlprodict.onnx_conv import to_onnx

@onnxnumpy_np(signature=NDArrayType('floats'), runtime='onnxruntime')
def onnx_log_1(x):
    return npnx.log(x + x.dtype(1))

x = np.random.rand(2, 3)
y = onnx_log_1(x.astype(np.float32))
print(y.dtype, y)
y = onnx_log_1(x.astype(np.float64))
print(y.dtype, y)
>>>
float32 [[0.688 0.285 0.588]
[0.271 0.446 0.039]]
float64 [[0.688 0.285 0.588]
[0.271 0.446 0.039]]
There are more options. Many of them are used in module npy.numpy_onnx_pyrt. It is possible to add arguments with default values or an undefined number of inputs. One important detail though: a different value for an argument (not an input) means the ONNX graph has to be different, because this value is stored in the graph instead of being an input. Every time an input type or an argument changes, a new ONNX graph is generated and executed.
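A hedged sketch of the difference between an input and an argument, assuming default-valued keyword arguments are treated as graph constants (the function name onnx_log_k is made up):

<<<
import numpy as np
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_np
from mlprodict.npy.onnx_numpy_annotation import NDArrayType

@onnxnumpy_np(signature=NDArrayType('floats'))
def onnx_log_k(x, k=1.):
    # k is an argument, not an input: it is stored in the ONNX graph,
    # so a different value of k triggers the generation of a new graph
    return npnx.log(x + x.dtype(k))

x = np.random.rand(2, 3).astype(np.float32)
print(onnx_log_k(x))         # first graph, k=1.
print(onnx_log_k(x, k=2.))   # second graph, k=2.
>>>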
How to convert inplace modifications#
As mentioned earlier, there is no way to modify a tensor inplace: every modification implies a copy. A modification can be done by creating a new tensor concatenated from other tensors or by using operators Op.ScatterElements or Op.ScatterND. The instruction v[5] = 3.5 is correct with numpy. Class OnnxVar replaces that instruction with operator Op.ScatterElements. Operator [] (__setitem__) must return the instance itself (self). That's why the design is different from the other methods: instead of returning a new instance of OnnxVar, it replaces its only input. However, that requires operator [] to follow a copy: v[5] = 3.5 alone may not be valid but v = v.copy(); v[5] = 3.5 always is. The current implementation only supports one-dimensional tensors; operators Op.ScatterElements and Op.ScatterND are not really meant to change only one element but many of them. Instructions using slices are supported as well: v[:5] = 3.5, v[5:] = 3.5, ...
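A minimal sketch on a one-dimensional tensor (the function name onnx_set_slice is made up), copying before the assignment as explained above:

<<<
from typing import Any
import numpy as np
from mlprodict.npy import onnxnumpy_default, NDArray

@onnxnumpy_default
def onnx_set_slice(x: NDArray[Any, np.float32]) -> NDArray[Any, np.float32]:
    v = x.copy()             # ONNX cannot modify x inplace, copy first
    v[:5] = np.float32(3.5)  # translated into Op.ScatterElements / Op.ScatterND
    return v

x = np.zeros((10, ), dtype=np.float32)
print(onnx_set_slice(x))
>>>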
Common errors#
Missing wrapper#
The wrapper intercepts the output of the function and returns a new function backed by a runtime. Without it, the inner function returns an instance of type OnnxVar, a layer on top of ONNX which holds a method, to_algebra, doing the conversion to ONNX.
<<<
from typing import Any
import numpy as np
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_default, NDArray

def onnx_log_1(x: NDArray[Any, np.float32]) -> NDArray[Any, np.float32]:
    return npnx.log(x + np.float32(1))

x = np.random.rand(2, 3).astype(np.float32)
print(onnx_log_1(x))
>>>
OnnxVar(array([[1.131, 1.134, 1.485],
[1.202, 1.786, 1.071]], dtype=float32), op=OnnxLog)
The execution does not fail but returns an instance of class OnnxVar. This instance holds all the necessary information to create the ONNX graph.
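Adding the decorator, as in the first example of this page, turns it back into a function executed by a runtime:

<<<
@onnxnumpy_default
def onnx_log_1(x: NDArray[Any, np.float32]) -> NDArray[Any, np.float32]:
    return npnx.log(x + np.float32(1))

print(onnx_log_1(x))  # now returns a numpy array computed by the runtime
>>>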
Missing annotation#
The annotation is needed to determine the input and output types. Without it, the wrapper cannot build the ONNX graph and raises an error.
<<<
from typing import Any
import numpy as np
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_default, NDArray

@onnxnumpy_default
def onnx_log_1(x):
    return npnx.log(x + np.float32(1))
>>>
[runpythonerror]
Traceback (most recent call last):
File "<stdin>", line 15, in <module>
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/npy/onnx_numpy_wrapper.py", line 159, in onnxnumpy_default
return onnxnumpy()(fct)
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/npy/onnx_numpy_wrapper.py", line 134, in decorator_fct
compiled = OnnxNumpyCompiler(
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/npy/onnx_numpy_compiler.py", line 163, in __init__
self.onnx_ = self._to_onnx(
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/npy/onnx_numpy_compiler.py", line 379, in _to_onnx
self._parse_annotation(
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/npy/onnx_numpy_compiler.py", line 295, in _parse_annotation
raise RuntimeError( # pragma: no cover
RuntimeError: Unable to find annotation for argument 'x'. You should annotate the arguments and the results or specify a signature.
Type mismatch#
As mentioned above, ONNX is strict about types. If ONNX adds two tensors, it expects them to share the same type; if the types are different, one must be cast into the other.
<<<
from typing import Any
import numpy as np
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_default, NDArray

@onnxnumpy_default
def onnx_log_1(x: NDArray[Any, np.float32]) -> NDArray[Any, np.float32]:
    return npnx.log(x + 1)  # -> replace 1 by np.float32(1)

x = np.random.rand(2, 3).astype(np.float32)
print(onnx_log_1(x))
>>>
[runpythonerror]
Traceback (most recent call last):
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/npy/onnx_numpy_wrapper.py", line 71, in __call__
return self.compiled(*args, **kwargs)
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/npy/onnx_numpy_compiler.py", line 520, in __call__
res = self.rt_fct_(*args, **kwargs)
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/npy/onnx_numpy_compiler.py", line 85, in __call__
out = self.rt.run(inp, **kwargs)
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/onnxrt/onnx_inference.py", line 875, in run
return self._run(inputs, clean_right_away=False,
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/onnxrt/onnx_inference.py", line 1021, in _run_sequence_runtime
node.run(values)
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/onnxrt/onnx_inference_node.py", line 365, in run
res = self.ops_.run(*args)
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/onnxrt/ops_cpu/_op.py", line 815, in run
res = OpRunBinary.run(self, x, y)
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/onnxrt/ops_cpu/_op.py", line 706, in run
raise RuntimeTypeError(
mlprodict.onnxrt.ops_cpu._op.RuntimeTypeError: Input type mismatch: float32 != int64 (operator 'Add', shapes (2, 3), (1,))
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "<stdin>", line 19, in <module>
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/npy/onnx_numpy_wrapper.py", line 76, in __call__
raise RuntimeError(
RuntimeError: Unable to call the compiled version, args is [<class 'numpy.ndarray'>]. kwargs={}.
Shape mismatch#
The signature of the custom function does not specify any output shape, but the converter of the next transformer in the pipeline might expect one.
<<<
from typing import Any
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import FunctionTransformer, StandardScaler
from sklearn.linear_model import LogisticRegression
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_default, NDArray
from mlprodict.onnx_conv import to_onnx
from mlprodict.onnxrt import OnnxInference

@onnxnumpy_default
def onnx_log_1(x: NDArray[Any, np.float32]) -> NDArray[Any, np.float32]:
    return npnx.log(x + np.float32(1))

data = load_iris()
X, y = data.data.astype(np.float32), data.target
X_train, X_test, y_train, y_test = train_test_split(X, y)

pipe = make_pipeline(
    FunctionTransformer(onnx_log_1),
    StandardScaler(),
    LogisticRegression())
pipe.fit(X_train, y_train)
print(pipe.predict_proba(X_test[:2]))

onx = to_onnx(pipe, X_train[:1], rewrite_ops=True,
              options={LogisticRegression: {'zipmap': False}})
>>>
[[5.336e-04 1.752e-01 8.243e-01]
[2.022e-02 8.533e-01 1.265e-01]]
[runpythonerror]
NDArray[Any, np.float32] needs to be replaced by NDArray[(None, None), np.float32] to tell the next converter the output is a two-dimensional array.
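With that change, the signature of the example above becomes:

<<<
@onnxnumpy_default
def onnx_log_1(x: NDArray[Any, np.float32]) -> NDArray[(None, None), np.float32]:
    return npnx.log(x + np.float32(1))
>>>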
Missing converter#
The default converter for FunctionTransformer implemented in sklearn-onnx does not support custom functions, only the identity, which defeats the purpose of using such a preprocessing. The conversion fails unless the default converter is replaced by a new one supporting custom functions implemented with this API.
<<<
from typing import Any
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import FunctionTransformer, StandardScaler
from sklearn.linear_model import LogisticRegression
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_default, NDArray
from mlprodict.onnx_conv import to_onnx
from mlprodict.onnxrt import OnnxInference

@onnxnumpy_default
def onnx_log_1(x: NDArray[Any, np.float32]) -> NDArray[(None, None), np.float32]:
    return npnx.log(x + np.float32(1))

data = load_iris()
X, y = data.data.astype(np.float32), data.target
X_train, X_test, y_train, y_test = train_test_split(X, y)

pipe = make_pipeline(
    FunctionTransformer(onnx_log_1),
    StandardScaler(),
    LogisticRegression())
pipe.fit(X_train, y_train)

onx = to_onnx(pipe, X_train[:1],
              options={LogisticRegression: {'zipmap': False}})
>>>
[runpythonerror]
Traceback (most recent call last):
File "<stdin>", line 34, in <module>
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/onnx_conv/convert.py", line 483, in to_onnx
res = convert_sklearn(model, initial_types=initial_types, name=name,
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_venv/lib/python3.9/site-packages/skl2onnx/convert.py", line 184, in convert_sklearn
onnx_model = convert_topology(
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_venv/lib/python3.9/site-packages/skl2onnx/common/_topology.py", line 1421, in convert_topology
topology.convert_operators(container=container, verbose=verbose)
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_venv/lib/python3.9/site-packages/skl2onnx/common/_topology.py", line 1255, in convert_operators
self.call_shape_calculator(operator)
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_venv/lib/python3.9/site-packages/skl2onnx/common/_topology.py", line 1091, in call_shape_calculator
operator.infer_types()
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_venv/lib/python3.9/site-packages/skl2onnx/common/_topology.py", line 606, in infer_types
shape_calc(self)
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_venv/lib/python3.9/site-packages/skl2onnx/shape_calculators/function_transformer.py", line 14, in calculate_sklearn_function_transformer_output_shapes
raise RuntimeError("FunctionTransformer is not supported unless the "
RuntimeError: FunctionTransformer is not supported unless the transform function is None (= identity). You may raise an issue at https://github.com/onnx/sklearn-onnx/issues.
There are a couple of ways to fix this example. One way is to call function to_onnx with argument rewrite_ops=True; the function restores the default converters after the call. Another way is to call function register_rewritten_operators, but then the changes are permanent, as shown in the sketch below.
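A sketch of the second way, assuming register_rewritten_operators is exposed by module mlprodict.onnx_conv:

<<<
# permanently replaces the skl2onnx converters with the rewritten ones
from mlprodict.onnx_conv import register_rewritten_operators
register_rewritten_operators()
>>>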
Issue when an estimator is called by another one#
A new class is created and the method transform is implemented
with the numpy API for ONNX. This function must produce an ONNX
graph including the embedded the embedded model. It must call
the converter for this estimator to get that graph.
That what instruction nxnpskl.transformer(X, model=self.estimator_)
does. However it produces the following error.
<<<
import numpy
from sklearn.base import TransformerMixin, BaseEstimator
from sklearn.preprocessing import StandardScaler
from mlprodict.onnx_conv import to_onnx
from mlprodict.onnxrt import OnnxInference
from mlprodict.npy import onnxsklearn_class
import mlprodict.npy.numpy_onnx_impl_skl as nxnpskl

@onnxsklearn_class("onnx_graph")
class CustomTransformerOnnx(TransformerMixin, BaseEstimator):

    def __init__(self, base_estimator):
        TransformerMixin.__init__(self)
        BaseEstimator.__init__(self)
        self.base_estimator = base_estimator

    def fit(self, X, y, sample_weights=None):
        if sample_weights is not None:
            raise NotImplementedError(
                "weighted sample not implemented in this example.")
        self.estimator_ = self.base_estimator.fit(  # pylint: disable=W0201
            X, y, sample_weights)
        return self

    def onnx_graph(self, X):
        return nxnpskl.transformer(X, model=self.estimator_)

X = numpy.random.randn(20, 2).astype(numpy.float32)
y = ((X.sum(axis=1) + numpy.random.randn(
    X.shape[0]).astype(numpy.float32)) >= 0).astype(numpy.int64)

dec = CustomTransformerOnnx(StandardScaler())
dec.fit(X, y)
onx = to_onnx(dec, X.astype(numpy.float32))
oinf = OnnxInference(onx)
tr = dec.transform(X)  # pylint: disable=E1101
got = oinf.run({'X': X})
print(got)
>>>
{'variable': array([[-0.764, 0.762],
[-1.585, -1.715],
[-0.314, -0.751],
[-0.367, -0.494],
[ 1.297, 0.747],
[ 2.002, -0.84 ],
[-0.754, 0.375],
[ 0.91 , -1.197],
[ 1.317, 0.781],
[ 0.133, 0.703],
[-1.252, -0.523],
[-0.396, 1.359],
[ 0.197, 1.69 ],
[ 0.33 , 0.605],
[-0.645, 0.544],
[-1.039, 0.315],
[ 0.842, 0.003],
[-1.41 , 0.216],
[ 1.317, -2.434],
[ 0.182, -0.147]], dtype=float32)}
If that error appears, the instruction return nxnpskl.transformer(X, model=self.estimator_) should be replaced by return nxnpskl.transformer(X, model=self.estimator_).copy().
TypeError: unsupported operand type(s) for ** or pow(): 'float' and 'OnnxVar'#
The following example works because operator __radd__ was overloaded in class OnnxVar.
<<<
from typing import Any
import numpy as np
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_default, NDArray

def np_fct(x):
    return np.log(1 + x)

@onnxnumpy_default
def onnx_fct(x: NDArray[Any, np.float32]) -> NDArray[Any, np.float32]:
    return npnx.log(1. + x)

x = np.random.rand(2, 1).astype(np.float32)
print(np_fct(x), onnx_fct(x))
>>>
[[0.254]
[0.621]] [[0.254]
[0.621]]
But it is not the case for all operators.
<<<
from typing import Any
import numpy as np
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_default, NDArray

def np_fct(x):
    return np.log(2 ** x)

@onnxnumpy_default
def onnx_fct(x: NDArray[Any, np.float32]) -> NDArray[Any, np.float32]:
    return npnx.log(2. ** x)

x = np.random.rand(2, 1).astype(np.float32)
print(np_fct(x), onnx_fct(x))
>>>
[runpythonerror]
Traceback (most recent call last):
File "<stdin>", line 18, in <module>
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/npy/onnx_numpy_wrapper.py", line 159, in onnxnumpy_default
return onnxnumpy()(fct)
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/npy/onnx_numpy_wrapper.py", line 134, in decorator_fct
compiled = OnnxNumpyCompiler(
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/npy/onnx_numpy_compiler.py", line 163, in __init__
self.onnx_ = self._to_onnx(
File "/var/lib/jenkins/workspace/mlprodict/mlprodict_UT_39_std/_doc/sphinxdoc/source/mlprodict/npy/onnx_numpy_compiler.py", line 402, in _to_onnx
onx_var = self.fct_(*names_var, **kwargs)
File "<stdin>", line 19, in onnx_fct
TypeError: unsupported operand type(s) for ** or pow(): 'float' and 'OnnxVar'
Python calls the operator float.__pow__ and not OnnxVar.__pow__, which explains the error. Function cst can be used to convert a constant into an OnnxVar so that the right operator is called. The corrected example below assumes cst is exposed by module npy.numpy_onnx_impl (imported here as npnx).
<<<
from typing import Any
import numpy as np
import mlprodict.npy.numpy_onnx_impl as npnx
from mlprodict.npy import onnxnumpy_default, NDArray

def np_fct(x):
    return np.log(2 ** x)

@onnxnumpy_default
def onnx_fct(x: NDArray[Any, np.float32]) -> NDArray[Any, np.float32]:
    # cst wraps the constant into an OnnxVar so that
    # OnnxVar.__pow__ is called instead of float.__pow__
    return npnx.log(npnx.cst(np.float32(2)) ** x)

x = np.random.rand(2, 1).astype(np.float32)
print(np_fct(x), onnx_fct(x))
>>>