LogSoftmax - 11 vs 13#

Next section compares an older to a newer version of the same operator after both definition are converted into markdown text. Green means an addition to the newer version, red means a deletion. Anything else is unchanged.

Files changed (1) hide show

LogSoftmax11 → LogSoftmax13 +19 -10

LogSoftmax11 → LogSoftmax13 RENAMED Viewed

@@ -1 +1 @@
- The operator computes the log of softmax values for the given input:
+ The operator computes the logsoftmax (log of softmax) values for each layer in the batch
+  of the given input.
+ The input does not need to explicitly be a 2D vector; rather, it will be
-  LogSoftmax(input, axis) = Log(Softmax(input, axis=axis))
+ coerced into one. For an arbitrary n-dimensional tensor
+ input in [a_0, a_1, ..., a_{k-1}, a_k, ..., a_{n-1}] and k is
- The "axis" attribute indicates the dimension along which LogSoftmax
+ the axis provided, then input will be coerced into a 2-dimensional tensor with
+ dimensions [a_0 * ... * a_{k-1}, a_k * ... * a_{n-1}]. For the default
+ case where axis=1, this means the input tensor will be coerced into a 2D tensor
+ of dimensions [a_0, a_1 * ... * a_{n-1}], where a_0 is often the batch size.
+ In this situation, we must have a_0 = N and a_1 * ... * a_{n-1} = D.
+ Each of these dimensions must be matched correctly, or else the operator
- will be performed. The output tensor has the same shape
+ will throw errors. The output tensor has the same shape
- and contains the LogSoftmax values of the corresponding input.
+ and contains the logsoftmax values of the corresponding input.
  **Attributes**
  * **axis**:
-    Describes the dimension LogSoftmax will be performed on. Negative
+   Describes the axis of the inputs when coerced to 2D; defaults to one
+   because the 0th axis most likely describes the batch_size. Negative
    value means counting dimensions from the back. Accepted range is
    [-r, r-1] where r = rank(input).
  **Inputs**
  * **input** (heterogeneous) - **T**:
-   The input tensor of rank >= axis.
+   The input tensor that's coerced into a 2D matrix of size (NxD) as
+   described above.
  **Outputs**
  * **output** (heterogeneous) - **T**:
-   The output values with the same shape as the input tensor.
+   The output values with the same shape as input tensor (the original
+   size without coercion).
  **Type Constraints**
  * **T** in (
-   tensor(bfloat16),
    tensor(double),
    tensor(float),
    tensor(float16)
    ):
    Constrain input and output types to float tensors.