MaxPool - 1 vs 11

The next section compares an older version of the same operator with a newer one after both definitions are converted into markdown text. Green means an addition in the newer version, red means a deletion. Anything else is unchanged.

Files changed (1)
  1. MaxPool1 → MaxPool11 +5 -35
MaxPool1 → MaxPool11 RENAMED
@@ -1 +1 @@
  MaxPool consumes an input tensor X and applies max pooling across
  the tensor according to kernel sizes, stride sizes, and pad lengths.
  max pooling consisting of computing the max on all values of a
  subset of the input tensor according to the kernel size and downsampling the
  data into the output tensor Y for further processing. The output spatial shape will be following:
  ::
- output_spatial_shape[i] = floor((input_spatial_shape[i] + pad_shape[i] - ((kernel_spatial_shape[i] - 1) * dilations[i] + 1)) / strides_spatial_shape[i] + 1)
+ output_spatial_shape[i] = floor((input_spatial_shape[i] + pad_shape[i] - kernel_spatial_shape[i]) / strides_spatial_shape[i] + 1)
-
- or
- ::
-
- output_spatial_shape[i] = ceil((input_spatial_shape[i] + pad_shape[i] - ((kernel_spatial_shape[i] - 1) * dilations[i] + 1)) / strides_spatial_shape[i] + 1)
-
- if ceil_mode is enabled
-
- ::
  * pad_shape[i] is sum of pads along axis i
  auto_pad is a DEPRECATED attribute. If you are using them currently, the output spatial shape will be following:
  ::
- VALID: output_spatial_shape[i] = ceil((input_spatial_shape[i] - ((kernel_spatial_shape[i] - 1) * dilations[i] + 1) + 1) / strides_spatial_shape[i])
+ VALID: output_spatial_shape[i] = ceil((input_spatial_shape[i] - kernel_spatial_shape[i] + 1) / strides_spatial_shape[i])
  SAME_UPPER or SAME_LOWER: output_spatial_shape[i] = ceil(input_spatial_shape[i] / strides_spatial_shape[i])
  And pad shape will be following if SAME_UPPER or SAME_LOWER:
  ::
- pad_shape[i] = (output_spatial_shape[i] - 1) * strides_spatial_shape[i] + ((kernel_spatial_shape[i] - 1) * dilations[i] + 1) - input_spatial_shape[i]
+ pad_shape[i] = (output_spatial_shape[i] - 1) * strides_spatial_shape[i] + kernel_spatial_shape[i] - input_spatial_shape[i]
  The output of each pooling window is maximum number of elements exclude pad.
  **Attributes**
  * **auto_pad**:
  auto_pad must be either NOTSET, SAME_UPPER, SAME_LOWER or VALID.
  Where default value is NOTSET, which means explicit padding is used.
  SAME_UPPER or SAME_LOWER mean pad the input so that the output
  spatial size match the input.In case of odd number add the extra
  padding at the end for SAME_UPPER and at the beginning for
  SAME_LOWER. VALID mean no padding.
- * **ceil_mode**:
- Whether to use ceil or floor (default) to compute the output shape.
- * **dilations**:
- Dilation value along each spatial axis of filter. If not present,
- the dilation defaults to 1 along each spatial axis.
  * **kernel_shape** (required):
  The size of the kernel along each axis.
  * **pads**:
  Padding for the beginning and ending along each spatial axis, it can
  take any value greater than or equal to 0. The value represent the
  number of pixels added to the beginning and end part of the
  corresponding axis. pads format should be as follow [x1_begin,
  x2_begin...x1_end, x2_end,...], where xi_begin the number of pixels
  added at the beginning of axis i and xi_end, the number of pixels
  added at the end of axis i. This attribute cannot be used
  simultaneously with auto_pad attribute. If not present, the padding
  defaults to 0 along start and end of each spatial axis.
- * **storage_order**:
- The storage order of the tensor. 0 is row major, and 1 is column
- major.
  * **strides**:
- Stride along each spatial axis. If not present, the stride defaults
- to 1 along each spatial axis.
+ Stride along each spatial axis.
  **Inputs**
  * **X** (heterogeneous) - **T**:
  Input data tensor from the previous operator; dimensions for image
  case are (N x C x H x W), where N is the batch size, C is the number
  of channels, and H and W are the height and the width of the data.
  For non image case, the dimensions are in the form of (N x C x D1 x
  D2 ... Dn), where N is the batch size. Optionally, if dimension
  denotation is in effect, the operation expects the input data tensor
  to arrive with the dimension denotation of [DATA_BATCH,
  DATA_CHANNEL, DATA_FEATURE, DATA_FEATURE ...].
  **Outputs**
- Between 1 and 2 outputs.
-
  * **Y** (heterogeneous) - **T**:
  Output data tensor from average or max pooling across the input
  tensor. Dimensions will vary based on various kernel, stride, and
  pad sizes. Floor value of the dimension is used
- * **Indices** (optional, heterogeneous) - **I**:
- Indices tensor from max pooling across the input tensor. The
- dimensions of indices are the same as output tensor. The values in
- indices of are the indices of the selected values during pooling.
- The indices are computed as flatten 1-D tensor, and the indices do
- not consider padding. So the values in indices are in [0, N x C x D1
- x ... x Dn).
  **Type Constraints**
  * **T** in (
  tensor(double),
  tensor(float),
  tensor(float16)
  ):
- Constrain input and output types to float tensors.
+ Constrain input and output types to float tensors.
- * **I** in (
- tensor(int64)
- ):
- Constrain index tensor to int64