[inductor] Fix a bmm related bug #532
Conversation
Summary: The bug is exposed by a pytorch_struct training run. We need to be conservative with the bs=1 bmm optimization when any operand is not contiguous.
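To make the contiguity concern concrete, here is a small pure-Python sketch (an illustration, not torchinductor code) of why a batch-size-1 bmm operand cannot always be reinterpreted as a 2-D mm input: when the last two dims are transposed, the strides no longer describe a dense row-major layout, so dropping the batch dim does not yield a contiguous matrix.

```python
def contiguous_strides(shape):
    """Row-major (C-contiguous) strides for a given shape."""
    strides = [1] * len(shape)
    for i in range(len(shape) - 2, -1, -1):
        strides[i] = strides[i + 1] * shape[i + 1]
    return strides

def is_contiguous(shape, strides):
    """True when (shape, strides) describe a dense row-major layout."""
    return strides == contiguous_strides(shape)

# A contiguous (1, 4, 8) operand: dropping the batch dim is a free view.
print(is_contiguous([1, 4, 8], contiguous_strides([1, 4, 8])))  # True

# The same buffer with the last two dims transposed, shape (1, 8, 4),
# has strides [32, 1, 8]. Neither the 3-D layout nor the 2-D slice
# ([8, 4] with strides [1, 8]) is row-major, so it cannot be handed
# to a contiguous-only mm path without a copy.
transposed = [32, 1, 8]
print(is_contiguous([1, 8, 4], transposed))   # False
print(is_contiguous([8, 4], transposed[1:]))  # False
```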
b3 == 1
and is_contiguous_storage_and_layout(a)
and is_contiguous_storage_and_layout(b)
):
    # convert to normal mm
    data = MatrixMultiply(
        layout=output_layout.as_fixed(),
@jansel I guess another option is to generate a ReinterpretView instead of View when the batch size is 1, but I am not sure if that is safe to do here.
if b3 == 1:
    if (
        b3 == 1
        and is_contiguous_storage_and_layout(a)
This doesn't look right; MatrixMultiply should be able to handle transposed inputs.
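The point that a matrix-multiply kernel can consume transposed inputs can be illustrated with a toy stride-aware matmul (a sketch only, not inductor's extern kernel): by indexing through strides, the same flat buffer serves as either a matrix or its transpose, with no copy.

```python
def mm_strided(a_data, a_shape, a_strides, b_data, b_shape, b_strides):
    """Naive matmul over flat buffers addressed via explicit strides."""
    m, k = a_shape
    k2, n = b_shape
    assert k == k2, "inner dimensions must match"
    out = [[0] * n for _ in range(m)]
    for i in range(m):
        for j in range(n):
            acc = 0
            for p in range(k):
                acc += (a_data[i * a_strides[0] + p * a_strides[1]]
                        * b_data[p * b_strides[0] + j * b_strides[1]])
            out[i][j] = acc
    return out

# A is a row-major (2, 3) matrix; B reuses the SAME buffer viewed as
# A's transpose, shape (3, 2) with strides (1, 3) -- i.e. B = A^T.
buf = [1, 2, 3, 4, 5, 6]
print(mm_strided(buf, (2, 3), (3, 1), buf, (3, 2), (1, 3)))
# [[14, 32], [32, 77]]  (A @ A^T)
```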
The problem is that View.create does not return a ReinterpretView, because of the check at torchdynamo/torchinductor/ir.py, line 695 in 044e8bd:

if is_contiguous_storage_and_layout(x):

which then trips the assertion at torchdynamo/torchinductor/ir.py, line 1570 in 044e8bd:

assert isinstance(x, (Buffer, ReinterpretView)), x

I think making a special case in View.create might work. @jansel, any suggestion?
Hrm, in this case perhaps we should be treating it as contiguous, and ignore size=1 dims in contiguity checks. The bs=1 dim doesn't actually matter.
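The suggestion above can be sketched as a contiguity check that simply skips size-1 dimensions, whose strides never affect the memory layout. This is a hypothetical helper for illustration, not the actual torchinductor implementation:

```python
def is_contiguous_ignoring_unit_dims(shape, strides):
    """Contiguity check that ignores size-1 dims (hypothetical sketch).

    A size-1 dimension contributes nothing to addressing, so its stride
    is irrelevant; only the remaining dims must be dense row-major.
    """
    dims = [(s, st) for s, st in zip(shape, strides) if s != 1]
    expected = 1
    for size, stride in reversed(dims):
        if stride != expected:
            return False
        expected *= size
    return True

# A (1, 4, 8) tensor with an arbitrary stride on the size-1 batch dim
# is still effectively contiguous:
print(is_contiguous_ignoring_unit_dims([1, 4, 8], [999, 8, 1]))  # True

# A transposed layout still fails, as it should:
print(is_contiguous_ignoring_unit_dims([1, 8, 4], [32, 1, 8]))  # False
```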
size=1 is already not checked, but one (or both) of the 2d matrices resulting from removing the batch dimension might not be contiguous, and thus View.create returns a View and not a ReinterpretView. I think we can return a ReinterpretView in more cases by specifying proper strides in the new_layout.
I think we should extend SqueezeView.create to accept a specific dimension, and use SqueezeView.create instead of View.create below.
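The idea behind squeezing a specific dimension can be sketched in a few lines (an illustration with a hypothetical helper, not the SqueezeView implementation): dropping a size-1 dim is a pure metadata change that keeps the remaining strides intact, so the result always reinterprets the same storage.

```python
def squeeze_dim(shape, strides, dim):
    """Drop one size-1 dimension from a (shape, strides) pair.

    Hypothetical sketch: removing a size-1 dim changes only metadata,
    never the underlying storage, so no copy is ever needed.
    """
    assert shape[dim] == 1, "can only squeeze a size-1 dimension"
    new_shape = shape[:dim] + shape[dim + 1:]
    new_strides = strides[:dim] + strides[dim + 1:]
    return new_shape, new_strides

# Squeezing the batch dim of a transposed (1, 8, 4) operand keeps the
# transposed 2-D strides [1, 8] intact instead of forcing a copy:
print(squeeze_dim([1, 8, 4], [32, 1, 8], 0))  # ([8, 4], [1, 8])
```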
I agree with @ngimel here: we should be able to map bmm to mm in the transposed case. It might be a bit annoying to do because not all of the view types work with extern kernels without adding a copy.

Accepting because this fixes a bug, and converting bmm to mm is a minor optimization that is likely OK to miss.
Here's what I had in mind: #548. It still allows the bmm -> mm optimization for discontiguous inputs, and won't cause more copies than bmm itself would have caused.