[TOPI] Add support for grouped conv3d #9873
Conversation
src/arith/rewrite_simplify.cc
if ((floordiv(x, y)).Match(ret) && analyzer_->CanProve(x.Eval() < y.Eval())) {
  return 0;
}
I'm not sure what this is for?
Maybe we need to add a pass test for this.
This is needed so that the simplifier here: https://github.com/apache/tvm/pull/9873/files#diff-04a1aab966320b6f63c390cc8b79f79b543a1a7a3f560ea3a97d18c2bb041a0aR837-R839 will remove the division by groups. Without this, simplification gets stuck when reducing ff // (num_filter // groups) * (in_channel // groups) + rc to ff // num_filter * in_channel + rc: it can't prove that ff // num_filter = 0.
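To make the stuck simplification concrete, here is a plain-Python sketch (not TVM code; the names num_filter, in_channel, groups, ff, rc are taken from the index expression above) of the two facts the new rewrite rule provides: floordiv(x, y) == 0 whenever 0 <= x < y, which is exactly what lets the groups == 1 case collapse to a plain channel offset.

```python
num_filter, in_channel = 8, 6

for ff in range(num_filter):  # output-filter index, so 0 <= ff < num_filter
    # New rewrite rule: floordiv(x, y) == 0 whenever 0 <= x < y.
    assert ff // num_filter == 0

    # With groups == 1 the grouped index expression
    #   ff // (num_filter // groups) * (in_channel // groups) + rc
    # becomes ff // num_filter * in_channel + rc, and the rule above lets
    # the simplifier reduce it all the way down to rc.
    groups = 1
    for rc in range(in_channel):
        idx = ff // (num_filter // groups) * (in_channel // groups) + rc
        assert idx == rc
```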
Yeah, I also noticed this implicit downcasting before, and I'm in favor of this change. Users can always explicitly cast their inputs to fp16. The original code was added before we had the fp16 conversion pass. This may break existing flows if some users depend on this behavior, which is also the default in cudnn/cublas. Personally I don't mind it, but if that's concerning we can always add a workaround to allow the old behavior.
Change conv3d to use the generic conv implementation, which supports grouped convolutions. Also, remove support for non-float16 tensorcore operations as they cause a large degradation in accuracy. The generic conv implementation now supports the auto-scheduler.
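For reference, a naive NumPy sketch of the grouped conv3d semantics this PR adds (NCDHW layout, stride 1, no padding or dilation). This is an illustrative reference implementation, not the TVM TOPI code; the function name and layout assumptions are mine.

```python
import numpy as np

def group_conv3d_ncdhw(data, weight, groups):
    """Naive grouped 3D convolution, NCDHW layout, stride 1, no padding.

    data:   (N, C, D, H, W)
    weight: (O, C // groups, KD, KH, KW)
    """
    n, c, d, h, w = data.shape
    o, cg, kd, kh, kw = weight.shape
    assert c % groups == 0 and o % groups == 0 and cg == c // groups
    od, oh, ow = d - kd + 1, h - kh + 1, w - kw + 1
    out = np.zeros((n, o, od, oh, ow), dtype=data.dtype)
    for ff in range(o):
        # Each output channel reads only its own group's input channels --
        # the same index arithmetic discussed in the review thread above.
        g = ff // (o // groups)
        base = g * cg
        for z in range(od):
            for y in range(oh):
                for x in range(ow):
                    out[:, ff, z, y, x] = np.sum(
                        data[:, base:base + cg, z:z + kd, y:y + kh, x:x + kw]
                        * weight[ff],
                        axis=(1, 2, 3, 4),
                    )
    return out
```

With groups=1 this reduces to an ordinary conv3d, and a grouped call is equivalent to running each group's channel slice through a plain conv independently.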
@masahi @mbrookhart I've finally got this branch green. Could you re-review?
mbrookhart left a comment
LGTM. I would love to align on the tensorcore issues across the ops, if any still remain, but that can be a second PR.
* [TOPI] Add support for grouped conv3d

  Change conv3d to use the generic conv implementation, which supports grouped convolutions. Also, remove support for non-float16 tensorcore operations as they cause a large degradation in accuracy. The generic conv implementation now supports the auto-scheduler.

* correct none check
* add tests for floordiv simplification
* fixed incorrect test for autoscheduler
* formatting
* add groups to winograd
* fix tensorcore
* manually simplify index instead of relying on simplifier
* formatting
* add groups argument to conv3d_ncdhw_winograd_without_weight_transform
* formatting
@mbrookhart @jwfromm