aarch64: Add support for the `fmls` instruction by alexcrichton · Pull Request #5895 · bytecodealliance/wasmtime

alexcrichton · 2023-02-28T22:49:44Z

This commit adds lowerings to the AArch64 backend for the `fmls` instruction, which is intended to be used by the relaxed-simd proposal for WebAssembly. This should hopefully allow slightly more efficient codegen for this operator: a single `fmls` instead of an `fmla` instruction plus a separate negation instruction.
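To make the equivalence concrete, here is a minimal scalar-per-lane sketch in Rust. It is not Cranelift code: the helper names `fma_f32x4`, `neg_f32x4`, and `fmls_f32x4` are hypothetical, and the model assumes `fmls` computes `acc - n * m` per lane with a single rounding, which is how the instruction is generally described. Under that assumption, negating either multiplicand of a fused multiply-add gives the same lanes as one `fmls`.

```rust
// Hypothetical per-lane model; these helpers are illustrative and are not
// part of Cranelift or any Arm intrinsic API.

/// Model of a 4-lane fused multiply-add: x * y + z with one rounding per lane.
fn fma_f32x4(x: [f32; 4], y: [f32; 4], z: [f32; 4]) -> [f32; 4] {
    std::array::from_fn(|i| x[i].mul_add(y[i], z[i]))
}

/// Model of a 4-lane negation.
fn neg_f32x4(x: [f32; 4]) -> [f32; 4] {
    x.map(|v| -v)
}

/// Assumed model of `fmls`: acc - n * m, fused, per lane.
fn fmls_f32x4(acc: [f32; 4], n: [f32; 4], m: [f32; 4]) -> [f32; 4] {
    std::array::from_fn(|i| (-n[i]).mul_add(m[i], acc[i]))
}
```

With this model, `fma_f32x4(neg_f32x4(n), m, acc)` and `fma_f32x4(n, neg_f32x4(m), acc)` both agree with `fmls_f32x4(acc, n, m)`, which is why a negate-then-`fmla` sequence can be folded into one instruction.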

jameysharp

Nice!

jameysharp · 2023-03-02T02:25:39Z

+      (vec_rrr_mod (VecALUModOp.Fmls) z x y (vector_size ty)))
+
+(rule 2 (lower (has_type ty @ (multi_lane _ _) (fma x (fneg y) z)))
+      (vec_rrr_mod (VecALUModOp.Fmls) z x y (vector_size ty)))


I suppose if both x and y are fneg then this can emit fmla instead of fneg+fmls, right? But I guess that's a rewrite we ought to do in the egraph optimizations instead.

Indeed! The x64 rules actually end up implementing that (their structure lets them switch back and forth between the forms), but it wasn't as obvious how to do it here: x64 uses a helper that also manages sinking a load, which adds a fair number of permutations.

I'll send a follow-up which implements the egraph optimization.

This implements comments from bytecodealliance#5895 to cancel out `fneg` operations in `fma` instructions. Additional support for `fmul` is added as well.
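The cancellation can be illustrated with a toy expression rewriter in Rust. This is only a sketch of the idea: Cranelift's real mid-end rules are written in ISLE, and the `Expr` type, `cancel_fneg`, and `eval` below are hypothetical names invented for this example. The rewrite looks only at the top-level node and removes a pair of `fneg`s on the two multiplicands of an `fma` or `fmul`.

```rust
// Toy model of the rewrite; not Cranelift's IR or its ISLE rules.
#[derive(Clone, Debug, PartialEq)]
enum Expr {
    Var(&'static str),
    Fneg(Box<Expr>),
    Fmul(Box<Expr>, Box<Expr>),
    Fma(Box<Expr>, Box<Expr>, Box<Expr>),
}

/// Cancel a pair of negations on the multiplicands of the top-level node:
/// fmul(-x, -y) => fmul(x, y) and fma(-x, -y, z) => fma(x, y, z).
fn cancel_fneg(e: Expr) -> Expr {
    match e {
        Expr::Fmul(a, b) => match (*a, *b) {
            (Expr::Fneg(x), Expr::Fneg(y)) => Expr::Fmul(x, y),
            (a, b) => Expr::Fmul(Box::new(a), Box::new(b)),
        },
        Expr::Fma(a, b, c) => match (*a, *b) {
            (Expr::Fneg(x), Expr::Fneg(y)) => Expr::Fma(x, y, c),
            (a, b) => Expr::Fma(Box::new(a), Box::new(b), c),
        },
        other => other,
    }
}

/// Evaluate an expression against a small variable environment.
fn eval(e: &Expr, env: &[(&str, f32)]) -> f32 {
    match e {
        Expr::Var(name) => env
            .iter()
            .find(|(n, _)| n == name)
            .map(|(_, v)| *v)
            .expect("unbound variable"),
        Expr::Fneg(x) => -eval(x, env),
        Expr::Fmul(a, b) => eval(a, env) * eval(b, env),
        Expr::Fma(a, b, c) => eval(a, env).mul_add(eval(b, env), eval(c, env)),
    }
}
```

An expression with only one negated multiplicand is left unchanged by `cancel_fneg`; that is the case the `fmls` lowering from #5895 is meant to handle.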

alexcrichton mentioned this pull request Feb 28, 2023

Implement the relaxed SIMD proposal #5892

Merged

github-actions Bot added cranelift Issues related to the Cranelift code generator cranelift:area:aarch64 Issues related to AArch64 backend. labels Feb 28, 2023

jameysharp approved these changes Mar 2, 2023

alexcrichton added this pull request to the merge queue Mar 2, 2023

alexcrichton mentioned this pull request Mar 2, 2023

Add egraph optimization for fneg's cancelling out #5910

Merged

Merged via the queue into bytecodealliance:main with commit 9984e95 Mar 2, 2023

alexcrichton deleted the fmls branch March 2, 2023 06:51

alexcrichton added a commit that referenced this pull request Mar 2, 2023

Add egraph optimization for fneg's cancelling out (#5910)

3ff3994

alexcrichton merged 1 commit into bytecodealliance:main from alexcrichton:fmls