x64: Update codegen of `XmmCmove` pseudo-inst by alexcrichton · Pull Request #10839 · bytecodealliance/wasmtime

alexcrichton · 2025-05-27T15:47:59Z

In #4317 this instruction was updated to handle 128-bit vectors in addition to the existing handling of 32/64-bit floats. Originally the pseudo-instruction used `movs{s,d}` to do its job, and when adding 128-bit support I mistakenly switched both the `f32` and `f64` paths to `movsd` instead of conditionally using `movss` for `f32`. In retrospect, though, it's probably best to use a full register move here rather than a scalar move: the register-to-register forms of `movss` and `movsd` preserve the upper bits of the destination register, needlessly creating a data dependency on the previous value in the register.

This commit updates the helper to use `Inst::gen_move`, which already performs this optimization internally by using `movaps`, a documented zero-latency instruction, for all xmm-style register movements.
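To illustrate the dependency argument, here is a minimal sketch that models an XMM register as a `u128`. It is a toy model only, not Cranelift code: the names `Xmm`, `movss_rr`, `movsd_rr`, and `movaps_rr` are hypothetical helpers invented for this example, and it models only the register-to-register forms of the instructions.

```rust
/// Toy model of a 128-bit XMM register (hypothetical, not a Cranelift type).
type Xmm = u128;

/// Register-to-register `movss`: only the low 32 bits come from `src`.
/// The upper 96 bits of `dst` are kept, so the result depends on the old `dst`.
fn movss_rr(dst: Xmm, src: Xmm) -> Xmm {
    const LOW32: u128 = 0xFFFF_FFFF;
    (dst & !LOW32) | (src & LOW32)
}

/// Register-to-register `movsd`: only the low 64 bits come from `src`.
/// The upper 64 bits of `dst` are kept, so the result depends on the old `dst`.
fn movsd_rr(dst: Xmm, src: Xmm) -> Xmm {
    const LOW64: u128 = 0xFFFF_FFFF_FFFF_FFFF;
    (dst & !LOW64) | (src & LOW64)
}

/// `movaps`: all 128 bits come from `src`; the old `dst` is irrelevant.
fn movaps_rr(_dst: Xmm, src: Xmm) -> Xmm {
    src
}

fn main() {
    let old: Xmm = 0xAAAA_AAAA_BBBB_BBBB_CCCC_CCCC_DDDD_DDDD;
    let new: Xmm = 0x1111_1111_2222_2222_3333_3333_4444_4444;

    println!("movss  -> {:#034x}", movss_rr(old, new));
    println!("movsd  -> {:#034x}", movsd_rr(old, new));
    println!("movaps -> {:#034x}", movaps_rr(old, new));
}
```

In this model the low 32 bits after `movsd_rr` match those after `movss_rr`, so an `f32` value is still moved correctly by `movsd`; only `movaps_rr` produces a result that is independent of the destination's previous contents.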

alexcrichton requested review from a team as code owners May 27, 2025 15:48

alexcrichton requested review from cfallin and dicej and removed request for a team May 27, 2025 15:48

alexcrichton mentioned this pull request May 27, 2025

x64: Migrate xmm mov-family instructions to new assembler #10834

Merged

cfallin approved these changes May 27, 2025

cfallin added this pull request to the merge queue May 27, 2025

Merged via the queue into bytecodealliance:main with commit 55c6e16 May 27, 2025
53 checks passed

alexcrichton deleted the x64-new-xmm-cmove branch May 27, 2025 16:49

cfallin merged 1 commit into bytecodealliance:main from alexcrichton:x64-new-xmm-cmove
