[WIP,Cute,Flex,Sm100] vectorized mask mod application #2261

Draft

reubenconducts wants to merge 1 commit into Dao-AILab:main from reubenconducts:rstern/mask-vec

Conversation

Contributor

reubenconducts commented Feb 17, 2026

Follow-up to #2236. The vectorization approach has two parts:

  • Vectorize mask application, to compile down to r2p
  • Vectorize mask evaluation

The latter is important, for example, when mask_mod depends on aux_tensors that are contiguous in the kv index, or when the aux_tensors don't depend on the kv index at all.

mask_mods still emit TensorSSAs, but they need not be single values. These are treated as bit-packed masks.
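As a rough illustration of the bit-packed representation (a hypothetical Python sketch, not the PR's actual CuTe code), each column's boolean mask value becomes one bit of a Uint32 word:

```python
def pack_mask_bits(mask_bools):
    """Pack per-column booleans into 32-bit words, LSB-first.

    Hypothetical sketch: names and layout are illustrative only.
    """
    words = [0] * ((len(mask_bools) + 31) // 32)
    for i, keep in enumerate(mask_bools):
        if keep:
            words[i // 32] |= 1 << (i % 32)
    return words


def mask_bit(words, i):
    """Read back the mask bit for column i."""
    return (words[i // 32] >> (i % 32)) & 1
```

The packed words play the role of the multi-valued TensorSSA described above: one Uint32 covers 32 columns instead of one predicate per column.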

cc @drisspg

Comment thread on flash_attn/cute/mask.py:

    # 2: application, where it is applied to compile down to r2p
    #
    # evaluation
    num_mask_vals = (ncol + 32 - 1) // 32
Collaborator


I think we need an R2P-width global constant; this is where the 32 comes from, right?

Collaborator


Or I guess it's vecsize?

Contributor Author


The 32 does come from the R2P width, yes.

Contributor Author


Actually, sorry, that's not the case: it was just my choice to keep the bitmask in Uint32s.

Collaborator

@drisspg left a comment


Couple of clarifying questions, but this looks good. I just put up an autotuning PR: pytorch/pytorch#176055

It helps a lot in some cases.
