Skip to content

[amd] MI350 tests are failing #1036

@xuzhao9

Description

@xuzhao9

Last good commit: f7c1d69401e9f09050451f30776562954b05e850
Pytorch: 2.12.0.dev20260412+rocm7.2
Last good workflow: https://github.com/meta-pytorch/tritonbench/actions/runs/24361986643/job/71143820345

First bad commit: f7c1d69401e9f09050451f30776562954b05e850
Pytorch: 2.12.0.dev20260412+rocm7.2
First bad workflow: https://github.com/meta-pytorch/tritonbench/actions/runs/24363474649/job/71148992904

Plan to run a bisect to pinpoint the rest of the failures.

Tring to bisect on test_gpu_tritonbench_grouped_gemm:

Reproduce command:

TORCHINDUCTOR_COMPILE_THREADS=1 python -m unittest test.test_gpu.main -k test_gpu_tritonbench_grouped_gemm

Last good Triton commit: f7c1d69401e9f09050451f30776562954b05e850
First bad Triton commit: ce5391b2a49839a5f8f018f4ff9941559e3ce624

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions