What is currently happening (cause of the problem)

You get:

CUDA error: no kernel image is available for execution on the device

This means: the installed PyTorch build contains no CUDA kernels compiled for sm_120 (the Blackwell architecture of the RTX 50 series), so nothing can actually run on your GPU. This isn't a fault of your installation alone; it's how the PyTorch binaries are built. As of now, many official PyTorch wheels for Windows do not yet fully support sm_120, especially the stable releases that WebUI installs by default. That's why even a simple torch.randn(…, device="cuda") fails with the same error. So it's not a WebUI bug per se, but a compatibility issue between PyTorch, CUDA, and the RTX 50 series.

Possible ways it can work
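One quick way to see the mismatch for yourself is to ask PyTorch which architectures its CUDA kernels were actually compiled for. This is a minimal sketch of my own (the helper name is made up; it returns None if torch isn't importable at all):

```python
def compiled_cuda_arches():
    """Return the list of CUDA architectures the installed torch wheel
    ships kernels for, or None if torch is not installed at all."""
    try:
        import torch
    except ImportError:
        return None
    # Returns an empty list on CPU-only builds; on a wheel that supports
    # Blackwell you would expect an entry like 'sm_120' in this list.
    return torch.cuda.get_arch_list()

if __name__ == "__main__":
    print(compiled_cuda_arches())
```

If 'sm_120' is missing from that list, every kernel launch on an RTX 50-series card will fail with exactly the "no kernel image" error above.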
Many users report that the Nightly builds have better sm_120 support: the nightly binaries can contain sm_120 kernels, so CUDA operations can run.

Disadvantages: the open-source issue tracker shows sm_120 support is still a work in progress. There is no fixed version in which it lands, but support is gradually being integrated, especially in the newer CUDA 12.8/12.9 binaries.

Concrete steps for testing (recommended):
1. Activate the WebUI venv
2. Completely remove all torch packages: pip uninstall -y torch torchvision torchaudio
3. Install the Nightly binaries (cu128)
4. PyTorch GPU test: python - <<'EOF'
5. Start WebUI

If the test shows that torch.cuda.is_available() is True and sm_120 is detected, GPU inference should be possible.

I hope I could help you... good luck.
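The GPU test in step 4 might look something like the following. This is only a sketch under my own assumptions (the function name is invented, and it degrades gracefully when torch or CUDA is absent rather than crashing):

```python
def cuda_smoke_test():
    """Collect diagnostic lines about the local torch/CUDA setup.
    The tensor allocation at the end is the call that raises
    'no kernel image is available' on builds without sm_120 kernels."""
    try:
        import torch
    except ImportError:
        return ["torch is not installed"]
    lines = [f"torch {torch.__version__}",
             f"cuda available: {torch.cuda.is_available()}"]
    if torch.cuda.is_available():
        major, minor = torch.cuda.get_device_capability(0)
        lines.append(f"device: {torch.cuda.get_device_name(0)} (sm_{major}{minor})")
        # Force a real kernel launch, not just a driver query:
        torch.randn(16, device="cuda").sum().item()
        lines.append("kernel launch: OK")
    return lines

if __name__ == "__main__":
    print("\n".join(cuda_smoke_test()))
```

If this prints cuda available: True, an sm_120 device line, and kernel launch: OK without raising, WebUI's GPU inference should work as well.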
I’m having trouble getting AUTOMATIC1111 Stable Diffusion WebUI to run on my GPU.
CPU mode works fine and can generate images, but any attempt to use CUDA on my RTX 5070 fails, including model loading.
I’m trying to figure out whether I’m using the wrong WebUI branch, or there is a DEV / experimental build that actually works on RTX 50-series (sm_120), or this is a current limitation of Windows PyTorch builds.
What works
What does NOT work
CUDA error: no kernel image is available for execution on the device

This happens even with:
I’ve confirmed this is not just an A1111 issue — even a basic test like:
torch.randn(16, device="cuda")

fails with the same error.
About the “DEV mode” WebUI
I tried using a WebUI fork/branch that advertises DEV mode / RTX 50-series support, but:
If there is a working DEV branch, a fork that already supports sm_120, or a known-good commit / release
I’d really appreciate a link.
System Specs
GPU: NVIDIA RTX 5070 (12GB, sm_120)
CPU: AMD Ryzen 9 7900X (12-core)
OS: Windows 11
Driver: NVIDIA Studio Driver 591.74
Driver CUDA version: 13.1 (per nvidia-smi; note this is the driver's supported CUDA runtime version, not the GPU's compute capability, which is sm_120)
CUDA Toolkit: 12.8 installed
Python: 3.10.6
PyTorch tested: Stable cu128 and Nightly cu128 (torch 2.11 dev)
What I’m trying to find out
Is there a fully working WebUI version or fork that supports RTX 50-series on Windows?
Is this a known limitation of Windows PyTorch binaries for sm_120?
Are people with RTX 50-series currently using: a specific fork, WSL2/Linux, or DirectML instead?
If I’m missing any important system info, please let me know and I’ll add it.
Thanks in advance — I’m happy to test things and report back.