Further Reduce LTX VAE decode peak RAM usage #13052
comfyanonymous merged 1 commit into Comfy-Org:master
Conversation
📝 Walkthrough
The changes introduce a buffer-based decoding optimization to the video VAE pipeline. The Decoder class now supports preallocation of output buffers through a new `output_buffer` argument to `decode()`.
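The buffer-writing idea in the walkthrough can be sketched as follows. This is a minimal illustration, not the PR's actual code: `decode_chunks_into` and its argument names are hypothetical, and it assumes chunks are concatenated along the time dimension (dim 2).

```python
import torch

def decode_chunks_into(chunks, decode_chunk, output_buffer):
    """Write each decoded chunk straight into a preallocated output tensor.

    decode_chunk maps a latent chunk to pixels of shape (B, C, t_i, H, W);
    chunk i fills frames [t, t + t_i) of output_buffer along dim 2, so no
    per-chunk results are retained and no full-output torch.cat is needed.
    """
    t = 0
    for chunk in chunks:
        pixels = decode_chunk(chunk)
        n = pixels.shape[2]
        output_buffer[:, :, t:t + n].copy_(pixels)  # in-place write
        t += n
    return output_buffer
```

Compared with collecting chunks in a list and calling `torch.cat`, peak memory drops from roughly twice the output size to the output size itself.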
Actionable comments posted: 1
Inline comments:
In `comfy/sd.py`:
- Around lines 956-964: the code assumes that if `first_stage_model` has a `decode_output_shape` attribute, then `first_stage_model.decode` also accepts an `output_buffer` kwarg; when it does not, the call raises a `TypeError`. Verify the `decode()` signature (e.g., via `inspect.signature` or a safe trial call) before setting `preallocated` to `True` and passing `output_buffer` to `first_stage_model.decode`. If `decode()` does not accept `output_buffer`, fall back to the safe copy path: call `decode()` without `output_buffer` and copy the result into `pixel_samples`, so that `first_stage_model.decode_output_shape`, `first_stage_model.decode`, `pixel_samples`, `preallocated`, and `vae_options` are handled compatibly.
📒 Files selected for processing (2)
comfy/ldm/lightricks/vae/causal_video_autoencoder.py
comfy/sd.py
Further reduces LTX2 VAE peak RAM to the level of the output tensor.

- The VAE decoder now writes decoded chunks directly into a pre-allocated output buffer, eliminating intermediate allocations and the full-output torch.cat.
- unpatchify runs per-chunk on the GPU instead of on the full output on the CPU.
- When the VAE supports decode_output_shape, the caller passes its output buffer directly to the decoder, eliminating the intermediate bf16 buffer entirely.
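The caller-side flow described above might look roughly like this. `decode_with_prealloc` is a hypothetical wrapper, and the exact signature of `decode_output_shape` (here, latent shape in, output shape out) is an assumption, not the PR's actual API.

```python
import torch

def decode_with_prealloc(vae, latents, vae_options=None):
    """Allocate the final output once and let decode() fill it in place,
    when the VAE advertises its decoded output shape."""
    vae_options = vae_options or {}
    shape_fn = getattr(vae, "decode_output_shape", None)
    if shape_fn is not None:
        out_shape = shape_fn(latents.shape)
        # One allocation at the final dtype: no intermediate bf16 buffer,
        # no full-output torch.cat inside decode().
        pixel_samples = torch.empty(out_shape, dtype=torch.float32)
        vae.decode(latents, output_buffer=pixel_samples, **vae_options)
    else:
        # Older VAEs: plain decode, which returns a freshly allocated tensor.
        pixel_samples = vae.decode(latents, **vae_options)
    return pixel_samples
```

Peak RAM is then bounded by the output buffer itself plus one decoded chunk, rather than by two full copies of the output.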