Release v0.7.0: supports multiple new image models · modelscope/DiffSynth-Engine

Supports Qwen-Image-Edit-2511, Qwen-Image-2512, Z-Image-Turbo and Z-Image-Omni-Base (To Be Released)

Qwen-Image-Edit-2511 is an enhanced version over Qwen-Image-Edit-2509 with notably better consistency and other multiple improvements.
Qwen-Image-2512 is an updated text-to-image model with enhanced human realism, finer natural detail and improved text rendering.
Z-Image is a powerful and highly efficient image generation model with 6B parameters.
Other fixes.

What's Changed

init all dit module with device and dtype for speed up by @qzzz95 in #164
fix wan umt5 state dict converter by @akaitsuki-ii in #170
ADD update_weights for flux and qwen_image by @qzzz95 in #168
add progress by @tenderness-git in #171
ADD Option: OFFLINE fetch modelscope model by @qzzz95 in #172
supports flash attn 3 fp8 by @akaitsuki-ii in #174
use utf8 by @tenderness-git in #178
bug fix by @tenderness-git in #179
supports flux kontext with multiple input images by @akaitsuki-ii in #173
Feature/qwen image control by @Glaceon-Hyy in #176
Feature/qwen edit plus by @Glaceon-Hyy in #180
fix key convert for lora kohya lora by @qzzz95 in #183
supports sequence parallel and use custom image size for Qwen Image by @akaitsuki-ii in #186
convert qwen image diffusers lora key by @qzzz95 in #189
enable FSDP for qwen vl by @akaitsuki-ii in #184
torch.compile with dynamic=False by @akaitsuki-ii in #185
fix compile repeated blocks by @akaitsuki-ii in #191
fix redux multiple ref images by @qzzz95 in #192
define qwen image edit system prompt by @qzzz95 in #194
Fix Wan2.2 low noise model load LoRA bug by @continue-revolution in #188
fix mask dtype differ from latent by @qzzz95 in #195
load encoder optional by @qzzz95 in #196
video sparse attention by @akaitsuki-ii in #190
Fix/qwen image by @akaitsuki-ii in #197
Fix/import vsa by @akaitsuki-ii in #200
Enable aiter attention for rocm by @guangzlu in #198
auto enable vsa by @akaitsuki-ii in #203
support svd quant by @Glaceon-Hyy in #202
Fix circular dependence by @qzzz95 in #205
suppor lora loading from state dict by @qzzz95 in #206
set module device to skip weight init by @qzzz95 in #207
fix svd init memory by @Glaceon-Hyy in #208
support edit 2511 by @Glaceon-Hyy in #212
修复 qwen edit 2511 序列并行报错 & 修复 timesteps 对不齐问题 & 修复 image resize 算法对不齐 by @qzzz95 in #214
support z image by @Glaceon-Hyy in #213
add edit 2511 example by @qzzz95 in #215
Fix Z Image model default dtype by @qzzz95 in #216
Support diffusers and diffsynth studio lora by @qzzz95 in #217
add WanDMDPipeline by @akaitsuki-ii in #219
add FlashAtten 4 API by @bingchenlll in #218
safety check module available by @qzzz95 in #225
support Z-Image-Omni-Base by @Artiprocher in #226

New Contributors

@guangzlu made their first contribution in #198
@bingchenlll made their first contribution in #218

Full Changelog: v0.6.0...v0.7.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.7.0: supports multiple new image models

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Supports Qwen-Image-Edit-2511, Qwen-Image-2512, Z-Image-Turbo and Z-Image-Omni-Base (To Be Released)

Contributors

Uh oh!