Supports Qwen-Image-Edit-2511, Qwen-Image-2512, Z-Image-Turbo and Z-Image-Omni-Base (To Be Released)
- Qwen-Image-Edit-2511 is an enhanced version over Qwen-Image-Edit-2509 with notably better consistency and other multiple improvements.
- Qwen-Image-2512 is an updated text-to-image model with enhanced human realism, finer natural detail and improved text rendering.
- Z-Image is a powerful and highly efficient image generation model with 6B parameters.
- Other fixes.
What's Changed
- init all dit module with device and dtype for speed up by @qzzz95 in #164
- fix wan umt5 state dict converter by @akaitsuki-ii in #170
- ADD update_weights for flux and qwen_image by @qzzz95 in #168
- add progress by @tenderness-git in #171
- ADD Option: OFFLINE fetch modelscope model by @qzzz95 in #172
- supports flash attn 3 fp8 by @akaitsuki-ii in #174
- use utf8 by @tenderness-git in #178
- bug fix by @tenderness-git in #179
- supports flux kontext with multiple input images by @akaitsuki-ii in #173
- Feature/qwen image control by @Glaceon-Hyy in #176
- Feature/qwen edit plus by @Glaceon-Hyy in #180
- fix key convert for lora kohya lora by @qzzz95 in #183
- supports sequence parallel and use custom image size for Qwen Image by @akaitsuki-ii in #186
- convert qwen image diffusers lora key by @qzzz95 in #189
- enable FSDP for qwen vl by @akaitsuki-ii in #184
- torch.compile with dynamic=False by @akaitsuki-ii in #185
- fix compile repeated blocks by @akaitsuki-ii in #191
- fix redux multiple ref images by @qzzz95 in #192
- define qwen image edit system prompt by @qzzz95 in #194
- Fix Wan2.2 low noise model load LoRA bug by @continue-revolution in #188
- fix mask dtype differ from latent by @qzzz95 in #195
- load encoder optional by @qzzz95 in #196
- video sparse attention by @akaitsuki-ii in #190
- Fix/qwen image by @akaitsuki-ii in #197
- Fix/import vsa by @akaitsuki-ii in #200
- Enable aiter attention for rocm by @guangzlu in #198
- auto enable vsa by @akaitsuki-ii in #203
- support svd quant by @Glaceon-Hyy in #202
- Fix circular dependence by @qzzz95 in #205
- suppor lora loading from state dict by @qzzz95 in #206
- set module device to skip weight init by @qzzz95 in #207
- fix svd init memory by @Glaceon-Hyy in #208
- support edit 2511 by @Glaceon-Hyy in #212
- 修复 qwen edit 2511 序列并行报错 & 修复 timesteps 对不齐问题 & 修复 image resize 算法对不齐 by @qzzz95 in #214
- support z image by @Glaceon-Hyy in #213
- add edit 2511 example by @qzzz95 in #215
- Fix Z Image model default dtype by @qzzz95 in #216
- Support diffusers and diffsynth studio lora by @qzzz95 in #217
- add WanDMDPipeline by @akaitsuki-ii in #219
- add FlashAtten 4 API by @bingchenlll in #218
- safety check module available by @qzzz95 in #225
- support Z-Image-Omni-Base by @Artiprocher in #226
New Contributors
- @guangzlu made their first contribution in #198
- @bingchenlll made their first contribution in #218
Full Changelog: v0.6.0...v0.7.0