Compare commits

2 Commits

Author SHA1 Message Date
Ettore Di Giacinto
11c5fd677d fix(gallery/ltx-2.3): add vae_decode_only:false for i2v / flf2v
LTX-2.3 i2v inference fails inside generate_video with:

  [ERROR] LTXAV image conditioning requires VAE encoder weights;
  create the context with vae_decode_only=false

Without vae_decode_only:false in the options block, gosd.cpp creates
the sd_ctx with VAE encoder weights freed, so latent encoding of the
init_image is impossible. Adding the option mirrors what we already
do for Wan i2v entries.

Affects all six LTX-2.3 entries (dev/distilled × UD-Q4_K_M, Q4_K_M,
Q8_0). T2V wasn't affected by the missing option since it has no
init image to encode, which is why the earlier T2V smoke test passed.

Assisted-by: Claude:claude-opus-4-7
2026-05-25 19:33:55 +00:00
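
The effect of the missing option can be illustrated with a minimal sketch. It assumes that options are plain strings, that `key:value` entries split on the first colon, and that the VAE is decode-only when `vae_decode_only` is absent, as the commit message describes. The helper names `parse_options` and `can_encode_init_image` are hypothetical and are not part of gosd.cpp or sd.cpp.

```python
def parse_options(options):
    """Turn a gallery-style options list into a dict.

    Assumption: 'key:value' entries split on the first colon,
    and bare flags such as 'diffusion_model' map to True.
    """
    parsed = {}
    for opt in options:
        key, sep, value = opt.partition(":")
        parsed[key.strip()] = value.strip() if sep else True
    return parsed


def can_encode_init_image(options):
    """Return True only if the options explicitly keep the VAE encoder.

    Assumption: decode-only is the default when the option is absent,
    which is the behaviour described in the commit message.
    """
    parsed = parse_options(options)
    return str(parsed.get("vae_decode_only", "true")).lower() == "false"


# Options before and after the fix (abbreviated for illustration).
before = ["diffusion_model", "vae_path:ltx-2.3-22b-dev_video_vae.safetensors"]
after = before + ["vae_decode_only:false"]

print(can_encode_init_image(before))
print(can_encode_init_image(after))
```

Under these assumptions the first list models the failing i2v configuration and the second models the fixed one; T2V never reaches the encoding step, so it works with either.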
Ettore Di Giacinto
b3300ef207 fix(gallery/ltx-2.3): add diffusion_model flag to all variants
LTX-2.3 entries (dev / distilled, UD-Q4_K_M / Q4_K_M / Q8_0) were
missing the `diffusion_model` option in their overrides. Without it,
gosd.cpp routes the main GGUF through the regular `model_path` code
path in sd.cpp, which doesn't apply the `model.diffusion_model.` tensor
prefix. sd.cpp's LTX-2.3 architecture detection (`VERSION_LTXAV`) in
get_sd_version checks for prefixed tensor names — without the prefix,
detection fails and load_model returns "could not load model".

This is the same bug we hit for Wan when the option was missing.
Adding `- diffusion_model` to all six LTX-2.3 entries' option blocks
makes load_model take the diffusion_model_path branch so detection
succeeds.

Assisted-by: Claude:claude-opus-4-7
2026-05-25 16:49:37 +00:00
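
Since both commits fix the same kind of omission across six entries, a small lint could catch it earlier. The following is only a sketch: it assumes each entry's options are already available as a list of strings, and the names `REQUIRED_OPTIONS` and `missing_options` are hypothetical rather than anything in the repository.

```python
# Options that, per the two commits above, every LTX-2.3 entry needs.
REQUIRED_OPTIONS = ("diffusion_model", "vae_decode_only:false")


def missing_options(entry_options, required=REQUIRED_OPTIONS):
    """Return the required options absent from one entry's options list."""
    present = set(entry_options)
    return [opt for opt in required if opt not in present]


# Hypothetical in-memory view of two gallery entries, keyed by model file.
entries = {
    "ltx-2.3-22b-dev-Q8_0.gguf": [
        "diffusion_model",
        "vae_decode_only:false",
        "llm_path:gemma-3-12b-it-qat-UD-Q4_K_XL.gguf",
    ],
    "ltx-2.3-22b-distilled-Q8_0.gguf": [
        "llm_path:gemma-3-12b-it-qat-UD-Q4_K_XL.gguf",
    ],
}

for model, options in entries.items():
    gaps = missing_options(options)
    if gaps:
        print(f"{model}: missing {gaps}")
```

The check compares exact strings, so a differently spelled value would be reported as missing; a stricter lint would parse the key and value separately.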

@@ -30844,6 +30844,8 @@
 parameters:
   model: ltx-2.3-22b-dev-UD-Q4_K_M.gguf
 options:
+  - diffusion_model
+  - "vae_decode_only:false"
   - llm_path:gemma-3-12b-it-qat-UD-Q4_K_XL.gguf
   - vae_path:ltx-2.3-22b-dev_video_vae.safetensors
   - audio_vae_path:ltx-2.3-22b-dev_audio_vae.safetensors
@@ -30875,6 +30877,8 @@
 parameters:
   model: ltx-2.3-22b-dev-Q4_K_M.gguf
 options:
+  - diffusion_model
+  - "vae_decode_only:false"
   - llm_path:gemma-3-12b-it-qat-UD-Q4_K_XL.gguf
   - vae_path:ltx-2.3-22b-dev_video_vae.safetensors
   - audio_vae_path:ltx-2.3-22b-dev_audio_vae.safetensors
@@ -30906,6 +30910,8 @@
 parameters:
   model: ltx-2.3-22b-dev-Q8_0.gguf
 options:
+  - diffusion_model
+  - "vae_decode_only:false"
   - llm_path:gemma-3-12b-it-qat-UD-Q4_K_XL.gguf
   - vae_path:ltx-2.3-22b-dev_video_vae.safetensors
   - audio_vae_path:ltx-2.3-22b-dev_audio_vae.safetensors
@@ -30965,6 +30971,8 @@
 parameters:
   model: ltx-2.3-22b-distilled-UD-Q4_K_M.gguf
 options:
+  - diffusion_model
+  - "vae_decode_only:false"
   - llm_path:gemma-3-12b-it-qat-UD-Q4_K_XL.gguf
   - vae_path:ltx-2.3-22b-distilled_video_vae.safetensors
   - audio_vae_path:ltx-2.3-22b-distilled_audio_vae.safetensors
@@ -30995,6 +31003,8 @@
 parameters:
   model: ltx-2.3-22b-distilled-Q4_K_M.gguf
 options:
+  - diffusion_model
+  - "vae_decode_only:false"
   - llm_path:gemma-3-12b-it-qat-UD-Q4_K_XL.gguf
   - vae_path:ltx-2.3-22b-distilled_video_vae.safetensors
   - audio_vae_path:ltx-2.3-22b-distilled_audio_vae.safetensors
@@ -31025,6 +31035,8 @@
 parameters:
   model: ltx-2.3-22b-distilled-Q8_0.gguf
 options:
+  - diffusion_model
+  - "vae_decode_only:false"
   - llm_path:gemma-3-12b-it-qat-UD-Q4_K_XL.gguf
   - vae_path:ltx-2.3-22b-distilled_video_vae.safetensors
   - audio_vae_path:ltx-2.3-22b-distilled_audio_vae.safetensors