Conversation
|
Hi eruanno123, I am also intereted by ERNIE. I updated krita-ai-diffusion from Github. |
|
@hansnolte I tried the NVFP4 variant and it seems to work well on my side. In your setup:
|
|
Hi eruanno123,
I have also ernie-image-turbo-Q6_K.gguf copied to the diffusion_models Folder. The GGUF give me this error. Server error: mat1 and mat2 shapes cannot be multiplied (512x4096 and 3072x4096) |
|
Another thing to check (this is how I verified the tooling nodes work): It should return something like this: I quickly checked the GGUF variants, and they are not detected on my side. After taking a closer look, I think the tooling nodes need further updates - there is an EDIT: it is actually detected, I was checking wrong endpoint (should be unet_gguf), but it is discovered as "WAN" model, which confirms my conclusion that |
|
I don't have time to try the model deeply, from what I can see it doesn't immediately seem like a (significant) improvement on Flux2/ZIT. But I think we can add basic support as sketched here. For now, please remove the changes to |
|
Hi eruanno123, I have just managed to get both variants to run in ComfyUI. Again, thank you for your help |
|
@hansnolte You’re welcome. I’m getting the impression that base model is way easier to train, which is why I integrated it into my local workflow. It’s still not very clear to me what the proper way to train LoRAs for the ZIB model is. @Acly, thanks for taking the time. I've reverted the changes you mentioned, aligned the “E” more precisely, and fixed the failing unit test. |
|
Could you do a rebase please? |
7fae2c9 to
b831db2
Compare
b831db2 to
2b45074
Compare

Adds support for ERNIE Image by Baidu - a diffusion model using a Ministral-3B text encoder and the Flux 2 VAE.
@Acly, I am sharing this PR as is, based on what I quickly developed for my needs (as I was a bit impatient to wait :D). If it passes initial review, I can spend some time polishing it (items mentioned in Todos).
Model files:
Architecture highlights:
Requires a patch to comfyui-tooling-nodes to report base_model: "ernie-image" for ERNIE diffusion models (currently returns "unknown"): Acly/comfyui-tooling-nodes#63
Preview: