You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[Lens](#lens) | MMDiT | 2026 | Microsoft | 3.8B | Minimal | Modern, Great Quality, lightweight |
23
24
24
25
Old or bad options also tracked listed via [Obscure Model Support](/docs/Obscure%20Model%20Support.md):
25
26
@@ -618,6 +619,22 @@ For upscaling with SD3, the `Refiner Do Tiling` parameter is highly recommended
618
619
-**Dev Lora:**
619
620
- A dev lora can be downloaded here [Kijai/hidream-O1-image_comfy](<https://huggingface.co/Kijai/hidream-O1-image_comfy/resolve/main/loras/hidream_o1_dev_lora_rank_64_bf16_pruned_v1.safetensors>). It allows use of the base model with the distilled behavior from the Dev model. 8 steps will generate a coherent image of lower quality, 16 steps seems closer to original quality. Use CFG Scale 1.
620
621
622
+
# Lens
623
+
624
+
- Microsoft's [Lens](<https://huggingface.co/microsoft/Lens>) is supported in SwarmUI!
625
+
- It is a 3.8B MMDiT model, with a base model and an official turbo distill designed to run fast.
626
+
- The "Turbo" model (in fat BF16) can be downloaded here [Comfy-Org/Lens - turbo](<https://huggingface.co/Comfy-Org/Lens/resolve/main/split_files/diffusion_models/lens_turbo_bf16.safetensors?download=true>)
627
+
- Or the base version (in fat BF16) [Comfy-Org/Lens - base](<https://huggingface.co/Comfy-Org/Lens/resolve/main/split_files/diffusion_models/lens_bf16.safetensors?download=true>)
628
+
- Save in `diffusion_models`
629
+
- Uses the Flux.2 VAE, will be downloaded and handled automatically
630
+
- Uses the GPT-OSS 20B text encoder, will be downloaded and handled automatically
631
+
-**Parameters:**
632
+
-**Sampler:** Default is fine.
633
+
-**Scheduler:** Default is fine.
634
+
-**CFG Scale:** For Turbo, `1`, for base normal CFG ranges (around `5`)
635
+
-**Steps:** For Turbo, `4` is recommended, `8` works well. For Base, `20` as normal.
636
+
-**Resolution:** Side length `1440` is the standard.
637
+
621
638
# Video Models
622
639
623
640
- Video models are documented in [Video Model Support](/docs/Video%20Model%20Support.md).
elseif(IsFlux()&&(LoadingClipisnull||LoadingVAEisnull||UserInput.Get(T2IParamTypes.T5XXLModel)is not null||UserInput.Get(T2IParamTypes.ClipLModel)is not null))
TorchCompile=Register<string>(new("Torch Compile","Torch.Compile is a way to dynamically accelerate AI models.\nIt wastes a bit of time (around a minute) on the first call compiling a graph of the generation, and then all subsequent generations run faster thanks to the compiled graph.\nTorch.Compile depends on Triton, which is difficult to install on Windows, easier on Linux.",
0 commit comments