Commit 4248327
Address PR review comments
- CLIP scoring: use CPU to avoid OOM with WAN pipeline on GPU; catch
RuntimeError in addition to OSError
- --skip-threshold help: fix description to match actual exp(tile_max -
running_max) < lambda criterion
- vLLM worker: reject unsupported sparse presets (non-triton backend or
unknown method) with a clear ValueError instead of silently degrading
to dense attention
- PYTHONPATH construction: use os.pathsep and skip empty entries to
avoid CWD injection when PYTHONPATH is unset
- diffusers_triton backend: raise ValueError when mixed with other
backends instead of silently skipping _attn_implementation setup
- _wan_forward_triton: fall back to SDPA when attention_mask is not
None to preserve masking semantics
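The PYTHONPATH item above can be illustrated with a minimal sketch. It is not the code from this commit: `build_pythonpath` and its parameters are hypothetical names chosen for illustration. The point is that an empty entry in PYTHONPATH is treated as the current working directory, so joining a new path onto an unset variable with a hard-coded separator leaves a stray empty entry.

```python
import os


def build_pythonpath(extra_dirs, existing=None):
    """Return a PYTHONPATH value with extra_dirs placed before the existing entries.

    Hypothetical helper: the name and signature are assumptions, not the
    function changed in this commit.
    """
    if existing is None:
        existing = os.environ.get("PYTHONPATH", "")
    # Split on the platform separator (":" on POSIX, ";" on Windows).
    parts = list(extra_dirs) + existing.split(os.pathsep)
    # Drop empty entries: an empty PYTHONPATH entry resolves to the CWD.
    return os.pathsep.join(p for p in parts if p)


if __name__ == "__main__":
    # With PYTHONPATH unset or empty, no trailing separator is produced.
    print(build_pythonpath(["/opt/project"], existing=""))
```

Filtering after the split also cleans up empty entries already present in the inherited value, such as a leading or doubled separator.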
Signed-off-by: Ye Yu <yeyu@nvidia.com>
1 parent 8996ef1 commit 4248327
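The skip criterion quoted in the `--skip-threshold` item, exp(tile_max - running_max) < lambda, can be sketched in plain Python. This only illustrates the inequality as stated in the commit message; `should_skip_tile` and its argument names are hypothetical, and how the actual kernel tracks the running maximum is not shown in the source.

```python
import math


def should_skip_tile(tile_max, running_max, threshold):
    """Return True when exp(tile_max - running_max) < threshold.

    Hypothetical helper. `threshold` stands in for lambda in the commit
    message (`lambda` is a reserved word in Python).
    """
    return math.exp(tile_max - running_max) < threshold


if __name__ == "__main__":
    # A tile whose largest score is far below the running maximum is skipped.
    print(should_skip_tile(tile_max=-10.0, running_max=0.0, threshold=1e-3))
    # A tile that matches the running maximum is kept.
    print(should_skip_tile(tile_max=0.0, running_max=0.0, threshold=1e-3))
```

Under this reading, a tile whose maximum equals or exceeds the running maximum gives a ratio of at least 1, so it is never skipped for any threshold of 1 or less.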
5 files changed
Lines changed: 37 additions & 7 deletions
File tree
- examples
- diffusers/quantization
- vllm_serve
- modelopt/torch/sparsity/attention_sparsity
- plugins
Per-file changes (file names and diff contents were not captured; line numbers refer to the original files):

| File | Changed original lines | Additions | Deletions |
|---|---|---|---|
| 1 | 797-798, 833, 837, 840, 850 | 7 | 6 |
| 2 | inserted after 145 | 10 | 0 |
| 3 | 75 | 3 | 1 |
| 4 | inserted after 65 | 5 | 0 |
| 5 | inserted after 210 | 12 | 0 |