Commit 46e69c9
Apply real-model insights: recommendation engine + README rankings
Updated tq_recommend_strategy() based on real Qwen3.5-0.8B A/B findings:
- uniform_4b as default (cosine 0.994, community validated)
- mixed_4b8 for large head_dim with outliers (cosine 0.994)
- uniform_2b for max compression (cosine 0.953 — A grade on real data)
- QJL/PolarQuant deprioritized (uniform is better at same bits)
README quantization types table now ranked by real model results:
1. uniform_4b (A+, 7.5x) — default production
2. mixed_4b8 (A+, 6.4x) — outlier-heavy models
3. uniform_2b (A, 14.2x) — surprisingly good
4-6. turbo/polar/qjl — research only
Added recommended configurations block (en + ko)
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent 2daef20 commit 46e69c9
15 files changed
Lines changed: 93 additions & 23 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
166 | 166 | | |
167 | 167 | | |
168 | 168 | | |
169 | | - | |
170 | | - | |
171 | | - | |
172 | | - | |
173 | | - | |
174 | | - | |
175 | | - | |
176 | | - | |
| 169 | + | |
| 170 | + | |
| 171 | + | |
| 172 | + | |
| 173 | + | |
| 174 | + | |
| 175 | + | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
177 | 189 | | |
178 | 190 | | |
179 | 191 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
172 | 172 | | |
173 | 173 | | |
174 | 174 | | |
175 | | - | |
176 | | - | |
177 | | - | |
178 | | - | |
179 | | - | |
180 | | - | |
181 | | - | |
182 | | - | |
183 | | - | |
184 | | - | |
| 175 | + | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
185 | 195 | | |
186 | 196 | | |
187 | 197 | | |
| |||
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
0 commit comments