Commit 6b781eb
quantcpp 0.9.2: add Llama-3.2-1B to model registry
Model.from_pretrained("Llama-3.2-1B") auto-downloads the Q4_K_M GGUF
(~750 MB) from hugging-quants on Hugging Face. Response quality is much
better than with the 135M starter model, making it suitable for Reddit
demos and first-impression showcases.
README quick start now defaults to Llama-3.2-1B with SmolLM2 as a
smaller alternative. Also adds quantcpp.available_models() helper.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent a77fbe5 commit 6b781eb
3 files changed
Lines changed: 14 additions & 4 deletions
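The diff body did not survive on this page, so the following is only a minimal sketch of how a name-to-file model registry with an available_models() helper could look. It is not the quantcpp implementation: ModelEntry, _REGISTRY, resolve(), the "SmolLM2-135M" key, and every repo and filename string are hypothetical placeholders. Only the "Llama-3.2-1B" name, the ~750 MB size, and the available_models() name come from the commit message.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ModelEntry:
    """One registry row: where a GGUF file lives and roughly how big it is."""
    repo: str                        # Hugging Face repo id (placeholder below)
    filename: str                    # GGUF file inside that repo (placeholder)
    approx_mb: Optional[int] = None  # rough download size, if known


# Hypothetical registry. Repo and filename values are placeholders,
# not the strings used by quantcpp.
_REGISTRY = {
    "Llama-3.2-1B": ModelEntry("hugging-quants/<repo>", "<file>.gguf", 750),
    "SmolLM2-135M": ModelEntry("<org>/<repo>", "<file>.gguf"),
}


def available_models() -> list:
    """Return the registered model names in sorted order."""
    return sorted(_REGISTRY)


def resolve(name: str) -> ModelEntry:
    """Look up a model by name, listing the valid names on failure."""
    try:
        return _REGISTRY[name]
    except KeyError:
        raise ValueError(
            f"unknown model {name!r}; choose from {available_models()}"
        ) from None
```

Keeping the registry as a plain dict means a new model is a one-line addition, and the error raised for an unknown name can list the valid choices.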