Skip to content

Commit 7dbb0e9

Browse files
authored
examples : update args speculative-simple README.md [no ci] (ggml-org#22938)
This commit updates the command line arguments to use the correct names and values which are now required. The motivation for this change is that currently running the example command as is will generate the following errors: ```console error while handling argument "--color": error: unknown value for --color: '--sampling-seq' usage: -co, --color [on|off|auto] Colorize output to distinguish prompt and user input from generations ('on', 'off', or 'auto', default: 'auto') 'auto' enables colors when output is to a terminal error while handling argument "-fa": error: unknown value for --flash-attn: '--temp' usage: -fa, --flash-attn [on|off|auto] set Flash Attention use ('on', 'off', or 'auto', default: 'auto') (env: LLAMA_ARG_FLASH_ATTN) error while handling argument "--draft-max": the argument has been removed. use --spec-draft-n-max or --spec-ngram-mod-n-max usage: --draft, --draft-n, --draft-max N the argument has been removed. use --spec-draft-n-max or --spec-ngram-mod-n-max (env: LLAMA_ARG_DRAFT_MAX) error while handling argument "--draft-min": the argument has been removed. use --spec-draft-n-min or --spec-ngram-mod-n-min usage: --draft-min, --draft-n-min N the argument has been removed. use --spec-draft-n-min or --spec-ngram-mod-n-min (env: LLAMA_ARG_DRAFT_MIN) ```
1 parent dd9280a commit 7dbb0e9

1 file changed

Lines changed: 3 additions & 3 deletions

File tree

examples/speculative-simple/README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ Demonstration of basic greedy speculative decoding
66
./bin/llama-speculative-simple \
77
-m ../models/qwen2.5-32b-coder-instruct/ggml-model-q8_0.gguf \
88
-md ../models/qwen2.5-1.5b-coder-instruct/ggml-model-q4_0.gguf \
9-
-f test.txt -c 0 -ngl 99 --color \
10-
--sampling-seq k --top-k 1 -fa --temp 0.0 \
11-
-ngld 99 --draft-max 16 --draft-min 5 --draft-p-min 0.9
9+
-f test.txt -c 0 -ngl 99 --color on \
10+
--sampling-seq k --top-k 1 -fa on --temp 0.0 \
11+
-ngld 99 --spec-draft-n-max 16 --spec-draft-n-draft-min 5 --draft-p-min 0.9
1212
```

0 commit comments

Comments
 (0)