Skip to content

Commit 1762bab

Browse files
danbevJcfunk
authored andcommitted
examples : update args speculative-simple README.md [no ci] (ggml-org#22938)
This commit updates the command line arguments to use the correct names and values which are now required. The motivation for this change is that currently running the example command as is will generate the following errors: ```console error while handling argument "--color": error: unknown value for --color: '--sampling-seq' usage: -co, --color [on|off|auto] Colorize output to distinguish prompt and user input from generations ('on', 'off', or 'auto', default: 'auto') 'auto' enables colors when output is to a terminal error while handling argument "-fa": error: unknown value for --flash-attn: '--temp' usage: -fa, --flash-attn [on|off|auto] set Flash Attention use ('on', 'off', or 'auto', default: 'auto') (env: LLAMA_ARG_FLASH_ATTN) error while handling argument "--draft-max": the argument has been removed. use --spec-draft-n-max or --spec-ngram-mod-n-max usage: --draft, --draft-n, --draft-max N the argument has been removed. use --spec-draft-n-max or --spec-ngram-mod-n-max (env: LLAMA_ARG_DRAFT_MAX) error while handling argument "--draft-min": the argument has been removed. use --spec-draft-n-min or --spec-ngram-mod-n-min usage: --draft-min, --draft-n-min N the argument has been removed. use --spec-draft-n-min or --spec-ngram-mod-n-min (env: LLAMA_ARG_DRAFT_MIN) ```
1 parent 88e22ab commit 1762bab

1 file changed

Lines changed: 3 additions & 3 deletions

File tree

examples/speculative-simple/README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ Demonstration of basic greedy speculative decoding
66
./bin/llama-speculative-simple \
77
-m ../models/qwen2.5-32b-coder-instruct/ggml-model-q8_0.gguf \
88
-md ../models/qwen2.5-1.5b-coder-instruct/ggml-model-q4_0.gguf \
9-
-f test.txt -c 0 -ngl 99 --color \
10-
--sampling-seq k --top-k 1 -fa --temp 0.0 \
11-
-ngld 99 --draft-max 16 --draft-min 5 --draft-p-min 0.9
9+
-f test.txt -c 0 -ngl 99 --color on \
10+
--sampling-seq k --top-k 1 -fa on --temp 0.0 \
11+
-ngld 99 --spec-draft-n-max 16 --spec-draft-n-draft-min 5 --draft-p-min 0.9
1212
```

0 commit comments

Comments
 (0)