docs/backend/SYCL.md (12 additions & 22 deletions)
--- a/docs/backend/SYCL.md
+++ b/docs/backend/SYCL.md
@@ -119,7 +119,7 @@ On older Intel GPUs, you may try [OpenCL](/docs/backend/OPENCL.md) although the
 *Notes:*
 
 - **Memory**
-  - The device memory is a limitation when running a large model. The loaded model size, *`llm_load_tensors: buffer_size`*, is displayed in the log when running `./bin/llama-cli`.
+  - The device memory is a limitation when running a large model. The loaded model size, *`llm_load_tensors: buffer_size`*, is displayed in the log when running `./bin/llama-completion`.
   - Please make sure the GPU shared memory from the host is large enough to account for the model's size. For e.g. the *llama-2-7b.Q4_0* requires at least 8.0GB for integrated GPU and 4.0GB for discrete GPU.
 
 - **Execution Unit (EU)**
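
Reviewer sketch (not part of the patch): the Memory note above says the loaded model must fit in device memory. A rough pre-check is to compare the model file's size against a memory budget before launching. The helper name `model_fits` and the whole-GiB budget are assumptions made for illustration, and the file size is only a proxy for the `llm_load_tensors: buffer_size` value reported in the log.

```sh
# Hypothetical helper: succeed (exit 0) if the model file is no larger than
# the given budget in whole GiB, fail (exit 1) if it is larger, and return 2
# if the file does not exist. File size only approximates the buffer size
# that the backend actually allocates.
model_fits() {
    model_path=$1
    budget_gib=$2
    [ -f "$model_path" ] || return 2
    # Arithmetic expansion strips any padding that `wc -c` may print.
    size_bytes=$(($(wc -c < "$model_path")))
    budget_bytes=$((budget_gib * 1024 * 1024 * 1024))
    [ "$size_bytes" -le "$budget_bytes" ]
}
```

For example, `model_fits models/llama-2-7b.Q4_0.gguf 4 || echo "model may not fit in 4 GiB"` would flag a file larger than the discrete-GPU figure quoted in the note.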
@@ -423,16 +423,12 @@ Choose one of following methods to run.
 - Use device 0:
 
 ```sh
-./examples/sycl/run-llama2.sh 0
-# OR
-./examples/sycl/run-llama3.sh 0
+./examples/sycl/test.sh -mg 0
 ```
 - Use multiple devices:
 
 ```sh
-./examples/sycl/run-llama2.sh
-# OR
-./examples/sycl/run-llama3.sh
+./examples/sycl/test.sh
 ```
 
 2. Command line
@@ -455,13 +451,13 @@ Examples:
 - Use device 0:
 
 ```sh
-ZES_ENABLE_SYSMAN=1 ./build/bin/llama-cli -no-cnv -m models/llama-2-7b.Q4_0.gguf -p "Building a website can be done in 10 simple steps:" -n 400 -e -ngl 99 -sm none -mg 0
+ZES_ENABLE_SYSMAN=1 ./build/bin/llama-completion -no-cnv -m models/llama-2-7b.Q4_0.gguf -p "Building a website can be done in 10 simple steps:" -n 400 -e -ngl 99 -sm none -mg 0 --mmap
 ```
 
 - Use multiple devices:
 
 ```sh
-ZES_ENABLE_SYSMAN=1 ./build/bin/llama-cli -no-cnv -m models/llama-2-7b.Q4_0.gguf -p "Building a website can be done in 10 simple steps:" -n 400 -e -ngl 99 -sm layer
+ZES_ENABLE_SYSMAN=1 ./build/bin/llama-completion -no-cnv -m models/llama-2-7b.Q4_0.gguf -p "Building a website can be done in 10 simple steps:" -n 400 -e -ngl 99 -sm layer --mmap
 ```
 
 *Notes:*
@@ -577,13 +573,13 @@ Or, use CMake presets to build:
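
Reviewer sketch (not part of the patch): the visible hunks replace `llama-cli` with `llama-completion` in the documented commands. To check whether any reference to the old binary name remains after applying the change, something like the following could be run against the file. The function name `find_stale_binary_refs` is hypothetical, and whether every remaining occurrence should be renamed is a judgment for the patch author.

```sh
# Hypothetical helper: print each line (with its line number) that still
# mentions the old binary name. Returns 0 if none are found, 1 if at least
# one is found, and 2 if the file does not exist.
find_stale_binary_refs() {
    file=$1
    old_name=${2:-llama-cli}
    [ -f "$file" ] || return 2
    # -F treats the name as a fixed string; -n prefixes matching line numbers.
    if grep -n -F -e "$old_name" "$file"; then
        return 1
    fi
    return 0
}
```

Usage: `find_stale_binary_refs docs/backend/SYCL.md` lists any leftover `llama-cli` mentions; a second argument selects a different name to search for.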