Skip to content

Commit fda79a8

Browse files
arthwrsenthilkumar6
authored andcommitted
[SCYL] add chapter for performance reference in SYCL.md (ggml-org#23315)
* add chapter for performance reference * rm unsupported GPU
1 parent e28771a commit fda79a8

2 files changed

Lines changed: 11 additions & 4 deletions

File tree

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -280,7 +280,7 @@ Instructions for adding support for new models: [HOWTO-add-model.md](docs/develo
280280
| [Metal](docs/build.md#metal-build) | Apple Silicon |
281281
| [BLAS](docs/build.md#blas-build) | All |
282282
| [BLIS](docs/backend/BLIS.md) | All |
283-
| [SYCL](docs/backend/SYCL.md) | Intel and Nvidia GPU |
283+
| [SYCL](docs/backend/SYCL.md) | Intel GPU |
284284
| [OpenVINO [In Progress]](docs/backend/OPENVINO.md) | Intel CPUs, GPUs, and NPUs |
285285
| [MUSA](docs/build.md#musa) | Moore Threads GPU |
286286
| [CUDA](docs/build.md#cuda) | Nvidia GPU |

docs/backend/SYCL.md

Lines changed: 10 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,7 @@
55
- [News](#news)
66
- [OS](#os)
77
- [Hardware](#hardware)
8+
- [Performance Reference](#performance-reference)
89
- [Docker](#docker)
910
- [Linux](#linux)
1011
- [Windows](#windows)
@@ -51,9 +52,8 @@ The packages for FP32 and FP16 would have different accuracy and performance on
5152

5253
## News
5354

54-
- 2026.04
55-
56-
- Optimize mul_mat by reorder feature for data type: Q4_K, Q5_K, Q_K, Q8_0.
55+
- 2026.04-05
56+
- Optimize mul_mat by reorder feature for data type: Q4_K, Q5_K, Q6_K, Q8_0.
5757
- Fused MoE.
5858
- Upgrate CI and built package for oneAPI 2025.3.3, support Ubuntu 24.04 built package.
5959

@@ -150,6 +150,13 @@ On older Intel GPUs, you may try [OpenCL](/docs/backend/OPENCL.md) although the
150150

151151
NA
152152

153+
## Performance Reference
154+
155+
156+
To get the supported LLMs, GPUs, and performance reference, please check [Performance of llama.cpp on Intel GPU with SYCL backend](https://github.com/ggml-org/llama.cpp/discussions/23313).
157+
158+
You could update your test result in it directly.
159+
153160
## Docker
154161

155162
The docker build option is currently limited to *Intel GPU* targets.

0 commit comments

Comments
 (0)