[Kernel] feat: add Metal support for Apple Silicon GPU by AlpinDale · Pull Request #1415 · dphnAI/aphrodite-engine

AlpinDale · 2025-08-12T00:30:36Z

This PR adds native support for Apple M-series GPUs, with the kernels written in the Metal Shading Language. Attention is currently implemented through Torch SDPA's MPS backend together with custom paged attention Metal kernels.
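For readers unfamiliar with paged attention, here is a minimal sketch of the addressing idea behind it: the KV cache is split into fixed-size blocks, and a per-sequence block table maps logical token positions to physical blocks. This is a plain-Python illustration, not the Metal kernel from this PR; the function names, block size, and block numbers are all hypothetical.

```python
from typing import List, Tuple


def locate_token(block_table: List[int], token_idx: int, block_size: int) -> Tuple[int, int]:
    """Map a logical token position to (physical_block, offset_in_block)."""
    logical_block = token_idx // block_size  # which entry of the block table
    offset = token_idx % block_size          # slot inside that block
    return block_table[logical_block], offset


def gather_positions(block_table: List[int], seq_len: int, block_size: int) -> List[Tuple[int, int]]:
    """List the physical location of every token in a sequence."""
    return [locate_token(block_table, i, block_size) for i in range(seq_len)]


# Hypothetical example: 6 tokens stored in physical blocks 7 and 2, 4 tokens per block.
print(gather_positions([7, 2], seq_len=6, block_size=4))
```

A paged attention kernel performs the same lookup per key/value position, so a sequence's cache does not need to be contiguous in memory.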

To test, first make sure the Xcode command line tools are installed:

$ xcode-select --install

Then configure the path:

$ xcode-select --switch $(xcode-select --print-path)

Then build:

$ APHRODITE_TARGET_DEVICE=mps pip install -e .
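If the build fails, a quick preflight check can confirm the prerequisites above. The sketch below uses only the Python standard library and the `xcode-select --print-path` command already shown; the `preflight` helper and its check names are hypothetical and not part of this PR.

```python
import platform
import shutil
import subprocess


def preflight() -> dict:
    """Report whether the machine looks ready for an MPS build."""
    checks = {
        "macos": platform.system() == "Darwin",
        "arm64": platform.machine() == "arm64",
        "xcode_select_found": shutil.which("xcode-select") is not None,
        "developer_dir_set": False,
    }
    if checks["xcode_select_found"]:
        # Prints the active developer directory; a non-zero exit means none is configured.
        result = subprocess.run(
            ["xcode-select", "--print-path"], capture_output=True, text=True
        )
        checks["developer_dir_set"] = result.returncode == 0 and bool(result.stdout.strip())
    return checks


for name, ok in preflight().items():
    print(f"{name}: {'ok' if ok else 'missing'}")
```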

You can run the API server:

$ aphrodite run Qwen/Qwen3-0.6B --no-enable-chunked-prefill --no-enable-prefix-caching
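Once the server is up, a request along these lines should exercise it. This is a sketch under assumptions not stated in this PR: that the server exposes an OpenAI-compatible `/v1/completions` route and listens on `http://localhost:2242`. Adjust `base_url` to whatever the server logs on startup. The helper names are hypothetical.

```python
import json
import urllib.request


def build_completion_request(
    prompt: str,
    base_url: str = "http://localhost:2242",  # assumed default address, not confirmed here
    model: str = "Qwen/Qwen3-0.6B",
    max_tokens: int = 32,
) -> urllib.request.Request:
    """Build a POST request for an OpenAI-style completions endpoint."""
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt: str, **kwargs) -> str:
    """Send the request and return the first completion's text."""
    with urllib.request.urlopen(build_completion_request(prompt, **kwargs)) as resp:
        return json.load(resp)["choices"][0]["text"]
```

Calling `complete("The capital of France is")` with the server running returns the generated text. Given the accuracy issues noted under Benchmarks, it is worth comparing the result against the CPU backend.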

Benchmarks

Qwen3-0.6B BF16, Apple M4 Pro (MacBook), Batch Size 1:

CPU Backend:

Prefill: 110.1 tokens/s, Decode: 22.1 tokens/s

MPS Backend:

Prefill: 2415.7 tokens/s, Decode: 14.8 tokens/s

MLX (LMStudio):

Prefill: 1318.4 tokens/s, Decode: 138.53 tokens/s

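Currently, MPS prefill is roughly 22x faster than the CPU backend (2415.7 vs 110.1 tokens/s), but decode is about a third slower (14.8 vs 22.1 tokens/s) and far behind MLX, so decode needs more work. The MPS backend also has accuracy issues: the outputs are incorrect, though not NaN or gibberish.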
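The relative numbers can be reproduced from the figures above with a few lines of Python. This is only arithmetic on the reported throughputs; the `results` dictionary and `ratio` helper are illustrative names.

```python
# Reported throughput in tokens/s (Qwen3-0.6B BF16, Apple M4 Pro, batch size 1).
results = {
    "CPU": {"prefill": 110.1, "decode": 22.1},
    "MPS": {"prefill": 2415.7, "decode": 14.8},
    "MLX": {"prefill": 1318.4, "decode": 138.53},
}


def ratio(backend: str, baseline: str, phase: str) -> float:
    """Throughput of `backend` relative to `baseline` for one phase."""
    return results[backend][phase] / results[baseline][phase]


for baseline in ("CPU", "MLX"):
    for phase in ("prefill", "decode"):
        print(f"MPS vs {baseline} {phase}: {ratio('MPS', baseline, phase):.2f}x")
```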

Tests

$ pytest tests/kernels/attention/test_attention.py
$ pytest tests/kernels/core/test_activation.py
$ pytest tests/kernels/core/test_layernorm.py
$ pytest tests/kernels/core/test_pos_encoding.py
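Given the accuracy issues noted above, it helps to know what a kernel is checked against. The test files themselves are not shown here, so the following is only a sketch of the kind of reference a layernorm test might use: a plain-Python RMSNorm and a tolerance comparison. The names `rms_norm_ref` and `allclose` and the tolerance values are hypothetical.

```python
import math
from typing import List, Sequence


def rms_norm_ref(x: Sequence[float], weight: Sequence[float], eps: float = 1e-6) -> List[float]:
    """Reference RMSNorm: x / sqrt(mean(x^2) + eps), scaled elementwise by weight."""
    mean_sq = sum(v * v for v in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [v * inv_rms * w for v, w in zip(x, weight)]


def allclose(a: Sequence[float], b: Sequence[float], rtol: float = 1e-3, atol: float = 1e-3) -> bool:
    """True if the sequences have equal length and every pair agrees within tolerance."""
    return len(a) == len(b) and all(
        math.isclose(p, q, rel_tol=rtol, abs_tol=atol) for p, q in zip(a, b)
    )


# A kernel output would take the place of the second argument in a real test.
reference = rms_norm_ref([3.0, 4.0], [1.0, 1.0])
print(allclose(reference, reference))
```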

TODO

AlpinDale added 9 commits August 12, 2025 04:55

[Kernel] feat: add Metal support for Apple Silicon GPU

f101554

add activation kernels

508f1b4

add rms_norm kernels

6107c35

add rotary embeddings kernels

d765795

remove dead code

93cc0d8

get python executable dynamically

774921f

find_package for python dev headers

41171a2

implement mps worker and model runner

fd8172f

make setup.py installs work

67d00ba

AlpinDale mentioned this pull request Sep 3, 2025

[Feature]: Add support for Apple MPS (Metal Performance Shaders) vllm-project/vllm#22629 (Open, 1 task)

AlpinDale wants to merge 9 commits into main from metal-backend