Commit b8a810f

docs: update quick start and curl snippet to demonstrate 122B model deduplication JSON query
1 parent f93da86 commit b8a810f

1 file changed

Lines changed: 7 additions & 3 deletions

README.md

@@ -59,7 +59,8 @@ swift build -c release

 ```bash
 .build/release/mlx-server \
---model mlx-community/Qwen2.5-3B-Instruct-4bit \
+--model Qwen3.5-122B-A10B-4bit \
+--stream-experts true \
 --port 5413
 ```

@@ -83,9 +84,12 @@ Drop-in compatible with standard OpenAI HTTP consumers:
 curl http://localhost:5413/v1/chat/completions \
 -H "Content-Type: application/json" \
 -d '{
-"model": "mlx-community/Qwen2.5-3B-Instruct-4bit",
+"model": "Qwen3.5-122B-A10B-4bit",
 "stream": true,
-"messages": [{"role": "user", "content": "Explain the speed of light."}]
+"messages": [
+{"role": "system", "content": "You are Aegis-AI, a local home security agent. Output strictly in JSON format."},
+{"role": "user", "content": "Clip 1: Delivery person drops package at 14:02. Clip 2: Delivery person walks away down driveway at 14:03. Do these clips represent the same security event? Output a JSON object with a `duplicate` boolean and a `reason` string."}
+]
 }'
 ```
