Skip to content

Commit e234e88

Browse files
authored
Update 2026-04-22-ai-agency.md
1 parent 17abdd4 commit e234e88

1 file changed

Lines changed: 34 additions & 10 deletions

File tree

_posts/2026-04-22-ai-agency.md

Lines changed: 34 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -74,18 +74,42 @@ Local AI on your own silicon is Self-Sovereign Computing in practice. Apple Sil
7474
It's also really coming of its own. Some recent advancements in leveraging the M-series silicon have double the inference speed of certain models:
7575

7676
**Metal (Q4_K_M quantization):**
77-
| |M1 Max (64GB) | M5 Max (128GB) |
78-
|----|-------|-----------|
79-
| Prompt eval: | 64 tok/s | 180 tok/s |
80-
| Decode: | 27 tok/s | 58 tok/s |
81-
77+
<table>
78+
<tr>
79+
<td></td>
80+
<td>M1 Max (64GB)</td>
81+
<td>M5 Max (128GB)</td>
82+
</tr>
83+
<tr>
84+
<td>Prompt eval:</td>
85+
<td>64 tok/s</td>
86+
<td>180 tok/s</td>
87+
</tr>
88+
<tr>
89+
<td>Decode:</td>
90+
<td>27 tok/s</td>
91+
<td>58 tok/s</td>
92+
</tr>
93+
</table>
8294

8395
**MLX (nvfp4 quantization):**
84-
| |M1 Max (64GB) | M5 Max (128GB) |
85-
|----|-------|-----------|
86-
| Prompt eval: | 9 tok/s | 14 tok/s |
87-
| Decode: | 52 tok/s | 111 tok/s |
88-
96+
<table>
97+
<tr>
98+
<td></td>
99+
<td>M1 Max (64GB)</td>
100+
<td>M5 Max (128GB)</td>
101+
</tr>
102+
<tr>
103+
<td>Prompt eval:</td>
104+
<td>9 tok/s</td>
105+
<td>14 tok/s</td>
106+
</tr>
107+
<tr>
108+
<td>Decode:</td>
109+
<td>52 tok/s </td>
110+
<td>111 tok/s</td>
111+
</tr>
112+
</table>
89113

90114
We built Self-Sovereign Identity so that your keys and credentials could stay under your control rather than being held by a platform that could revoke them. Local inference is the same principle applied to AI: the model runs where you do, on hardware you own. Combined with a self-sovereign wallet holding your keys and credentials, this is a complete stack where nothing about your digital life requires asking permission from a third party.
91115

0 commit comments

Comments
 (0)