Skip to content

Commit 111b2aa

Browse files
authored
Update README.md
1 parent bac287f commit 111b2aa

1 file changed

Lines changed: 20 additions & 1 deletion

File tree

README.md

Lines changed: 20 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,26 @@ What features are currently implemented:
1010
- This library is compiled against the netstandard 2.1, that means you can use it both in .NET 5+ applications and the Unity game engine, allowing this to be used for games as well.
1111

1212
You still have to build [s2.dll](https://github.com/rodrigomatta/s2.cpp) manually along with the ggml.
13-
Occasionally I'll include pre-built versions of S2 along with the ggml dll's, but usually they'll only come with CUDA/VULKAN/CPU support since I don't use crapple(apple).
13+
14+
Occasionally I'll include pre-built versions of S2 along with the ggml dll's in [Releases](https://github.com/subspecs/FishS2Sharp/releases/), but usually they'll only come with CUDA/VULKAN/CPU support since I don't use crapple(apple).
15+
16+
17+
## Model variants
18+
19+
GGUF files are available at [rodrigomt/s2-pro-gguf](https://huggingface.co/rodrigomt/s2-pro-gguf) on Hugging Face.
20+
21+
| File | Size | Notes |
22+
|---|---|---|
23+
| `s2-pro-f16.gguf` | 9.9 GB | Full precision — reference quality |
24+
| `s2-pro-q8_0.gguf` | 5.6 GB | Near-lossless — recommended for 8+ GB VRAM |
25+
| `s2-pro-q6_k.gguf` | 4.5 GB | Good quality/size balance — recommended for 6+ GB VRAM |
26+
| `s2-pro-q5_k_m.gguf` | 4.0 GB | Smaller with still-good quality |
27+
| `s2-pro-q4_k_m.gguf` | 3.6 GB | Best compact variant so far in quick RU validation |
28+
| `s2-pro-q3_k.gguf` | 3.0 GB | Usable, but starts stretching short words |
29+
| `s2-pro-q2_k.gguf` | 2.6 GB | Lowest-size experimental variant |
30+
31+
All variants include both the transformer weights and the audio codec in a single file.
32+
The quantized variants above were regenerated with the codec tensors (`c.*`) kept in `F16`, so only the AR transformer is quantized.
1433

1534
Example usage:
1635
```C#

0 commit comments

Comments
 (0)