Commit 3577af3

gHashTag and ona-agent committed
Increase Fly.io memory to 2GB (OOM fix)
Model requires ~870MB for dequantized weights + buffers.

Co-authored-by: Ona <no-reply@ona.com>
1 parent 3d330cc commit 3577af3

1 file changed

Lines changed: 5 additions & 5 deletions

fly.toml

@@ -12,12 +12,12 @@ primary_region = "iad"
 TEMPERATURE = "0.7"
 TOP_P = "0.9"
 
-# Use shared-cpu-1x for SmolLM-135M (small model)
-# 1GB RAM needed for model loading + inference buffers
+# Use shared-cpu-2x for SmolLM-135M
+# 2GB RAM needed: model (139MB) + dequantized weights (~600MB) + buffers
 [[vm]]
-size = "shared-cpu-1x"
-memory = "1gb"
-cpus = 1
+size = "shared-cpu-2x"
+memory = "2gb"
+cpus = 2
 
 # Persistent volume for models (optional - model is baked into image)
 # [[mounts]]
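
The memory reasoning behind this change can be checked with a rough back-of-the-envelope sketch. The only figure taken from the commit is the ~870MB requirement. The helper names are hypothetical, and the sketch assumes that `memory = "1gb"` and `"2gb"` mean 1024 MB and 2048 MB, that weights are dequantized to 4-byte float32, and that the 135M in the model name is the parameter count. None of these assumptions is confirmed by the commit.

```python
# Illustrative memory budget for the VM size change (a sketch, not a measurement).

REQUIRED_MB = 870  # figure quoted in the commit message


def fp32_weight_mb(n_params: int, bytes_per_param: int = 4) -> float:
    """Approximate size of dequantized weights in MB (assumes float32)."""
    return n_params * bytes_per_param / 1e6


def headroom_mb(vm_memory_mb: int, required_mb: int = REQUIRED_MB) -> int:
    """Memory left for the OS, runtime, and request handling."""
    return vm_memory_mb - required_mb


# Assumption: 135M parameters, taken at face value from the name SmolLM-135M.
print(f"fp32 weights: ~{fp32_weight_mb(135_000_000):.0f} MB")

# Assumption: "1gb" -> 1024 MB and "2gb" -> 2048 MB.
for label, vm_mb in (("1gb", 1024), ("2gb", 2048)):
    print(f"{label}: {headroom_mb(vm_mb)} MB headroom")
```

Under these assumptions the 1GB machine leaves only about 154 MB for everything other than the model, which is consistent with the OOM this commit addresses, while the 2GB machine leaves over 1 GB.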
