Skip to content

Commit 6ea87bc

Browse files
committed
small update
1 parent ff6e6e7 commit 6ea87bc

2 files changed

Lines changed: 34 additions & 0 deletions

File tree

_posts/2026-04-27-Running-a-Local-LLM-Coding-Agent-on-Scaleway.md

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -769,6 +769,23 @@ PUBLIC_IP=$(scw instance server get $INSTANCE_ID zone=fr-par-2 -o json \
769769

770770
This works because every resource was created inside a dedicated project (§1 step 3), with a known name or tag. For longer-lived deployments, Terraform / OpenTofu / Pulumi are the right tools.
771771

772+
### Stopping llama-server
773+
774+
`Ctrl+C` inside the tmux session works, but the cleanest one-shot (run on the **remote instance**) is:
775+
776+
```bash
777+
tmux kill-session -t llmserver
778+
```
779+
780+
This kills the session and the server process in one step, freeing VRAM. Verify:
781+
782+
```bash
783+
ss -tlnp | grep 8080 # should print nothing
784+
nvidia-smi --query-gpu=memory.used --format=csv # should drop near 0
785+
```
786+
787+
Useful if you want to relaunch llama-server with different flags (e.g. a larger `--ctx-size`) without rebooting the instance.
788+
772789
### Stopping the instance
773790

774791
Compute billing pauses. Volume and IP continue at ≈ €0.38/day combined. Everything on the volume survives.

_posts/WP_2026-04-27-Running-a-Local-LLM-Coding-Agent-on-Scaleway.md

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -749,6 +749,23 @@ PUBLIC_IP=$(scw instance server get $INSTANCE_ID zone=fr-par-2 -o json \
749749

750750
This works because every resource was created inside a dedicated project (§1 step 3), with a known name or tag. For longer-lived deployments, Terraform / OpenTofu / Pulumi are the right tools.
751751

752+
### Stopping llama-server
753+
754+
`Ctrl+C` inside the tmux session works, but the cleanest one-shot (run on the **remote instance**) is:
755+
756+
```bash
757+
tmux kill-session -t llmserver
758+
```
759+
760+
This kills the session and the server process in one step, freeing VRAM. Verify:
761+
762+
```bash
763+
ss -tlnp | grep 8080 # should print nothing
764+
nvidia-smi --query-gpu=memory.used --format=csv # should drop near 0
765+
```
766+
767+
Useful if you want to relaunch llama-server with different flags (e.g. a larger `--ctx-size`) without rebooting the instance.
768+
752769
### Stopping the instance
753770

754771
Compute billing pauses. Volume and IP continue at ≈ €0.38/day combined. Everything on the volume survives.

0 commit comments

Comments
 (0)