modelscope
diff --git a/‎ajet/copilot/auto-research-blueprint-execute-classic/SKILL.md‎
Lines changed: 5 additions & 0 deletions b/‎ajet/copilot/auto-research-blueprint-execute-classic/SKILL.md‎
Lines changed: 5 additions & 0 deletions
diff --git a/‎ajet/copilot/auto-research-blueprint-execute-swarm/SKILL.md‎
Lines changed: 5 additions & 0 deletions b/‎ajet/copilot/auto-research-blueprint-execute-swarm/SKILL.md‎
Lines changed: 5 additions & 0 deletions
diff --git a/‎ajet/copilot/conda-install-agentjet-swarm-server/SKILL.md‎
Lines changed: 1 addition & 1 deletion b/‎ajet/copilot/conda-install-agentjet-swarm-server/SKILL.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎ajet/copilot/docker-install-agentjet-swarm-server/SKILL.md‎
Lines changed: 1 addition & 1 deletion b/‎ajet/copilot/docker-install-agentjet-swarm-server/SKILL.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎ajet/copilot/download_from_swanlab_url/SKILL.md‎
Lines changed: 189 additions & 0 deletions b/‎ajet/copilot/download_from_swanlab_url/SKILL.md‎
Lines changed: 189 additions & 0 deletions
diff --git a/‎ajet/copilot/install-agentjet-client/SKILL.md‎
Lines changed: 1 addition & 1 deletion b/‎ajet/copilot/install-agentjet-client/SKILL.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎ajet/copilot/map-verl-config/SKILL.md‎
Lines changed: 1 addition & 1 deletion b/‎ajet/copilot/map-verl-config/SKILL.md‎
Lines changed: 1 addition & 1 deletion
@@ -1,3 +1,8 @@
+---
+name: auto-research-blueprint-execute-classic
+description: Execute AgentJet reinforcement learning experiments using experiment blueprints in classic (non-swarm) mode. Handles full lifecycle: launch experiment in tmux, monitor progress, analyze errors, collect results, and write finish flag. Use when the user wants to run AgentJet training experiments without the swarm distributed framework.
+---
+
 ## 你的任务
 
 1. 根据实验蓝图，运行实验
 
@@ -1,3 +1,8 @@
+---
+name: auto-research-blueprint-execute-swarm
+description: Execute AgentJet reinforcement learning experiments using experiment blueprints in swarm mode. Handles full lifecycle: generate blueprint if needed, launch experiment in tmux, monitor progress, analyze errors, collect results, and write finish flag. Use when the user wants to run or debug AgentJet training experiments.
+---
+
 ## 你的任务
 
 0. 如果用户没有提供实验蓝图，则生成一个实验蓝图
 
@@ -1,6 +1,6 @@
 ---
 name: conda-install-agentjet-swarm-server
-description: Install agentjet swarm server with conda
+description: Install AgentJet swarm server using Conda. Handles Python 3.10 environment creation, dependency installation with the verl training backbone, flash-attn compilation, and optional PyPI mirror for China users.
 license: Complete terms in LICENSE.txt
 ---
 
 
@@ -1,6 +1,6 @@
 ---
 name: docker-install-agentjet-swarm-server
-description: Install and run AgentJet Swarm Server via Docker with GPU support
+description: Install and run the AgentJet Swarm Server in a Docker container with NVIDIA GPU support. Use when the user wants to deploy a swarm server on a GPU machine via Docker, including GPU driver setup, Docker mirror configuration, model weight mounting, and server startup.
 license: Complete terms in LICENSE.txt
 ---
 
 
@@ -0,0 +1,189 @@
+---
+name: download-from-swanlab-url
+description: Download per-step time-series metric data (reward, entropy, response length, etc.) from a SwanLab cloud run URL as a pandas.DataFrame. Use when the user provides a SwanLab URL and wants to fetch or analyze training curves.
+---
+
+# Skill: Download metric data from a SwanLab run URL
+
+## Goal
+
+Given a SwanLab cloud URL of the form
+
+```
+https://swanlab.cn/@<username>/<project>/runs/<exp_cuid>/chart
+```
+
+fetch the per-step time-series data for one or more metrics (e.g. reward, entropy, response length) as a `pandas.DataFrame` for downstream plotting or analysis.
+
+## Trial-and-error log (read first; saves hours)
+
+The shortcuts you might try **do not work**:
+
+1. **WebFetch / `curl` of the chart page** — the page is a Vue SPA. The HTML body is just `<div id="app"></div>`; no chart data is embedded. Don't bother scraping HTML.
+
+2. **Direct REST probe (`curl https://api.swanlab.cn/api/v1/runs/<cuid>`)** — returns `404 Not Found` even with a valid api key. The public REST surface is not the v1 path; use the Python SDK.
+
+3. **`swanlab verify` says you're logged in** — but the env may point to a private cloud. On this host:
+   ```
+   $ swanlab verify
+   swanlab: You are logged into https://cloud-20.agent-matrix.com as fuqingxu
+   ```
+   This is **not** swanlab.cn. The shell exports `SWANLAB_API_HOST` and `SWANLAB_WEB_HOST` in `~/.bashrc`, redirecting the SDK to a different deployment. Calling `OpenApi(api_key=...)` then fails with `Login failed: 404 Not Found` because `/login/api_key` only exists on swanlab.cn's API host.
+
+4. **Metric keys are NOT what the UI shows.** The chart card titled "reward" is logged as `critic/rewards/mean`. Asking `get_metrics(keys=['reward'])` returns code 404. You **must** probe candidate keys (see the cheat sheet below).
+
+5. **`get_summary` to enumerate keys** — does not work for non-cloned runs. When `rootExpId` / `rootProId` are `None` (i.e. the run was not cloned from another project), `experiment.get_summary` returns HTTP 400 "Bad Request". Skip it; probe candidate keys directly with `get_metrics`.
+
+## The working recipe
+
+### 1. Prerequisites
+
+- `swanlab` Python package (>= 0.7.4 confirmed).
+- An api key for `swanlab.cn`. On this machine it is stored in `~/.swanlab/.netrc`:
+  ```
+  machine https://api.swanlab.cn
+      login https://swanlab.cn
+      password <API_KEY>
+  ```
+  Read the password field from there if you don't have it on hand.
+
+### 2. Parse the URL
+
+```python
+import re
+
+URL = "https://swanlab.cn/@binaryhusky/spy-game-rl/runs/zku3ujg2k3unvt61jbu0s/chart"
+m = re.match(r"https?://swanlab\.cn/@([^/]+)/([^/]+)/runs/([^/]+)", URL)
+username, project, exp_id = m.group(1), m.group(2), m.group(3)
+```
+
+### 3. Override env vars BEFORE importing `swanlab`
+
+Critical: if your shell has `SWANLAB_API_HOST` / `SWANLAB_WEB_HOST` / `SWANLAB_API_KEY` pointing to a private cloud, the SDK silently uses them. Either run via `env -i` or unset them:
+
+```bash
+unset SWANLAB_API_KEY SWANLAB_API_HOST SWANLAB_WEB_HOST
+SWANLAB_API_HOST=https://api.swanlab.cn/api \
+SWANLAB_WEB_HOST=https://swanlab.cn \
+python your_script.py
+```
+
+Note: `SWANLAB_API_HOST` for swanlab.cn ends with `/api` (the SDK default in `swanlab/env.py`); without that suffix login also returns 404.
+
+### 4. Open the API and fetch metadata
+
+```python
+import swanlab
+api = swanlab.OpenApi(api_key="<API_KEY>")  # raises ValidationError if hosts wrong
+
+exp = api.get_experiment(project=project, exp_id=exp_id, username=username)
+assert exp.code == 200, exp.errmsg
+print(exp.data.name, exp.data.state)
+# exp.data.profile['config'] contains the full training config (verl-style nested dict)
+```
+
+### 5. Discover the right metric keys
+
+The keys you ask for must match what was logged. For a verl-based AgentJet run, the working names are:
+
+| You might think | Actual logged key       |
+| --------------- | ----------------------- |
+| `reward`        | `critic/rewards/mean`   |
+| `entropy`       | `actor/entropy`         |
+| `response_length` | `response_length/mean` |
+| `pg_loss`       | `actor/pg_loss`         |
+
+Other common ones: `response_length/max`, `response_length/min`, `critic/score/mean`. Probe defensively — `get_metrics` returns `code=200` and N rows on hit, `code=404, rows=0` on miss:
+
+```python
+candidates = [
+    "critic/rewards/mean", "critic/score/mean",
+    "actor/entropy", "actor/entropy_loss", "actor/pg_loss",
+    "response_length/mean", "response_length/max", "response_length/min",
+]
+for k in candidates:
+    r = api.get_metrics(exp_id=exp_id, keys=k)
+    print(f"{k:40s} code={r.code} rows={0 if r.data is None else len(r.data)}")
+```
+
+### 6. Fetch and save
+
+```python
+keys = ["critic/rewards/mean", "actor/entropy", "response_length/mean"]
+r = api.get_metrics(exp_id=exp_id, keys=keys)
+df = r.data  # indexed by `step`; one column per key + one `<key>_timestamp` column
+df.to_csv("metrics.csv")
+```
+
+The DataFrame layout is:
+
+```
+              actor/entropy  actor/entropy_timestamp  critic/rewards/mean  ...
+step
+1                  0.5569            1774003810000               0.7271
+2                  0.5732            1774004325000               0.7589
+...
+```
+
+The `_timestamp` columns are unix millis; usually you can drop them and plot against the `step` index.
+
+## End-to-end runnable snippet
+
+```python
+"""Fetch reward/entropy/response_length curves from a swanlab.cn run URL."""
+import os, re, sys
+# Strip any private-cloud overrides BEFORE importing swanlab.
+for v in ("SWANLAB_API_KEY", "SWANLAB_API_HOST", "SWANLAB_WEB_HOST"):
+    os.environ.pop(v, None)
+os.environ["SWANLAB_API_HOST"] = "https://api.swanlab.cn/api"
+os.environ["SWANLAB_WEB_HOST"] = "https://swanlab.cn"
+
+import swanlab
+
+URL = sys.argv[1]
+API_KEY = sys.argv[2]  # or read from ~/.swanlab/.netrc
+m = re.match(r"https?://swanlab\.cn/@([^/]+)/([^/]+)/runs/([^/]+)", URL)
+username, project, exp_id = m.groups()
+
+api = swanlab.OpenApi(api_key=API_KEY)
+keys = ["critic/rewards/mean", "actor/entropy", "response_length/mean"]
+r = api.get_metrics(exp_id=exp_id, keys=keys)
+assert r.code == 200, r.errmsg
+df = r.data.rename(columns={
+    "critic/rewards/mean": "reward",
+    "actor/entropy": "entropy",
+    "response_length/mean": "response_length",
+})
+df.to_csv("metrics.csv")
+print(df.head())
+```
+
+## Plot recipe (seaborn, optional)
+
+```python
+import seaborn as sns, matplotlib.pyplot as plt, pandas as pd
+df = pd.read_csv("metrics.csv").sort_values("step")
+sns.set_theme(context="paper", style="whitegrid")
+fig, axes = plt.subplots(1, 3, figsize=(11.5, 3.0))
+palette = sns.color_palette("deep")
+for ax, (col, title, c) in zip(axes, [
+    ("reward", "Reward", palette[0]),
+    ("entropy", "Policy Entropy", palette[3]),
+    ("response_length", "Response Length", palette[2]),
+]):
+    sns.lineplot(data=df, x="step", y=col, ax=ax, color=c, linewidth=1.6)
+    ax.set_title(title); ax.set_xlabel("Training step"); ax.set_ylabel("")
+fig.tight_layout()
+fig.savefig("curves.pdf", bbox_inches="tight")
+```
+
+## Troubleshooting cheat sheet
+
+| Symptom | Likely cause | Fix |
+| --- | --- | --- |
+| `ValidationError: Login failed: 404 Not Found` on `OpenApi(api_key=...)` | `SWANLAB_API_HOST` points to a private cloud, or missing `/api` suffix | Unset the env vars and explicitly set `SWANLAB_API_HOST=https://api.swanlab.cn/api` |
+| `get_metrics` returns `code=404, "No data found"` | Wrong key name (UI label != log key) | Probe with the cheat sheet in §5; remember verl prefixes (`actor/`, `critic/`, `response_length/...`) |
+| `get_summary` returns `code=400, "Bad Request"` | Run is not a clone; `rootExpId`/`rootProId` are None | Don't use `get_summary` for non-cloned runs; just probe `get_metrics` |
+| `OpenApi.login_info` is not callable | It's a property, not a method | Access as `api.login_info` (no `()`) |
+| WebFetch returns "no data" / SPA shell | Chart page is Vue-rendered client-side | Use the SDK; do not scrape HTML |
+| `swanlab verify` shows wrong host | `~/.bashrc` exports redirect SDK to a private cloud | Override env at script start, before `import swanlab` |
@@ -1,6 +1,6 @@
 ---
 name: install-agentjet-client
-description: Install agentjet swarm server with uv package manager
+description: Install AgentJet client for connecting to a swarm server. Use when the user only needs to run the AgentJet client (not a swarm server) and does not need to run models locally, e.g. on a laptop. Installs basic requirements via `pip install -e .`.
 license: Complete terms in LICENSE.txt
 ---
 
 
@@ -1,6 +1,6 @@
 ---
 name: map-verl-config
-description: map verl config to agentjet config
+description: Map VERL training configuration to AgentJet configuration. Find VERL config in verl_default.yaml, check for existing mappings in config_auto_convertion_verl.jsonc, add new mappings to ajet_default.yaml and the conversion schema, and optionally add parameters to AgentJetJob.
 license: Complete terms in LICENSE.txt
 ---