Update SKILL by alex000kim · Pull Request #9196 · skypilot-org/skypilot

alex000kim · 2026-03-26T16:48:23Z

Updates to SKILL based on several autoresearch runs from https://blog.skypilot.co/scaling-autoresearch/

gemini-code-assist

Code Review

This pull request updates the SkyPilot skill documentation by adding advanced command options, explaining workdir synchronization behavior, and providing examples for parallel experiment submission. It also refines the recommended agent feedback loop and lists common mistakes. The review feedback suggests making the Python script for job ID extraction more robust against empty results and using explicit job IDs in log commands to avoid ambiguity when multiple jobs are present.

gemini-code-assist · 2026-03-26T16:50:47Z

+job_id=$(sky queue exp-vm-01 -o json \
+  | python3 -c "import sys,json; jobs=json.load(sys.stdin)['exp-vm-01']; print(max(j['job_id'] for j in jobs))")


The Python one-liner to get the latest job ID is clever, but it's not robust against cases where no jobs are running on the cluster. If sky queue returns no jobs, json.load(sys.stdin)['exp-vm-01'] will likely be an empty list, causing max() to raise a ValueError and break the script. It's better to handle this edge case gracefully.

Suggested change

job_id=$(sky queue exp-vm-01 -o json \

| python3 -c "import sys,json; jobs=json.load(sys.stdin)['exp-vm-01']; print(max(j['job_id'] for j in jobs))")

job_id=$(sky queue exp-vm-01 -o json \

| python3 -c "import sys, json; jobs = json.load(sys.stdin).get('exp-vm-01', []); print(max(j['job_id'] for j in jobs) if jobs else '')")

gemini-code-assist · 2026-03-26T16:50:47Z

-4. **Debug**: `sky logs mycluster` (stream logs) or `ssh mycluster` (interactive)
-5. **Iterate**: `sky exec mycluster updated_task.yaml` (run on existing cluster)
-6. **Cleanup**: `sky down mycluster`
+4. **Wait for completion**: `sky logs mycluster --status` (blocks until job finishes; exits 0 on success)


For robustness, especially in programmatic use by an agent, it's better to explicitly use a job ID with sky logs. The current command sky logs mycluster --status might be ambiguous if multiple jobs are on the cluster, even if it defaults to the latest one. This would also make the example consistent with the recommendation in the callout on line 408.

Consider mentioning how to obtain the job ID (e.g., from sky launch output or sky queue) and using it here and in the following steps. For example:

4. **Wait for completion**: Get the job ID (e.g., from sky launchoutput), then runsky logs mycluster <JOB_ID> --status...

romilbhardwaj

Thanks @alex000kim!

romilbhardwaj · 2026-04-01T06:07:58Z

-4. **Debug**: `sky logs mycluster` (stream logs) or `ssh mycluster` (interactive)
-5. **Iterate**: `sky exec mycluster updated_task.yaml` (run on existing cluster)
-6. **Cleanup**: `sky down mycluster`
+4. **Wait for completion**: `sky logs mycluster <JOB_ID> --status` (blocks until job finishes; exits 0 on success; get JOB_ID from `sky queue mycluster -o json`)


instead of sky logs mycluster <JOB_ID> --status, would just streaming logs with sky logs mycluster be better for an agent? So if an application causes the job stall (e.g., timeout + retry loops), the agent can look at the output to infer next steps?

good point, updated!

update SKILL

c9bd1e6

gemini-code-assist Bot reviewed Mar 26, 2026

View reviewed changes

address comments

f0c491f

romilbhardwaj reviewed Apr 1, 2026

View reviewed changes

update with log streaming

53a32b2

romilbhardwaj approved these changes Apr 7, 2026

View reviewed changes

romilbhardwaj merged commit c54685f into skypilot-org:master Apr 7, 2026
21 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update SKILL#9196

Update SKILL#9196
romilbhardwaj merged 3 commits into
skypilot-org:masterfrom
alex000kim:update-agent-skill

alex000kim commented Mar 26, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Mar 26, 2026

Uh oh!

gemini-code-assist Bot Mar 26, 2026

Uh oh!

romilbhardwaj left a comment

Uh oh!

romilbhardwaj Apr 1, 2026

Uh oh!

alex000kim Apr 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		job_id=$(sky queue exp-vm-01 -o json \
		\| python3 -c "import sys,json; jobs=json.load(sys.stdin)['exp-vm-01']; print(max(j['job_id'] for j in jobs))")

Conversation

alex000kim commented Mar 26, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

romilbhardwaj left a comment

Choose a reason for hiding this comment

Uh oh!

romilbhardwaj Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

alex000kim Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants