Skip to content

Commit 9f804a8

Browse files
committed
feat: general-purpose Daytona runner, GHCR base images, and task registry
Add scripts/daytona_runner.py — a complete benchmark runner that creates Daytona sandboxes dynamically from task Dockerfiles, runs Claude Code, verifies results, and cleans up. Supports all 4 config modes, parallel execution, OAuth/API key auth, dry-run validation, and suite selection. Supporting changes: - scripts/build_daytona_registry.py: generates daytona_task_registry.json (283 tasks, 249 Daytona-ready across 19 suites) - scripts/push_base_images_ghcr.sh: build-push-clean script for 22 ccb-repo base images to ghcr.io/sjarmak/ - base_images/Dockerfile.*: fix 8 Dockerfiles to use git clone instead of COPY from nonexistent _clone_cache directory - base_images/update_task_dockerfiles.py: use GHCR prefix for rewrites - 108 task Dockerfiles: FROM ccb-repo-* → FROM ghcr.io/sjarmak/ccb-repo-* - docs/DAYTONA.md: setup guide covering credentials, configs, and usage
1 parent 6616eb1 commit 9f804a8

File tree

122 files changed

+21805
-142
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

122 files changed

+21805
-142
lines changed

base_images/Dockerfile.containerd-317286ac

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -8,8 +8,6 @@ RUN apt-get update && apt-get install -y --no-install-recommends \
88
curl \
99
&& rm -rf /var/lib/apt/lists/*
1010

11-
COPY _clone_cache/containerd-317286ac/ /workspace/
12-
13-
RUN cd /workspace && \
11+
RUN git clone --depth 1 https://github.com/sg-evals/containerd--317286ac.git . && \
1412
git config user.email "agent@example.com" && \
1513
git config user.name "Agent"

base_images/Dockerfile.etcd-d89978e8

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -8,8 +8,6 @@ RUN apt-get update && apt-get install -y --no-install-recommends \
88
curl \
99
&& rm -rf /var/lib/apt/lists/*
1010

11-
COPY _clone_cache/etcd-d89978e8/ /workspace/
12-
13-
RUN cd /workspace && \
11+
RUN git clone --depth 1 https://github.com/sg-evals/etcd--d89978e8.git . && \
1412
git config user.email "agent@example.com" && \
1513
git config user.name "Agent"

base_images/Dockerfile.flask-798e006f

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -8,8 +8,6 @@ RUN apt-get update && apt-get install -y --no-install-recommends \
88
curl \
99
&& rm -rf /var/lib/apt/lists/*
1010

11-
COPY _clone_cache/flask-798e006f/ /workspace/
12-
13-
RUN cd /workspace && \
11+
RUN git clone --depth 1 https://github.com/sg-evals/flask--798e006f.git . && \
1412
git config user.email "agent@example.com" && \
1513
git config user.name "Agent"

base_images/Dockerfile.numpy-a639fbf5

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -8,8 +8,6 @@ RUN apt-get update && apt-get install -y --no-install-recommends \
88
curl \
99
&& rm -rf /var/lib/apt/lists/*
1010

11-
COPY _clone_cache/numpy-a639fbf5/ /workspace/
12-
13-
RUN cd /workspace && \
11+
RUN git clone --depth 1 https://github.com/sg-evals/numpy--a639fbf5.git . && \
1412
git config user.email "agent@example.com" && \
1513
git config user.name "Agent"

base_images/Dockerfile.pandas-41968da5

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -8,8 +8,6 @@ RUN apt-get update && apt-get install -y --no-install-recommends \
88
curl \
99
&& rm -rf /var/lib/apt/lists/*
1010

11-
COPY _clone_cache/pandas-41968da5/ /workspace/
12-
13-
RUN cd /workspace && \
11+
RUN git clone --depth 1 https://github.com/sg-evals/pandas--41968da5.git . && \
1412
git config user.email "agent@example.com" && \
1513
git config user.name "Agent"

base_images/Dockerfile.requests-421b8733

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -8,8 +8,6 @@ RUN apt-get update && apt-get install -y --no-install-recommends \
88
curl \
99
&& rm -rf /var/lib/apt/lists/*
1010

11-
COPY _clone_cache/requests-421b8733/ /workspace/
12-
13-
RUN cd /workspace && \
11+
RUN git clone --depth 1 https://github.com/sg-evals/requests--421b8733.git . && \
1412
git config user.email "agent@example.com" && \
1513
git config user.name "Agent"

base_images/Dockerfile.rust-01f6ddf7

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -8,8 +8,6 @@ RUN apt-get update && apt-get install -y --no-install-recommends \
88
curl \
99
&& rm -rf /var/lib/apt/lists/*
1010

11-
COPY _clone_cache/rust-01f6ddf7/ /workspace/
12-
13-
RUN cd /workspace && \
11+
RUN git clone --depth 1 https://github.com/sg-evals/rust--01f6ddf7.git . && \
1412
git config user.email "agent@example.com" && \
1513
git config user.name "Agent"

base_images/Dockerfile.scikit-learn-cb7e82dd

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -8,8 +8,6 @@ RUN apt-get update && apt-get install -y --no-install-recommends \
88
curl \
99
&& rm -rf /var/lib/apt/lists/*
1010

11-
COPY _clone_cache/scikit-learn-cb7e82dd/ /workspace/
12-
13-
RUN cd /workspace && \
11+
RUN git clone --depth 1 https://github.com/sg-evals/scikit-learn--cb7e82dd.git . && \
1412
git config user.email "agent@example.com" && \
1513
git config user.name "Agent"

base_images/update_task_dockerfiles.py

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -213,9 +213,12 @@ def extract_task_specific_lines(lines: list[str], base: dict) -> list[str]:
213213
return result
214214

215215

216+
GHCR_PREFIX = "ghcr.io/sjarmak"
217+
218+
216219
def rewrite_dockerfile(path: Path, base: dict, task_lines: list[str], dry_run: bool) -> bool:
217-
"""Rewrite a task Dockerfile to use a base image."""
218-
new_lines = [f"FROM {base['tag']}"]
220+
"""Rewrite a task Dockerfile to use a base image from GHCR."""
221+
new_lines = [f"FROM {GHCR_PREFIX}/{base['tag']}"]
219222

220223
if task_lines:
221224
new_lines.append("")
Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
FROM ccb-repo-camel-1006f047
1+
FROM ghcr.io/sjarmak/ccb-repo-camel-1006f047

0 commit comments

Comments
 (0)