You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
WorldModel Gym is a production-ready benchmark platform for long-horizon planning agents. It combines reproducible environments, planner/world-model baselines, a FastAPI submission service, and a Next.js leaderboard dashboard into one deployable monorepo.
3
+
WorldModel Gym is an end-to-end benchmark platform for long-horizon planning agents. It combines reproducible benchmark environments, planner and world-model baselines, a FastAPI submission service, and a polished Next.js leaderboard into one deployable monorepo.
4
4
5
-
## Screenshots
5
+
## Why This Repo Stands Out
6
6
7
-

7
+
- Reproducible benchmark tasks designed around sparse rewards, partial observability, and procedural generalization
8
+
- Modular research stack spanning environments, agents, planners, and world models
9
+
- Production-minded backend with Alembic migrations, scoped API keys, rate limiting, readiness checks, structured logging, and Prometheus metrics
10
+
- Modern frontend with a custom editorial product UI, same-origin proxying, SEO metadata, and Playwright smoke coverage
11
+
- Full-stack delivery workflow with GitHub Actions, Render deployment support, and Vercel deployment support
- Reproducible benchmark tasks for sparse rewards, partial observability, and procedural generalization
14
-
- FastAPI backend with Alembic migrations, scoped API keys, rate limiting, readiness checks, and structured logging
15
-
- Pluggable artifact storage with local and S3-compatible backends
16
-
- Next.js dashboard with proxy-based API access, seeded demo data support, metadata/SEO, and Playwright smoke tests
17
-
- CI coverage for Ruff, pytest, Next.js production builds, and browser E2E verification
18
-
19
-
## Quickstart
20
-
21
-
```bash
22
-
make setup
23
-
make demo
24
-
```
25
-
26
-
Local development uses built-in defaults. If you need overrides, export environment variables in your shell instead of committing env files to the repo.
27
-
28
-
`make demo` will:
29
-
30
-
- start the API + web stack with Docker when available
31
-
- fall back to local API execution when Docker is unavailable
Local development uses built-in defaults. If you need overrides, export environment variables in your shell or configure them in your deployment provider. Do not commit env files to the repository.
116
+
117
+
## Developer Commands
118
+
119
+
```bash
120
+
make lint
121
+
make test
122
+
make demo
123
+
make seed-demo
124
+
make create-api-key NAME=local-writer SCOPE=runs:write
0 commit comments