Fine-tuning, trajectory management, and prompt-optimization infrastructure for Eliza agents.
Adds data collection, trajectory export, prompt optimization, GPU training orchestration, benchmark evaluation, and a fine-tuning dashboard to an Eliza agent. Loaded as a plugin via trainingPlugin (exported from src/setup-routes.ts). The runtime hook entry-point is registerTrainingRuntimeHooks (src/register-runtime.ts), which registers OptimizedPromptService, nightly cron jobs, and the TrainingTriggerService. There is no automatic enable — the host runtime must call registerTrainingRuntimeHooks and register the plugin's routes.
| Group | Paths |
|---|---|
| Training | GET /api/training/status, `GET |
| Vast.ai | `GET |
| Trajectories | `GET |
| Experience | See EXPERIENCE_ROUTE_PATHS in src/routes/experience-routes.ts |
training— Fine-tuning dashboard (FineTuningView).developerOnly: true.training(xr) — Same view in XR surface.training/tui— Terminal variant (FineTuningTuiView).
TrainingTriggerService(TRAINING_TRIGGER_SERVICE = "training_trigger_service") — counts completed trajectories per task, fires prompt optimization when the threshold is reached.TrainingService— public training API; reads trajectories from the runtime, builds privacy-filtered export bundles.VastTrainingService— Vast.ai GPU job orchestration (spawntrain_vast.sh,eval_checkpoint.py).
- Trajectory export cron — nightly, bucketizes trajectories into per-task JSONL under
<state>/training/datasets/<YYYY-MM-DD>/, then optionally uploads to HuggingFace. - Skill scoring cron — nightly, scores active skills against recent trajectories and updates
SKILL.mdfrontmatter.
| Name | File |
|---|---|
instruction-search |
src/optimizers/instruction-search.ts |
prompt-evolution |
src/optimizers/prompt-evolution.ts |
gepa |
src/optimizers/gepa.ts |
bootstrap-fewshot |
src/optimizers/bootstrap-fewshot.ts |
dspy-bootstrap-fewshot |
src/dspy/optimizers/dspy-bootstrap-fewshot.ts |
dspy-copro |
src/dspy/optimizers/dspy-copro.ts |
dspy-mipro |
src/dspy/optimizers/dspy-mipro.ts |
All optimizers consume eliza_native_v1 JSONL rows and write artifacts to <stateDir>/optimized-prompts/ via OptimizedPromptService.
src/
index.ts Re-exports everything
setup-routes.ts trainingPlugin definition (routes + views)
register-runtime.ts registerTrainingRuntimeHooks — call at agent boot
cli/
train.ts CLI entry: `bun run train`
core/
training-config.ts TrainingConfig, loadTrainingConfig, saveTrainingConfig
training-orchestrator.ts triggerTraining, listRuns, loadRun, recordRun
training-collection-runner.ts Full collection pipeline (HF ingest + feeds + scenarios + benchmarks)
trajectory-export-bundle.ts Privacy-filtered export bundle builder
trajectory-export-cron.ts Nightly export cron registration
trajectory-hf-upload.ts HuggingFace JSONL uploader (shells out to hf CLI)
trajectory-task-datasets.ts Per-task JSONL export (eliza_native_v1 format)
privacy-filter.ts Anonymizer + PII/credential/geo stripping
skill-scoring-cron.ts Nightly skill eval cron
dataset-generator.ts Dataset generation utilities
scenario-runner.ts Scenario execution harness
action-benchmark-runner.ts Eliza-1 action benchmark runner
eliza1-benchmark-recipe.ts Tier/variant definitions for Eliza-1 benchmarks
cli.ts Data collection CLI (run-collection, list-collections)
context-catalog.ts Context catalog builder
context-audit.ts Context audit
replay-validator.ts Skill scoring against trajectory replays
backends/
native.ts Native in-process optimizer backend
optimizers/
instruction-search.ts Instruction-search optimizer
prompt-evolution.ts Prompt-evolution (genetic) optimizer
gepa.ts GEPA (Pareto+feedback) optimizer
bootstrap-fewshot.ts Bootstrap few-shot optimizer
scoring.ts createPromptScorer, scorePlannerAction
types.ts OptimizerName, OptimizerResult, etc.
dspy/
optimizers/ DSPy-native variants (bootstrap, COPRO, MIPRO)
signature.ts Signature DSL
predict.ts Predict module
lm-adapter.ts Runtime → DSPy LM adapter
routes/
training-routes.ts /api/training/* handlers
training-vast-routes.ts /api/training/vast/* handlers
trajectory-routes.ts /api/trajectories/* handlers
experience-routes.ts Experience service routes
services/
training-service.ts TrainingService class
training-trigger.ts TrainingTriggerService + bootstrapOptimizationFromAccumulatedTrajectories
training-vast-service.ts VastTrainingService
training-service-registry.ts getActiveTrainingService / setActiveTrainingService
training-backend-check.ts detectAvailableBackends
vast-job-store.ts VastJobStore (JSONL job state)
vast-inference-stats.ts Inference stats parsing
vast-subprocess.ts runCapture / runDetachedToLog
ui/
FineTuningView.tsx Dashboard React component
fine-tuning-panels.tsx Panel sub-components
bun run --cwd plugins/plugin-training train # Run native optimizer CLI
bun run --cwd plugins/plugin-training collect # Run data collection (CLI)
bun run --cwd plugins/plugin-training test # Run vitest suite
bun run --cwd plugins/plugin-training test:watch # Vitest in watch mode
bun run --cwd plugins/plugin-training build # Full build (JS + views + types)
bun run --cwd plugins/plugin-training build:js # tsup JS bundle only
bun run --cwd plugins/plugin-training build:views # Vite views bundle only
bun run --cwd plugins/plugin-training build:types # TypeScript declarations
bun run --cwd plugins/plugin-training clean # Remove dist/| Var | Required | Purpose |
|---|---|---|
ELIZA_STATE_DIR |
no | State root override (default ~/.eliza) |
ELIZA_DISABLE_TRAINING_CRONS |
no | Set to 1/true/yes to skip cron registration |
ELIZA_DISABLE_AUTO_BOOTSTRAP |
no | Disable auto-bootstrap of prompt optimization on start |
TRAIN_MODEL |
no | Model id for native optimizer (overrides runtime default) |
TRAIN_MODEL_PROVIDER |
no | Provider for TRAIN_MODEL |
TRAIN_OPTIMIZER |
no | Default optimizer name |
TRAINING_PROVIDER |
no | Provider used during training runs |
ELIZA_TRAJECTORY_HF_REPO |
no | org/dataset — enables HuggingFace JSONL upload after nightly export |
HF_TOKEN |
no | HuggingFace token (canonical). Fallbacks: HUGGINGFACE_HUB_TOKEN, HUGGING_FACE_HUB_TOKEN |
ELIZA_TRAJECTORY_DIR |
no | Override trajectory storage dir |
ELIZA_TEST_TRAJECTORY_DIR |
no | Override dir for test trajectory collection |
ELIZA_ACTION_BENCHMARK_TRAJECTORY_DIR |
no | Override dir for action benchmark trajectories |
ELIZA_INFERENCE_STATS_PATH |
no | Override inference stats JSONL path |
ELIZA_VAST_MAX_USD |
no | Budget cap for Vast.ai GPU jobs |
ANTHROPIC_API_KEY |
no | Required for Anthropic-backed optimizer runs |
OPENAI_API_KEY |
no | Required for OpenAI-backed optimizer runs |
CEREBRAS_API_KEY |
no | Required for Cerebras benchmark comparisons |
CEREBRAS_MODEL |
no | Model id for Cerebras benchmark |
LOCAL_LLAMA_CPP_API_KEY |
no | API key for local llama.cpp endpoint |
Training config is persisted at <stateDir>/training/config.json. Key fields: autoTrain (bool, default true), triggerThreshold (int, default 100 trajectories per task), triggerCooldownHours (default 12), backends (default ["native"]), perTaskOverrides.
Add a new optimizer:
- Create
src/optimizers/<name>.tsimplementing the optimizer function. - Export it from
src/optimizers/index.ts. - Add the name to
OptimizerNameunion insrc/optimizers/types.ts. - Wire it in
src/backends/native.tswhereNATIVE_OPTIMIZERSand the dispatch switch live. - Add it to the
--optimizerhelp text insrc/cli/train.ts.
Add a new API route group:
- Create
src/routes/<group>-routes.tswith a handler function. - Export the handler and path list from
src/routes/index.ts. - Register paths and handler in
src/setup-routes.tsfollowing the existingTRAINING_ROUTES/VAST_ROUTESpattern.
Add a new service:
- Create
src/services/<name>.ts. - Export from
src/services/index.ts. - Register via
runtime.registerService(...)inregisterTrainingRuntimeHooksif it must be available at runtime.
- No actions or providers. This plugin registers only routes, views, and services. There are no
actions,providers, orevaluatorsfields ontrainingPlugin. - Privacy filter is mandatory before any disk write or upload. Always run
applyPrivacyFilter/buildTrajectoryExportBundle— never write raw trajectories. - Native backend is in-process. It calls
runtime.useModel(ModelType.TEXT_LARGE, ...)directly. No HTTP server subprocess. - Vast.ai backend shells out to Python/bash scripts under
eliza/packages/training/scripts/. That directory must exist at runtime for Vast routes to work. ELIZA_DISABLE_TRAINING_CRONS=1must be set manually in tests to avoid cron registration side-effects; it is not set automatically by the framework.trainingPlugin.nameis@elizaos/plugin-training-routes(not@elizaos/plugin-training) — this is the name the route registry sees.- Views are
developerOnly: true. They do not appear in production without an explicit developer-mode flag. - The build has two distinct bundles: JS (tsup) and views (Vite,
vite.config.views.ts). Runbuild:jsandbuild:viewsseparately when iterating.