CLI overrides

Any JobConfig field reachable through dotted attributes can be overridden from the command line with --section.key=value. Parsing happens in _parse_cli_overrides in kempnerforge/config/loader.py; type coercion happens in _coerce_value in the same file.

Syntax

uv run python scripts/train.py configs/train/debug.toml \
    --model.dim=512 \
    --train.compile_model=false \
    --optimizer.lr=1e-4 \
    --optimizer.betas="[0.9,0.95]" \
    --distributed.tp=2

Each flag must begin with --. Parsing splits once on =:

Left of = is the dotted path. It builds a nested dict: --model.dim=512 becomes {"model": {"dim": 512}}.
Right of = is the literal value. The loader calls ast.literal_eval on it. If that raises, the raw string is kept.

Value parsing

ast.literal_eval accepts Python literals:

CLI snippet	Parsed as
`--model.dim=512`	`int(512)`
`--optimizer.lr=3e-4`	`float(0.0003)`
`--train.compile_model=true`	`bool(True)` (via coercion — see below)
`--optimizer.betas=[0.9,0.95]`	`list → tuple(0.9, 0.95)` (coerced to tuple by field type)
`--checkpoint.load_path=/scratch/ckpt`	`str("/scratch/ckpt")` (literal_eval fails, keep as string)
`--data.anneal_weights={"c4":0.7,"books":0.3}`	`dict` (parsed into `DataConfig.anneal_weights`)

Quote any value the shell would otherwise eat:

--optimizer.betas="[0.9,0.95]"   # list literal with spaces/brackets
--data.dataset_path="/n/holylfs06/.../shards"

Lists of dataclasses (DataConfig.datasets, DataConfig.phases) are not ergonomic on the CLI — pass them in the TOML preset instead. The loader will accept them if you really want to, by expanding each entry to a dict literal, but the TOML version is readable:

[[data.datasets]]
path = "/scratch/c4"
weight = 0.7
name = "c4"

[[data.datasets]]
path = "/scratch/books"
weight = 0.3
name = "books"

Boolean shorthand

A flag with no = becomes True:

--train.compile_model     # equivalent to --train.compile_model=true

For False, write it explicitly: --train.compile_model=false.

The bool coercion also accepts the strings "true", "1", "yes" (case-insensitive) as True; everything else falls back to bool(value).

Type coercion

After ast.literal_eval produces a raw value, _coerce_value walks the dataclass type hint and coerces:

Field type	Input	Result
`int`	`"512"` or `3.0`	`int(…)`
`float`	`"3e-4"` or `1`	`float(…)`
`bool`	`"true"`, `"1"`, `"yes"`	`True`; else `bool(value)`
`tuple[float, float]`	`[0.9, 0.95]`	`(0.9, 0.95)`
`list[str]`	`"a"`	`["a"]` (wraps bare scalar)
`list[DatasetSource]`	`[{"path": …}]`	`[DatasetSource(path=…)]` (recursive)
`Literal["bf16", "fp16", "fp32", "fp8"]`	anything else	`ValueError`
`StrEnum` (`NormType`, `SchedulerType`, …)	`"rmsnorm"`	`NormType.rmsnorm`
`X \| None` (Optional)	`None` → `None`; else coerce to `X`

Literal fields reject unknown strings at coercion time — a typo like --train.mixed_precision=bfloat16 (should be "bf16") fails before training starts.

Unknown keys

_apply_dict_to_dataclass rejects keys not declared on the dataclass. A typo surfaces immediately:

$ uv run python scripts/train.py configs/train/debug.toml --model.dimm=512
ValueError: Unknown config keys in ModelConfig: ['dimm'].
Valid keys: ['activation', 'dim', ...]

Inspecting the final config

Print the resolved config before the training loop begins by adding the flag to scripts/train.py or dumping it from a REPL:

from kempnerforge.config import load_config
from dataclasses import asdict
import json

config = load_config("configs/train/debug.toml",
                     cli_args=["--model.dim=512"])
print(json.dumps(asdict(config), indent=2, default=str))

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLI overrides

Syntax

Value parsing

Boolean shorthand

Type coercion

Unknown keys

Inspecting the final config

See also

FilesExpand file tree

cli-overrides.md

Latest commit

History

cli-overrides.md

File metadata and controls

CLI overrides

Syntax

Value parsing

Boolean shorthand

Type coercion

Unknown keys

Inspecting the final config

See also