Skip to content

spec: canonical canon_name format \u2014 mandatory numeric format token #93

@gHashTag

Description

@gHashTag

Canonical canon_name spec (per operator instruction 2026-05-02 + trios#445)

Format

IGLA-{LANE}-{format}-h{H}-LR{LR4}-rng{SEED}-{TAG}
Token Source Example
IGLA- mandatory prefix IGLA-
{LANE} wave + phase WAVE-GF-001-PHASE1, BLITZ-T10H, CHAMPION-acc0new, RAILWAY-LEADER
{format} MANDATORY numeric format token binary32, binary16, bfloat16, TF32, FP8-E4M3, FP8-E5M2, GF4, GF8, GF16, GF32, Posit8, Posit16, Posit32, INT4, INT8, INT16, UINT8, NF4, MXFP8, MXFP6, MXFP4, AFP, LNS, ...
h{H} hidden dim h384, h828
LR{LR4} LR \u00d7 10000 zero-padded 4-digit LR0030 (lr=0.0030), LR0573 (lr=0.0573)
rng{SEED} seed rng34, rng1597
{TAG} optional experiment tag k3 (phi-LR ladder index), step27000, t55803-001

Canonical numeric format catalog (mandatory)

binary16, binary32, binary64, binary128, binary256, decimal32, decimal64, decimal128, FP80, double-double, quad-double, bfloat16, TF32, FP8-E4M3, FP8-E5M2, FP6-E3M2, FP6-E2M3, FP4-E2M1, MXFP8, MXFP6, MXFP4, NF4, AFP, Posit8, Posit16, Posit32, Posit64, LNS, GF4, GF8, GF12, GF16, GF20, GF24, GF32, GF64, INT4, INT8, INT16, INT32, INT64, INT128, UINT4, UINT8, UINT16, UINT32, UINT64, UINT128, Q-format, BCD, IBM-HFP, MBF, VAX-F, VAX-D, VAX-G, VAX-H, Cray-float, minifloat, Unum-I, Unum-II, tapered-fp, block-fp, shared-exp, stochastic-rounding

Mapping from config_json.trainer.format token

config token canon token
fp16, f16 binary16
fp32, f32 binary32
fp64, f64 binary64
bf16 bfloat16
tf32 TF32
fp8_e4m3 FP8-E4M3
fp8_e5m2 FP8-E5M2
gf16 GF16
posit32 Posit32
... (full list above)

Examples

  • IGLA-WAVE-GF-001-PHASE1-binary32-h384-LR0573-rng34-k3 \u2014 phi-LR ladder, fp32 baseline, k=3
  • IGLA-WAVE-GF-001-PHASE4-GF16-h384-LR0040-rng34 \u2014 Phase-4 GF16 dtype sweep
  • IGLA-CHAMPION-binary32-h828-LR0030-rng43-step27000 \u2014 champion repro
  • IGLA-BLITZ-T10H-binary32-h256-LR0020-rng45-t55803-001 \u2014 legacy BLITZ wave (canon was missing format token; future renames should add it)

Rationale

The numeric format is the primary independent variable in the IGLA RACE / WAVE-GF-001 / Universal Numeric-Format Catalog work (zig-golden-float#69). A canon_name without the format token is meaningless because phi-LR ladder convergence, BPB, and Pareto-frontier comparisons are all keyed by format.

Forbidden

  • Names without an explicit format token: GF-LR-k3-rng34, SMOKE-PR82-FULLCYCLE-rng34, DIRECT-SMOKE-PR86-rng34, VERIFY-PR88-FRESH-rng99, trios-train-rng43
  • Lowercase format tokens: fp32, bf16 \u2014 must be binary32, bfloat16
  • Ad-hoc tokens not in the canonical catalog above
  • Truncated names (IGLA-rng34)

Enforcement

  • CI \u2014 strategy_queue insert trigger validates canon_name regex ^IGLA-[A-Z0-9-]+-({CANON_FMT_RE})-h\d+-LR\d{4}-rng\d+(-[a-z0-9]+)*$
  • scarab \u2014 reject claim if canon_name does not match regex (logs and skips)
  • Operator \u2014 retroactive rename of any non-canonical rows in bpb_samples + strategy_queue

Retroactive cleanup applied 2026-05-02

Old New
GF-LR-k3-rng34 IGLA-WAVE-GF-001-PHASE1-binary32-h384-LR0573-rng34-k3
(\u00d7 9 phi-LR rows in strategy_queue 2062..2070)
80 bpb_samples rows renamed
26 SMOKE/VERIFY/DIRECT rows purged

References

\ud83c\udf3b phi^2 + phi^-2 = 3 \u00b7 TRINITY \u00b7 NEVER STOP

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions