Skip to content

feat: add LLM Gateway kimi-k2.7-code, nemotron-3-ultra-550b, grok-build-0-1#2551

Merged
rekram1-node merged 2 commits into
anomalyco:devfrom
smakosh:feat/llmgateway-newest-text-models
Jun 14, 2026
Merged

feat: add LLM Gateway kimi-k2.7-code, nemotron-3-ultra-550b, grok-build-0-1#2551
rekram1-node merged 2 commits into
anomalyco:devfrom
smakosh:feat/llmgateway-newest-text-models

Conversation

@smakosh

@smakosh smakosh commented Jun 13, 2026

Copy link
Copy Markdown
Contributor

Adds the newest text models from the LLM Gateway catalog that aren't yet in the repo, using the base_model structure to inherit from the canonical model registry with gateway-specific cost overrides.

Models added (under providers/llmgateway/models/)

Model base_model Released Cost (in/out, $/1M)
kimi-k2.7-code moonshotai/kimi-k2.7-code 2026-06-12 0.95 / 4 (cache_read 0.19, [interleaved])
nemotron-3-ultra-550b nvidia/nemotron-3-ultra-550b-a55b 2026-06-04 0.5 / 2.5 (cache_read 0.15)
grok-build-0-1 xai/grok-build-0.1 2026-04-16 1 / 2 (cache_read 0.2, +200k context tier 2/4/0.4)

Notes

  • Cost values reflect LLM Gateway's pricing; all other metadata (limits, modalities, family, reasoning) is inherited from the existing registry base specs.
  • Image / video / TTS / embedding models from the gateway are intentionally excluded — this PR is text models only.
  • bun validate passes; all three resolve correctly.

🤖 Generated with Claude Code

…ld-0-1

Newest text models from the LLM Gateway catalog, using the base_model
structure to inherit from the canonical model registry with gateway-specific
cost overrides.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@rekram1-node

Copy link
Copy Markdown
Collaborator

plz add reasoning_options

Addresses review feedback: kimi-k2.7-code and grok-build-0-1 use the
effort (low/medium/high) option matching the kimi/grok gateway models;
nemotron-3-ultra-550b uses a reasoning toggle per its nvidia source.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@smakosh

smakosh commented Jun 14, 2026

Copy link
Copy Markdown
Contributor Author

Done — added reasoning_options to all three:

  • kimi-k2.7-code and grok-build-0-1: [{ type = "effort", values = ["low", "medium", "high"] }] (matching the other kimi/grok gateway models)
  • nemotron-3-ultra-550b: [{ type = "toggle" }] per its nvidia source definition

bun validate passes.

@rekram1-node rekram1-node merged commit 8928213 into anomalyco:dev Jun 14, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants