Commit e59231a

feat(inference): add Nemotron 3 Nano Omni to CLOUD_MODEL_OPTIONS (#2628)
## Summary

Adds `nvidia/nemotron-3-nano-omni-30b-a3b-reasoning` (label: *Nemotron 3 Nano Omni 30B*) to the curated cloud model picker. Super 120B remains the default.

## Motivation

The multimodal `hermes-omni-demo` cookbook in [brevdev/nemoclaw-demos](https://github.com/brevdev/nemoclaw-demos/tree/main/hermes-omni-demo) currently has to run a post-onboard `openshell inference set --model private/nvidia/nemotron-3-nano-omni-30b-a3b-reasoning` to switch the gateway from Super to Omni, because the wizard only exposes Super. The cookbook frames this as *"You picked Super 120B during onboarding because that's what the menu offers, but this cookbook needs Omni..."*, a workaround that reviewers in [brevdev/nemoclaw-demos#23](brevdev/nemoclaw-demos#23) have called out as an abuse of the installer. Adding Omni here lets users select it directly during `nemoclaw onboard` and lets multimodal cookbooks drop the manual swap step entirely.

## Test plan

- [ ] Existing `inference-config.test.ts` updated to include the new model id in the expected list; runs as part of `npm test`
- [ ] `nemoclaw onboard --agent hermes` shows Omni as an option in the model picker
- [ ] Selecting it produces a sandbox with `Model: private/nvidia/nemotron-3-nano-omni-30b-a3b-reasoning` in `nemoclaw <name> status`

## Files changed

- `src/lib/inference-config.ts`: one new entry in `CLOUD_MODEL_OPTIONS`
- `src/lib/inference-config.test.ts`: matching expected-list update
- `src/lib/model-prompts.test.ts`: menu answer index bumped for the shifted option
- `test/onboard-selection.test.ts`: onboarding answer indices bumped to match

🤖 Generated with [Claude Code](https://claude.com/claude-code)

<!-- This is an auto-generated comment: release notes by coderabbit.ai -->
## Summary by CodeRabbit

* **New Features**
  * NVIDIA Nemotron 3 Nano Omni 30B added to the cloud model selection for users to choose.
* **Tests**
  * Automated tests updated to include the new cloud model option and to reflect the adjusted selection ordering used in onboarding and default-model scenarios.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->

Signed-off-by: Aaron Erickson <aerickson@nvidia.com>

---------

Signed-off-by: Patrick Moorhead <pmoorhead@nvidia.com>
Signed-off-by: Aaron Erickson <aerickson@nvidia.com>
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Co-authored-by: Aaron Erickson 🦞 <aerickson@nvidia.com>
1 parent c9a5ff8 commit e59231a

4 files changed: 6 additions & 4 deletions

src/lib/inference-config.test.ts

Lines changed: 1 addition & 0 deletions
```diff
@@ -22,6 +22,7 @@ describe("inference selection config", () => {
   it("exposes the curated cloud model picker options", () => {
     expect(CLOUD_MODEL_OPTIONS.map((option: { id: string }) => option.id)).toEqual([
       "nvidia/nemotron-3-super-120b-a12b",
+      "nvidia/nemotron-3-nano-omni-30b-a3b-reasoning",
       "z-ai/glm-5.1",
       "minimaxai/minimax-m2.7",
       "openai/gpt-oss-120b",
```

src/lib/inference-config.ts

Lines changed: 1 addition & 0 deletions
```diff
@@ -12,6 +12,7 @@ export const INFERENCE_ROUTE_URL = "https://inference.local/v1";
 export const DEFAULT_CLOUD_MODEL = "nvidia/nemotron-3-super-120b-a12b";
 export const CLOUD_MODEL_OPTIONS = [
   { id: "nvidia/nemotron-3-super-120b-a12b", label: "Nemotron 3 Super 120B" },
+  { id: "nvidia/nemotron-3-nano-omni-30b-a3b-reasoning", label: "Nemotron 3 Nano Omni 30B" },
   { id: "z-ai/glm-5.1", label: "GLM-5" },
   { id: "minimaxai/minimax-m2.7", label: "MiniMax M2.7" },
   { id: "openai/gpt-oss-120b", label: "GPT-OSS 120B" },
```

src/lib/model-prompts.test.ts

Lines changed: 1 addition & 1 deletion
```diff
@@ -32,7 +32,7 @@ describe("model prompt helpers", () => {
   });

   it("returns DeepSeek V4 Pro from the default cloud model menu", async () => {
-    const promptFn = promptSequence(["5"]);
+    const promptFn = promptSequence(["6"]);
     const result = await promptCloudModel({
       promptFn,
       writeLine: vi.fn(),
```
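The renumbered answers in the prompt tests follow mechanically from the diff: Omni is inserted as the second picker entry, so every option after it shifts down one menu slot. A minimal sketch of that mapping, assuming the picker resolves a 1-based numeric answer against `CLOUD_MODEL_OPTIONS`; the `pickModel` helper is illustrative rather than code from this repo, and only the five entries visible in the diff are listed (the real list continues past them):

```typescript
// Entries as they appear in src/lib/inference-config.ts after this commit,
// truncated to the five options visible in the diff.
const CLOUD_MODEL_OPTIONS = [
  { id: "nvidia/nemotron-3-super-120b-a12b", label: "Nemotron 3 Super 120B" },
  { id: "nvidia/nemotron-3-nano-omni-30b-a3b-reasoning", label: "Nemotron 3 Nano Omni 30B" },
  { id: "z-ai/glm-5.1", label: "GLM-5" },
  { id: "minimaxai/minimax-m2.7", label: "MiniMax M2.7" },
  { id: "openai/gpt-oss-120b", label: "GPT-OSS 120B" },
];

// Illustrative resolver: a 1-based numeric answer selects the option
// at that position in the menu; out-of-range answers yield undefined.
function pickModel(answer: string): string | undefined {
  const index = Number.parseInt(answer, 10) - 1;
  return CLOUD_MODEL_OPTIONS[index]?.id;
}

console.log(pickModel("2")); // the newly inserted Omni id
console.log(pickModel("3")); // "z-ai/glm-5.1", shifted down one slot
```

Under this ordering, every answer that previously landed past slot 1 moves up by one digit, which is exactly the `"5"` to `"6"` and `"6"` to `"7"` churn in the test files.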

test/onboard-selection.test.ts

Lines changed: 3 additions & 3 deletions
```diff
@@ -320,7 +320,7 @@ printf '%s' "$status"
 const credentials = require(${credentialsPath});
 const runner = require(${runnerPath});

-const answers = ["1", "5"];
+const answers = ["1", "6"];
 const messages = [];

 credentials.prompt = async (message) => {
@@ -421,7 +421,7 @@ printf '%s' "$status"
 const credentials = require(${credentialsPath});
 const runner = require(${runnerPath});

-const answers = ["1", "6", "custom/provider-model"];
+const answers = ["1", "7", "custom/provider-model"];
 const messages = [];

 credentials.prompt = async (message) => {
@@ -517,7 +517,7 @@ printf '%s' "$status"
 const credentials = require(${credentialsPath});
 const runner = require(${runnerPath});

-const answers = ["1", "6", "bad/model", "z-ai/glm-5.1"];
+const answers = ["1", "7", "bad/model", "z-ai/glm-5.1"];
 const messages = [];

 credentials.prompt = async (message) => {
```
