Skip to content

Commit 06f6c76

Browse files
natkeCopilot
andauthored
Bump nemotron-speech-streaming-en-0.6b-generic-cpu to v3 (#4985)
Update genai_config.json with session_options to disable intra-op spinning for encoder, decoder, and joiner. This improves performance by adding: "session_options": {"session.intra_op.allow_spinning": "0"} Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
1 parent 72cb4d7 commit 06f6c76

2 files changed

Lines changed: 5 additions & 5 deletions

File tree

assets/models/foundrylocal/nemotron-speech-streaming-en-0.6b-generic-cpu/model.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
path:
22
container_name: models
3-
container_path: foundrylocal/models/nemotron-speech-streaming-en-0.6b/onnx/cpu_and_mobile/v2
3+
container_path: foundrylocal/models/nemotron-speech-streaming-en-0.6b/onnx/cpu_and_mobile/v3
44
storage_name: foundrylocalmodels
55
type: azureblob
66
publish:

assets/models/foundrylocal/nemotron-speech-streaming-en-0.6b-generic-cpu/spec.yaml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
$schema: https://azuremlschemas.azureedge.net/latest/model.schema.json
22
name: nemotron-speech-streaming-en-0.6b-generic-cpu
3-
version: 2
3+
version: 3
44
path: ./
55
tags:
66
foundryLocal: "test"
@@ -12,7 +12,7 @@ tags:
1212
task: automatic-speech-recognition
1313
maxOutputTokens: 2048
1414
alias: nemotron-speech-streaming-en-0.6b
15-
directoryPath: v2
15+
directoryPath: v3
1616
promptTemplate: ""
1717
capabilities: ""
1818
supportsReasoning: ""
@@ -30,5 +30,5 @@ variantInfo:
3030
quantization: ['RTN']
3131
device: 'cpu'
3232
executionProvider: 'CPUExecutionProvider'
33-
fileSizeBytes: 730746286
34-
vRamFootprintBytes: 730746286
33+
fileSizeBytes: 730746534
34+
vRamFootprintBytes: 730746534

0 commit comments

Comments
 (0)