Commit 52d8d22

Update local metadata to include new directoryPath and promptTemplate tags (#4152)
* Add directoryPath and promptTemplate for models that have them
* Add remaining directoryPath tags
* Add missing Phi model promptTemplate tags
* Add missing Phi-4 and DeepSeek promptTemplate tags
* Remove Phi-4-mini prompt templates for now
* Fix promptTemplates for Phi-4 models
1 parent 9f50c1f commit 52d8d22

42 files changed

Lines changed: 66 additions & 0 deletions

assets/models/system/DeepSeek-R1-Distill-Qwen-1.5B-cuda-gpu/spec.yaml

Lines changed: 2 additions & 0 deletions
@@ -12,6 +12,8 @@ tags:
 task: chat-completion
 maxOutputTokens: 2048
 alias: deepseek-r1-1.5b
+directoryPath: cuda-int4-rtn-block-32
+promptTemplate: "{\"assistant\": \"{Content}\", \"prompt\": \"\\\\u003C\\\\uFF5CUser\\\\uFF5C\\\\u003E{Content}\\\\u003C\\\\uFF5CAssistant\\\\uFF5C\\\\u003E\"}"
 type: custom_model
 variantInfo:
 parents:
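
The `promptTemplate` value above is a JSON object stored inside a double-quoted YAML string, and its `\u` sequences are escaped twice. The sketch below shows one way such a value could be decoded. It assumes a YAML parser has already processed the scalar (so `\"` became `"` and `\\` became `\`); the loading step and the `render` helper are hypothetical and not part of this commit.

```python
import json

# The tag value as it would look after YAML decoding of the double-quoted
# scalar. This is an assumption about the loading step, not code from the commit.
yaml_decoded = (
    r'{"assistant": "{Content}", '
    r'"prompt": "\\u003C\\uFF5CUser\\uFF5C\\u003E{Content}'
    r'\\u003C\\uFF5CAssistant\\uFF5C\\u003E"}'
)

# Layer 1: the tag value is itself a JSON object keyed by role.
template = json.loads(yaml_decoded)

# After layer 1 the prompt still holds literal "\u003C"-style sequences,
# because the backslashes were escaped twice in the YAML source.
raw_prompt = template["prompt"]

# Layer 2: decode the remaining \uXXXX sequences by parsing the text as a
# JSON string literal (fine here: it contains no quotes or control characters).
prompt = json.loads(f'"{raw_prompt}"')


def render(prompt_template: str, content: str) -> str:
    """Hypothetical helper: substitute the {Content} placeholder."""
    return prompt_template.replace("{Content}", content)


rendered = render(prompt, "Why is the sky blue?")
print(rendered)
```

Whether the consuming runtime performs the second decoding step itself is not shown in this diff, so treat the two-layer decode as an illustration of the escaping rather than a description of the actual loader.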

assets/models/system/DeepSeek-R1-Distill-Qwen-1.5B-generic-cpu/spec.yaml

Lines changed: 2 additions & 0 deletions
@@ -12,6 +12,8 @@ tags:
 task: chat-completion
 maxOutputTokens: 2048
 alias: deepseek-r1-1.5b
+directoryPath: cpu-int4-rtn-block-32-acc-level-4
+promptTemplate: "{\"assistant\": \"{Content}\", \"prompt\": \"\\\\u003C\\\\uFF5CUser\\\\uFF5C\\\\u003E{Content}\\\\u003C\\\\uFF5CAssistant\\\\uFF5C\\\\u003E\"}"
 type: custom_model
 variantInfo:
 parents:

assets/models/system/DeepSeek-R1-Distill-Qwen-1.5B-generic-gpu/spec.yaml

Lines changed: 2 additions & 0 deletions
@@ -12,6 +12,8 @@ tags:
 task: chat-completion
 maxOutputTokens: 2048
 alias: deepseek-r1-1.5b
+directoryPath: directml-int4-rtn-block-32-acc-level-4
+promptTemplate: "{\"assistant\": \"{Content}\", \"prompt\": \"\\\\u003C\\\\uFF5CUser\\\\uFF5C\\\\u003E{Content}\\\\u003C\\\\uFF5CAssistant\\\\uFF5C\\\\u003E\"}"
 type: custom_model
 variantInfo:
 parents:

assets/models/system/DeepSeek-R1-Distill-Qwen-7B-cuda-gpu/spec.yaml

Lines changed: 2 additions & 0 deletions
@@ -12,6 +12,8 @@ tags:
 task: chat-completion
 maxOutputTokens: 2048
 alias: deepseek-r1-7b
+directoryPath: cuda-int4-rtn-block-32
+promptTemplate: "{\"assistant\": \"{Content}\", \"prompt\": \"\\\\u003C\\\\uFF5CUser\\\\uFF5C\\\\u003E{Content}\\\\u003C\\\\uFF5CAssistant\\\\uFF5C\\\\u003E\"}"
 type: custom_model
 variantInfo:
 parents:

assets/models/system/DeepSeek-R1-Distill-Qwen-7B-generic-cpu/spec.yaml

Lines changed: 2 additions & 0 deletions
@@ -12,6 +12,8 @@ tags:
 task: chat-completion
 maxOutputTokens: 2048
 alias: deepseek-r1-7b
+directoryPath: cpu-int4-rtn-block-32-acc-level-4
+promptTemplate: "{\"assistant\": \"{Content}\", \"prompt\": \"\\\\u003C\\\\uFF5CUser\\\\uFF5C\\\\u003E{Content}\\\\u003C\\\\uFF5CAssistant\\\\uFF5C\\\\u003E\"}"
 type: custom_model
 variantInfo:
 parents:

assets/models/system/DeepSeek-R1-Distill-Qwen-7B-generic-gpu/spec.yaml

Lines changed: 2 additions & 0 deletions
@@ -12,6 +12,8 @@ tags:
 task: chat-completion
 maxOutputTokens: 2048
 alias: deepseek-r1-7b
+directoryPath: directml-int4-rtn-block-32-acc-level-4
+promptTemplate: "{\"assistant\": \"{Content}\", \"prompt\": \"\\\\u003C\\\\uFF5CUser\\\\uFF5C\\\\u003E{Content}\\\\u003C\\\\uFF5CAssistant\\\\uFF5C\\\\u003E\"}"
 type: custom_model
 variantInfo:
 parents:

assets/models/system/Mistral-7B-Instruct-v0-2-cuda-gpu/spec.yaml

Lines changed: 2 additions & 0 deletions
@@ -12,6 +12,8 @@ tags:
 task: chat-completion
 maxOutputTokens: 2048
 alias: mistral-7b-v0.2
+directoryPath: mistral-7b-instruct-v0.2-cuda-int4-rtn-block-32
+promptTemplate: "{\"system\": \"<s>\", \"user\": \"[INST]\\n{Content}\\n[/INST]\", \"assistant\": \"{Content}</s>\", \"prompt\": \"[INST]\\n{Content}\\n[/INST]\"}"
 type: custom_model
 variantInfo:
 parents:
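
The Mistral template above defines one entry per role (`system`, `user`, `assistant`) plus a `prompt` entry. As a rough sketch of how these entries might be combined into a single prompt string, the example below renders earlier turns with their role entry and the newest user input with `prompt`. The `build_prompt` function, the turn ordering, and plain concatenation are all assumptions for illustration; the commit does not document how the consumer applies the template.

```python
import json

# Tag value after YAML decoding (assumed loading step, not part of this commit).
yaml_decoded = (
    r'{"system": "<s>", "user": "[INST]\n{Content}\n[/INST]", '
    r'"assistant": "{Content}</s>", "prompt": "[INST]\n{Content}\n[/INST]"}'
)
template = json.loads(yaml_decoded)  # the JSON "\n" escapes become real newlines


def build_prompt(template: dict, history: list, user_input: str) -> str:
    """Hypothetical renderer: earlier turns use their role's entry and the
    newest user input uses the "prompt" entry."""
    parts = []
    for role, content in history:
        # For entries without a {Content} placeholder (here "system"),
        # replace() leaves the text unchanged.
        parts.append(template[role].replace("{Content}", content))
    parts.append(template["prompt"].replace("{Content}", user_input))
    return "".join(parts)


history = [("system", ""), ("user", "Hi"), ("assistant", "Hello!")]
text = build_prompt(template, history, "Name one prime number.")
print(text)
```

Note that the `system` entry is a bare `<s>` with no `{Content}` placeholder, so under this sketch any system message text would be dropped.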

assets/models/system/Mistral-7B-Instruct-v0-2-generic-cpu/spec.yaml

Lines changed: 2 additions & 0 deletions
@@ -12,6 +12,8 @@ tags:
 task: chat-completion
 maxOutputTokens: 2048
 alias: mistral-7b-v0.2
+directoryPath: mistral-7b-instruct-v0.2-cpu-int4-rtn-block-32
+promptTemplate: "{\"system\": \"<s>\", \"user\": \"[INST]\\n{Content}\\n[/INST]\", \"assistant\": \"{Content}</s>\", \"prompt\": \"[INST]\\n{Content}\\n[/INST]\"}"
 type: custom_model
 variantInfo:
 parents:

assets/models/system/Mistral-7B-Instruct-v0-2-generic-gpu/spec.yaml

Lines changed: 2 additions & 0 deletions
@@ -12,6 +12,8 @@ tags:
 task: chat-completion
 maxOutputTokens: 2048
 alias: mistral-7b-v0.2
+directoryPath: mistralai_Mistral-7B-Instruct-v0.2
+promptTemplate: "{\"system\": \"<s>\", \"user\": \"[INST]\\n{Content}\\n[/INST]\", \"assistant\": \"{Content}</s>\", \"prompt\": \"[INST]\\n{Content}\\n[/INST]\"}"
 type: custom_model
 variantInfo:
 parents:

assets/models/system/Phi-4-cuda-gpu/spec.yaml

Lines changed: 1 addition & 0 deletions
@@ -12,6 +12,7 @@ tags:
 task: chat-completion
 maxOutputTokens: 2048
 alias: phi-4
+directoryPath: cuda-int4-rtn-block-32
 type: custom_model
 variantInfo:
 parents:
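
Unlike the DeepSeek and Mistral specs, this Phi-4 hunk adds only `directoryPath`. A small check like the following could flag which specs carry which of the two new tags. It is a sketch only: the tag dictionaries are copied by hand from two hunks in this diff, the template text is elided, and `EXPECTED_TAGS` and `missing_tags` are hypothetical names. The commit message says the tags were added "for models that have them", so a missing `promptTemplate` is not necessarily an error.

```python
# Tags this commit introduces; treating both as "expected" is an assumption.
EXPECTED_TAGS = ("directoryPath", "promptTemplate")

# Hand-copied from two hunks of this diff; the template text is elided.
specs = {
    "Mistral-7B-Instruct-v0-2-generic-gpu": {
        "alias": "mistral-7b-v0.2",
        "directoryPath": "mistralai_Mistral-7B-Instruct-v0.2",
        "promptTemplate": "<elided>",
    },
    "Phi-4-cuda-gpu": {
        "alias": "phi-4",
        "directoryPath": "cuda-int4-rtn-block-32",
    },
}


def missing_tags(tags: dict) -> list:
    """Return the expected tag names that are absent from one spec's tags."""
    return [name for name in EXPECTED_TAGS if name not in tags]


# Map each spec name to the tags it lacks, keeping only specs with gaps.
report = {
    name: missing
    for name, tags in specs.items()
    if (missing := missing_tags(tags))
}
print(report)
```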
