This configMap is used for global configurations, like resourceName, mode, templates.
- * By setting `hamiVnpuCore: true` at the top level, **all nodes** will enable soft-partitioning based on `hami-vnpu-core`.
+ * Under `vnpus`, set `hamiVnpuCore: true` so **all nodes** advertise soft-partitioning based on `hami-vnpu-core` to the scheduler (unless overridden per node in `hami-device-node-config`).
- The `hami-device-node-config` is used to enable or override hami-vnpu-core for specific nodes within the cluster. Node-level settings take higher priority than the global `hamiVnpuCore` switch.
+ The `hami-device-node-config` is used to enable or override hami-vnpu-core for specific nodes within the cluster. Node-level settings take higher priority than the global `vnpus.hamiVnpuCore` switch.
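The precedence rule described here can be sketched in a few lines of Python. This is only an illustration of the stated priority, not the plugin's actual code: the function name `effective_hami_vnpu_core` and the use of `None` to mean "this node has no entry in `hami-device-node-config`" are assumptions made for the example.

```python
from typing import Optional


def effective_hami_vnpu_core(global_flag: bool, node_override: Optional[bool]) -> bool:
    """Return the setting a node ends up with (hypothetical helper).

    global_flag   -- value of the global `vnpus.hamiVnpuCore` switch
    node_override -- value set for this node in `hami-device-node-config`,
                     or None when the node has no entry (assumed encoding)
    """
    # A node-level setting, when present, wins over the global switch.
    if node_override is not None:
        return node_override
    return global_flag
```

Under these assumptions, a node with an explicit `False` override stays on the legacy path even when the global switch is `True`, and a node with no entry simply inherits the global value.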
@@ -104,8 +104,14 @@ To exclusively use an entire card or request multiple cards, you only need to se
### Usage in HAMi
+ **How HAMi chooses soft vs legacy vNPU:** The device plugin applies **soft slicing** (`libvnpu` / `hami-vnpu-core` mounts and environment) **only** when the Pod sets `huawei.com/vnpu-mode: hami-core`. Pods **without** this annotation still follow the **original vNPU** path (virtualization templates and `ASCEND_VNPU_SPECS`). These two paths are different. If your cluster effectively has **only** soft-slicing–oriented Ascend capacity (for example every node is configured for `hami-vnpu-core` and workloads are expected to use soft slicing), Pods that **omit** `vnpu-mode=hami-core` may remain **Pending** because they still request the legacy vNPU allocation model, which may not match what those nodes expose or how the scheduler pairs Pods to nodes.
+
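The selection rule in the paragraph above can be pictured as a simple check on the Pod's annotations. The sketch below is a hedged illustration, not the device plugin's implementation: the function name `vnpu_path` and the return labels `"soft"` and `"legacy"` are hypothetical; only the annotation key and value come from the text.

```python
from typing import Mapping, Optional

VNPU_MODE_ANNOTATION = "huawei.com/vnpu-mode"  # annotation key named in the text
SOFT_MODE_VALUE = "hami-core"                  # value that opts in to soft slicing


def vnpu_path(annotations: Optional[Mapping[str, str]]) -> str:
    """Classify a Pod as 'soft' or 'legacy' from its annotations (illustrative labels)."""
    # Only an exact `huawei.com/vnpu-mode: hami-core` selects soft slicing.
    if (annotations or {}).get(VNPU_MODE_ANNOTATION) == SOFT_MODE_VALUE:
        return "soft"
    # Missing annotation or any other value: the original template-based vNPU path.
    return "legacy"
```

For example, `vnpu_path({"huawei.com/vnpu-mode": "hami-core"})` yields the soft-slicing label, while a Pod with no annotations at all falls through to the legacy label, which is the case the paragraph warns may leave Pods Pending on soft-slicing-only clusters.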
```yaml
  ...
+ metadata:
+   name: ascend-soft-slice-pod
+   annotations:
+     huawei.com/vnpu-mode: 'hami-core' # Enables hami-vnpu-core soft-segmentation for this pod
  containers:
  - name: npu_pod
    ...
@@ -120,6 +126,8 @@ For more examples, see [examples](https://github.com/Project-HAMi/ascend-device-
### Soft Slicing Configuration (HAMi)
+ Use the annotation below whenever you intend **soft** slicing; omitting it keeps **template-based vNPU** behavior (see the note under [Usage in HAMi](#usage-in-hami)).