docs/content.zh/docs/custom-resource/autoscaler.md
13 additions & 3 deletions
@@ -156,8 +156,17 @@ A target of `0.6` means we are targeting 60% utilization/load for the job vertex
 
 In general, it's not recommended to set target utilization close to 100% as performance usually degrades as we reach capacity limits in most real world systems.
 
-In addition to the utilization target we can set a utilization boundary, that serves as extra buffer to avoid immediate scaling on load fluctuations.
-Setting `job.autoscaler.target.utilization.boundary: "0.2"` means that we allow 20% deviation from the target utilization before triggering a scaling action.
+To control when scaling actions are triggered, users can set upper and lower utilization thresholds using `job.autoscaler.utilization.max` and `job.autoscaler.utilization.min`.
+These define the utilization range within which the autoscaler will not take any scaling action, providing a buffer against load fluctuations.
+
+For example, with a target utilization of `0.6`, setting `job.autoscaler.utilization.max: "0.8"` and `job.autoscaler.utilization.min: "0.4"` means:
+- Scale-up is triggered when utilization exceeds `0.8`
+- Scale-down is triggered when utilization drops below `0.4`
+- No scaling action is taken while utilization remains between `0.4` and `0.8`
+
+{{< hint info >}}
+The default thresholds are `target + 0.3` for max and `target - 0.3` for min.
docs/content/docs/custom-resource/autoscaler.md
13 additions & 3 deletions
@@ -156,8 +156,17 @@ A target of `0.6` means we are targeting 60% utilization/load for the job vertex
 
 In general, it's not recommended to set target utilization close to 100% as performance usually degrades as we reach capacity limits in most real world systems.
 
-In addition to the utilization target we can set a utilization boundary, that serves as extra buffer to avoid immediate scaling on load fluctuations.
-Setting `job.autoscaler.target.utilization.boundary: "0.2"` means that we allow 20% deviation from the target utilization before triggering a scaling action.
+To control when scaling actions are triggered, users can set upper and lower utilization thresholds using `job.autoscaler.utilization.max` and `job.autoscaler.utilization.min`.
+These define the utilization range within which the autoscaler will not take any scaling action, providing a buffer against load fluctuations.
+
+For example, with a target utilization of `0.6`, setting `job.autoscaler.utilization.max: "0.8"` and `job.autoscaler.utilization.min: "0.4"` means:
+- Scale-up is triggered when utilization exceeds `0.8`
+- Scale-down is triggered when utilization drops below `0.4`
+- No scaling action is taken while utilization remains between `0.4` and `0.8`
+
+{{< hint info >}}
+The default thresholds are `target + 0.3` for max and `target - 0.3` for min.
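
As a rough illustration of the behavior the added text describes, the Python sketch below models the threshold check. It is not the autoscaler's actual implementation: the function `scaling_decision`, its parameter names, and the returned labels are hypothetical. Only the comparison rules and the `target ± 0.3` defaults are taken from the documentation text in this change.

```python
from typing import Optional

# From the docs text in this diff: defaults are target + 0.3 and target - 0.3.
DEFAULT_MARGIN = 0.3


def scaling_decision(
    utilization: float,
    target: float,
    utilization_max: Optional[float] = None,
    utilization_min: Optional[float] = None,
) -> str:
    """Hypothetical helper that mirrors the documented threshold rules."""
    # Fall back to the documented defaults when a threshold is not set.
    upper = target + DEFAULT_MARGIN if utilization_max is None else utilization_max
    lower = target - DEFAULT_MARGIN if utilization_min is None else utilization_min

    if utilization > upper:   # "exceeds" the max threshold
        return "scale-up"
    if utilization < lower:   # "drops below" the min threshold
        return "scale-down"
    return "none"             # within [lower, upper]: no scaling action


if __name__ == "__main__":
    # Values from the documented example: target 0.6, max 0.8, min 0.4.
    for u in (0.35, 0.5, 0.85):
        print(u, scaling_decision(u, 0.6, 0.8, 0.4))
```

With the documented example values, a utilization of `0.85` maps to a scale-up, `0.35` to a scale-down, and anything from `0.4` to `0.8` to no action.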