Commit 44487d8
authored
fix: Set default embedding max batch size to 1024 (#1116)
This change lowers the default embedding request batch size based on
observed production-like logs under S0 throttling, where 2048 repeatedly
hit 429 and 1024 progressed successfully.
## Changes
- `EmbeddingRetryOptions.MaxEmbeddingBatchSize` default: `2048` ->
`1024`
- `AIOptions:EmbeddingRetry:MaxEmbeddingBatchSize` in
`EssentialCSharp.Web/appsettings.json`: `2048` -> `1024`
## Why
Recent run data showed sustained retry exhaustion at 2048 and successful
completion after adaptive downshift to 1024. Setting 1024 as the default
improves out-of-the-box behavior under throttled tiers while preserving
configurability.1 parent 920c021 commit 44487d8
2 files changed
Lines changed: 2 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
39 | 39 | | |
40 | 40 | | |
41 | 41 | | |
42 | | - | |
| 42 | + | |
43 | 43 | | |
44 | 44 | | |
45 | 45 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
28 | | - | |
| 28 | + | |
29 | 29 | | |
30 | 30 | | |
31 | 31 | | |
| |||
0 commit comments