Commit 2fbc950
Add max_retries and retry_interval to [(api_)database] conf
Nova's default database connection retry settings (max_retries=-1,
retry_interval=10) cause nova-api pods to hang indefinitely when a
cell database is unavailable. This prevents the pods from becoming
ready within the Kubernetes startup probe window (60 seconds),
leading to a CrashLoopBackOff.
In RHOSO, Kubernetes provides its own higher-level retry mechanism
by killing and recreating pods that fail to start. This is
preferable to Nova retrying internally because it reports the
situation clearly via CR status fields and events, and allows
Kubernetes to reschedule the pod to another worker if needed.
Set max_retries to 3 and retry_interval to 1 second for both
[database] and [api_database] so that Nova gives up on an
unreachable database quickly and lets Kubernetes handle the
recovery.
Resolves: OSPRH-30130
Signed-off-by: melanie witt <melwittt@gmail.com>
1 parent 7ba93ca
1 file changed
Lines changed: 4 additions & 0 deletions
[Diff content not preserved. Added lines, by new file line number: 244-245 and 252-253.]