You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Forward-port to 7.4: Add DD init and team collection logging for diagnosing slow startups (#13002)
* Add DD init visibility, metrics retries, shard tracking, scan progress, and team collection logging (#12913)
Add logging throughout DD startup and runtime to diagnose slow startups,
stuck data moves, and undesired server classification.
DDTxnProcessor: Log elapsed time for server list + data move read
transaction and keyServer scan. Warn when getRange(dataMoveKeys) takes
over 5 seconds.
DataDistribution: Add NumShards and NumServers to DDInitGotInitialDD.
Add DDInitResumedDataMoves summary with ValidMoves, CancelledMoves,
EmptyMoves counts and elapsed time. Log DD exit reason as DDExiting
at SevWarn. Add DDInit-prefixed trace events throughout startup sequence.
DDTeamCollection: Add Reason and Address details to UndesiredStorageServer
trace events to distinguish version lag, same-address, wrong-class, and
exclusion causes.
DDShardTracker: Log TrackInitialShardsComplete with shard count and
TrackInitialShardsMetricsComplete with elapsed time.
NativeAPI: Add retry counting and logging for getStorageMetrics timeouts.
* Remove double trace
* Fix DD trace event issues: rate-limit warns, deduplicate events, fix DDExiting
- Wire up unused lastLogTime in waitStorageMetrics to rate-limit SevWarn
events to once per 10s after the 60s threshold. Previously every retry
(up to 100/s with 10ms WRONG_SHARD_SERVER_DELAY) emitted SevWarn.
- Remove redundant DataDistributorRunning and DDInitRunning events inside
the try block. Rename the original DataDistributorRunning to DDInitRunning
so a single event serves both purposes and fits the DDInit* query pattern.
- Move DDExiting to the three actual throw sites so it only fires on
terminal exits, not on retries of movekeys_conflict/dd_config_changed
when DD is disabled.
* Trim over-apologetic DDInitDone comment
0 commit comments