fix(tracer): prevent stale ctx.tracing crash on HTTPS keepalive connections by janiussyafiq · Pull Request #13232 · apache/apisix

janiussyafiq · 2026-04-16T09:11:52Z

Description

When apisix.tracing is enabled, the core tracer instruments every request phase — including ssl_client_hello_phase — by allocating a tracing table from lua-tablepool and storing it in ngx.ctx.tracing. On HTTPS keepalive connections, OpenResty reuses the same ngx.ctx object across multiple HTTP requests on the same TLS session.

The bug occurs in the following sequence:

ssl_client_hello_phase calls tracer.start(), which allocates ctx.tracing via tablepool and initialises tracing.spans.
The first HTTP request completes and tracer.release() is called in the log phase, returning the tracing table to the pool. lua-tablepool internally calls table.clear() on release, zeroing all fields — tracing.spans becomes nil — but ctx.tracing still holds a reference to this now-cleared table.
On the second HTTP request (same keepalive connection), tracer.start() finds ctx.tracing is non-nil (a cleared table is still truthy in Lua) and skips re-initialisation.
span.new() then crashes at table.insert(tracing.spans, self) because spans is nil.

This fix addresses the root cause at two layers:

tracer.start(): The initialisation guard is extended from if not tracing then to if not tracing or not tracing.spans then. Since lua-tablepool always zeroes tracing.spans on release via table.clear(), this reliably detects a stale or cleared tracing table and re-initialises it correctly — including on HTTPS keepalive second requests and any diverged HTTP/2 contexts.
tracer.release(): A if spans then guard is added before iterating and releasing the spans table, making release safe even when called on a partially-cleared state. The explicit nil assignments are intentionally avoided to preserve the tablepool contract — re-allocation and de-allocation of tables is expensive, and lua-tablepool already handles clearing internally.

Which issue(s) this PR fixes:

Fixes #13200

Checklist

I have explained the need for this PR and the problem it solves
I have explained the changes or the new features added to this PR
I have added tests corresponding to this change (t/node/tracer.t)
I have updated the documentation to reflect this change
I have verified that this change is backward compatible

…ctions

janiussyafiq added 2 commits April 16, 2026 16:36

fix(tracer): prevent stale ctx.tracing crash on HTTPS keepalive conne…

6dce496

…ctions

chore: remove long comments

415b570

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. bug Something isn't working labels Apr 16, 2026

fix: remove expensive op dereference table

51d05e7

Baoyuantop approved these changes Apr 17, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(tracer): prevent stale ctx.tracing crash on HTTPS keepalive connections#13232

fix(tracer): prevent stale ctx.tracing crash on HTTPS keepalive connections#13232
janiussyafiq wants to merge 3 commits intoapache:masterfrom
janiussyafiq:fix/tracing-https

janiussyafiq commented Apr 16, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

janiussyafiq commented Apr 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Which issue(s) this PR fixes:

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

janiussyafiq commented Apr 16, 2026 •

edited

Loading