Skip to content

Addr2line enhance 4#4976

Open
TianlongLiang wants to merge 2 commits into
bytecodealliance:mainfrom
TianlongLiang:addr2line_enhance_4
Open

Addr2line enhance 4#4976
TianlongLiang wants to merge 2 commits into
bytecodealliance:mainfrom
TianlongLiang:addr2line_enhance_4

Conversation

@TianlongLiang

Copy link
Copy Markdown
Contributor

No description provided.

…lver

Refactor the wasm symbolicator to make its toolchain workarounds
explicit. addr2line.py now detects the wasi-sdk's clang major version
at startup and routes per-frame address resolution to one of:

- resolve_address_modern (clang >= 22, wasi-sdk 33+):
    single llvm-symbolizer call per address. The symbolizer handles
    address-to-name correctly on this clang.

- resolve_address_legacy (clang < 22 or unknown):
    llvm-symbolizer call + llvm-dwarfdump --lookup overlay to fix
    the outermost function name. Older clang versions emit wasm DWARF
    that confuses llvm-symbolizer's address-to-name resolver for some
    addresses (e.g., reports 'recurse' as 'free'). --lookup goes
    through a different DWARF traversal path that handles them.

The dispatch decision is logged once to stderr so users can see
which path was taken without polluting stdout. Stdout is byte-
identical between paths for non-buggy inputs.

Also includes accumulated improvements:
- --mode flag {interp,aot,fast-interp} for different runtime offset
  conventions (interp post-advance, aot at-instruction-start,
  fast-interp's transformed in-memory bytecode)
- Inline frame annotation "(inlined into <next>)" for clarity
- llvm-symbolizer preferred over llvm-addr2line for column info
- Fallback for offset=0 (trap at function entry; frame_ip not captured)
- Last-resort function-index name fallback when DWARF lacks PC ranges
New test suite at test-tools/addr2line/tests/ exercising addr2line.py
against purpose-built C/C++ sources covering:

- Baseline single-function resolution
- Inline expansion (always_inline, 4-level deep chain)
- Cross-TU LTO inlining (multi-file recursion + wasm-opt -Oz -g)
- Trap inside loop body (DWARF line-table edge case)
- Multi-frame call stack
- C++ symbol demangling
- AOT mode offset math
- fast-interp / --no-addr fallbacks
- offset=0 fallback (trap at function entry)
- Empty input
- Version-dispatch stderr message
- Multi-SDK legacy/modern equivalence (opt-in via --multi-sdk)

Layout:
  test-tools/addr2line/tests/
  ├── README.md              -- documentation
  ├── conftest.py            -- pytest fixtures (sdk discovery, build,
  │                            run_addr2line invocation, multi-sdk
  │                            parametrization)
  ├── test_addr2line.py      -- 14 test cases
  ├── pytest.ini             -- marker definitions (slow, multi_sdk)
  ├── run_tests.sh           -- thin pytest wrapper
  ├── apps/                  -- 8 purpose-built C/C++ sources
  └── fixtures/              -- 3 plaintext call-stack inputs

Sources under apps/ are NOT copied from samples/; they target specific
edge cases independent of sample evolution.

Depends on the addr2line.py refactor (test_dispatch_message_in_stderr
checks the modern/legacy stderr message; test_modern_legacy_equivalence
verifies both paths agree on output).
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant