perf: replace dis.get_instructions with direct co_code parsing in from_code by P403n1x87 · Pull Request #194 · MatthieuDartiailh/bytecode

P403n1x87 · 2026-05-08T13:34:52Z

dis.get_instructions performs two full passes over the bytecode:

_make_labels_map → findlabels → _unpack_opargs (to build a jump-label map)
_get_instructions_bytes (to iterate instructions with full metadata)

Neither pass is needed here. ConcreteBytecode.from_code only needs the opname, raw arg byte, and source positions for each instruction word — all of which are directly available from co_code and co_positions().

CACHE entries are already inline in co_code on all supported Python versions, so direct 2-byte iteration handles them naturally without the per-version cache_info loop that 3.13 previously required.

Throughput (round-trips of Bytecode.from_code().to_code() on the dis module's own code object, timed over 1 second, 3 runs each):

Before: 92–94 round-trips/s
After: 107–111 round-trips/s (~+17%)

Own CPU time figures:

Function	Before	After
`dis._unpack_opargs`	5.98%	0.0%
`dis._get_instructions_bytes`	3.45%	0.0%
`ConcreteBytecode.from_code`	3.63%	4.91%

…m_code dis.get_instructions performs two full passes over the bytecode: - _make_labels_map → findlabels → _unpack_opargs (to build a jump-label map) - _get_instructions_bytes (to iterate instructions with full metadata) Neither pass is needed here. ConcreteBytecode.from_code only needs the opname, raw arg byte, and source positions for each instruction word — all of which are directly available from co_code and co_positions(). CACHE entries are already inline in co_code on all supported Python versions, so direct 2-byte iteration handles them naturally without the per-version cache_info loop that 3.13 previously required. Throughput (round-trips of Bytecode.from_code().to_code() on the dis module's own code object, timed over 1 second, 3 runs each): Before: 92–94 round-trips/s After: 107–111 round-trips/s (~+17%) Austin CPU profile figures: dis._unpack_opargs: 5.98% own → eliminated dis._get_instructions_bytes: 3.45% own → eliminated ConcreteBytecode.from_code: 3.63% own → 4.91% own

codecov-commenter · 2026-05-08T13:35:48Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 95.17%. Comparing base (9de3e78) to head (9f53642).

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #194      +/-   ##
==========================================
- Coverage   95.21%   95.17%   -0.05%     
==========================================
  Files           7        7              
  Lines        2048     2051       +3     
  Branches      448      446       -2     
==========================================
+ Hits         1950     1952       +2     
- Misses         54       55       +1     
  Partials       44       44

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

MatthieuDartiailh

Very nice !!! Getting rid of dis without complications is something I wished I had known was possible.

A couple of comments but LGTM

Co-authored-by: Matthieu Dartiailh <marul@laposte.net>

P403n1x87 · 2026-05-08T14:55:24Z

Throughput up to ~130 after updating the PR

Add two fast-path factory methods that skip validation by using object.__new__ + direct slot assignment, for call sites where the inputs are already known to be valid: **InstrLocation._from_tuple** — replaces InstrLocation(...) at four internal sites where positions come from trusted sources (existing InstrLocation.lineno, SetLineno.lineno, first_lineno): - ConcreteBytecode.to_bytecode (fallback lineno-only location) - ConcreteBytecode._pack_location (propagated from existing location) - _ConvertBytecodeToConcrete.concrete_instructions (first_lineno seed and SetLineno-derived locations) **BaseInstr._from_trusted** — replaces Instr(name, arg, location=loc) in ConcreteBytecode.to_bytecode, where name/opcode/arg/location are all derived from already-validated ConcreteInstr objects. CPU own-time profile data: | Hotspot | Before | After | |---|---|---| | `ConcreteBytecode.to_bytecode` | 5.98% | 5.07% | | `Instr._check_arg` | 2.87% | eliminated | | `BaseInstr._set` (via to_bytecode) | 1.48% | eliminated | | `BaseInstr._from_trusted` | — | <1% (not in top 20) | Throughput (Bytecode.from_code().to_code() on dis module's code object, 1 second timed window, 5 runs): | | r/s range | |---|---| | Before | 103–108 | | After | 109–114 |

MatthieuDartiailh · 2026-05-08T18:30:16Z

+            Tuple[Optional[int], Optional[int], Optional[int], Optional[int]]
+        ] = iter(code.co_positions())
+        for offset in range(0, len(bc), 2):
+            arg = bc[offset + 1] if opcode_has_argument(op := bc[offset]) else UNSET


Won't this be problematic for Cython ?

It shouldn't be, the issue is specific to certain ad-hoc optimisations (cf. cython/cython#7670)

Actually in this particular occurance I do not find the walrus very readable. Could you go back to first fetching the op and then using it to get the arg ?

…ocation-overhead

P403n1x87 marked this pull request as ready for review May 8, 2026 13:48

Merge branch 'main' into perf/avoid-dis-location-overhead

75233b5

MatthieuDartiailh reviewed May 8, 2026

View reviewed changes

Comment thread src/bytecode/concrete.py Outdated

Comment thread src/bytecode/concrete.py Outdated

P403n1x87 and others added 2 commits May 8, 2026 15:48

Update src/bytecode/concrete.py

daa1193

Co-authored-by: Matthieu Dartiailh <marul@laposte.net>

assume co_positions always available

483c683

P403n1x87 requested a review from MatthieuDartiailh May 8, 2026 14:56

MatthieuDartiailh reviewed May 8, 2026

View reviewed changes

P403n1x87 and others added 3 commits May 8, 2026 19:49

Merge branch 'perf/avoid-location-revalidation' into perf/avoid-dis-l…

2d54d73

…ocation-overhead

use faster _from_tuple

72fcf31

Merge branch 'main' into perf/avoid-dis-location-overhead

aea50fd

P403n1x87 requested a review from MatthieuDartiailh May 8, 2026 19:06

undo walrus

9f53642

MatthieuDartiailh approved these changes May 9, 2026

View reviewed changes

MatthieuDartiailh merged commit 25bf1bc into MatthieuDartiailh:main May 9, 2026
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf: replace dis.get_instructions with direct co_code parsing in from_code#194

perf: replace dis.get_instructions with direct co_code parsing in from_code#194
MatthieuDartiailh merged 9 commits into
MatthieuDartiailh:mainfrom
P403n1x87:perf/avoid-dis-location-overhead

P403n1x87 commented May 8, 2026

Uh oh!

codecov-commenter commented May 8, 2026 •

edited

Loading

Uh oh!

MatthieuDartiailh left a comment

Uh oh!

Uh oh!

Uh oh!

P403n1x87 commented May 8, 2026

Uh oh!

MatthieuDartiailh May 8, 2026

Uh oh!

P403n1x87 May 8, 2026

Uh oh!

MatthieuDartiailh May 8, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

P403n1x87 commented May 8, 2026

Uh oh!

codecov-commenter commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

MatthieuDartiailh left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

P403n1x87 commented May 8, 2026

Uh oh!

MatthieuDartiailh May 8, 2026

Choose a reason for hiding this comment

Uh oh!

P403n1x87 May 8, 2026

Choose a reason for hiding this comment

Uh oh!

MatthieuDartiailh May 8, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov-commenter commented May 8, 2026 •

edited

Loading