
configs: add SM89_RTX4060_LAPTOP (Ada sm_89) tested config #135

Open
dzwduan wants to merge 1 commit into accel-sim:dev from dzwduan:pr/sm89-rtx4060-laptop


dzwduan commented Jun 3, 2026


Based on SM86_RTX3070. Parameters are derived from a real RTX 4060 Laptop GPU (AD107, CC 8.9): 24 SMs, 32 MB L2, 128-bit GDDR6 (8 channels), 1890 MHz core / 16 Gbps DRAM. Per-SM resources are unchanged from Ampere GA10x.
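
As a quick sanity check on the memory figures above, the sketch below computes the per-channel bus width and the peak DRAM bandwidth implied by a 128-bit bus, 8 channels, and 16 Gbps per pin. The function name `dram_geometry` is hypothetical, and the even split of the bus across channels is an assumption made for illustration; neither is taken from the config file itself.

```python
def dram_geometry(bus_width_bits: int, channels: int, gbps_per_pin: float):
    """Return (bits per channel, peak bandwidth in GB/s).

    Assumes the bus is divided evenly across channels (illustrative assumption).
    """
    if bus_width_bits % channels != 0:
        raise ValueError("bus width must divide evenly across channels")
    bits_per_channel = bus_width_bits // channels
    # Each pin carries gbps_per_pin gigabits per second; divide by 8 for bytes.
    peak_gb_per_s = bus_width_bits * gbps_per_pin / 8
    return bits_per_channel, peak_gb_per_s


if __name__ == "__main__":
    bits, bandwidth = dram_geometry(128, 8, 16.0)
    print(f"{bits} bits per channel, {bandwidth:.0f} GB/s peak")
    # prints: 16 bits per channel, 256 GB/s peak
```

The result is a theoretical peak only; it says nothing about the bandwidth the simulator or the hardware sustains in practice.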

Two indexing choices differ from SM86 because of the 8-channel / large-L2 geometry. Both were verified by running real sm_89 traces; with IPOLY selected, the simulator hits an assertion.

  • memory_partition_indexing 0: IPOLY needs 16, 32, or 64 channels and this config has 8 (same choice as SM80_A100)
  • dl2 set-index X instead of P/IPOLY: IPOLY needs nset in {16,32,64} and this config has 1024 (same choice as SM80_A100)
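
The constraint behind both choices can be sketched as a small pre-flight check. This is a hypothetical helper, not part of Accel-Sim or GPGPU-Sim: the supported sizes {16, 32, 64} come from the description above, while the names `IPOLY_SUPPORTED_SIZES`, `ipoly_supported`, and `pick_indexing` are invented for illustration.

```python
# Sizes for which IPOLY hashing is usable, as stated in this PR description.
IPOLY_SUPPORTED_SIZES = frozenset({16, 32, 64})


def ipoly_supported(size: int) -> bool:
    """True if IPOLY can index `size` channels or cache sets."""
    return size in IPOLY_SUPPORTED_SIZES


def pick_indexing(n_channels: int, l2_nset: int) -> dict:
    """Report, per index function, whether IPOLY is an option or must be avoided."""
    return {
        "partition_indexing": "IPOLY" if ipoly_supported(n_channels) else "non-IPOLY",
        "l2_set_index": "IPOLY" if ipoly_supported(l2_nset) else "non-IPOLY",
    }


if __name__ == "__main__":
    # Geometry from this PR: 8 memory channels, 1024 L2 sets.
    print(pick_indexing(n_channels=8, l2_nset=1024))
    # prints: {'partition_indexing': 'non-IPOLY', 'l2_set_index': 'non-IPOLY'}
```

A check like this would flag an unsupported geometry before a trace run rather than at the simulator's assertion.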

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>