[HAL][AMDGPU] Support raw HSACO custom kernargs#24512

Open
AWoloszyn wants to merge 1 commit into main from users/awoloszyn/amdgpu-hsaco-custom-kernargs

@AWoloszyn
Contributor

Allow AMDGPU raw HSACO modules to expose HIP-visible kernel symbols that have companion kernel descriptors, even when the code object metadata only describes a subset of exports.

Preserve native kernarg layouts for those raw kernels by widening reflected parameter sizes and offsets, accepting pre-packed HIP argument blobs that omit trailing ABI padding, zero-filling missing bytes, and synthesizing HIP implicit args at the metadata-derived or HSA-reported suffix offset.
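The packing behavior described above can be illustrated with a minimal sketch. It is not the code from this PR: the function name `pack_kernargs`, its parameters, and the example layout are all hypothetical, and the implicit-args payload is an arbitrary stand-in rather than the real HIP implicit argument block.

```python
import struct


def pack_kernargs(user_blob: bytes, explicit_size: int,
                  implicit_offset: int, implicit_args: bytes,
                  total_size: int) -> bytes:
    """Build a full kernarg buffer from a possibly short pre-packed blob.

    All names here are hypothetical; this only illustrates the idea of
    zero-filling omitted trailing padding and appending implicit args.
    """
    if len(user_blob) > explicit_size:
        raise ValueError("blob is larger than the explicit argument region")
    if implicit_offset < explicit_size:
        raise ValueError("implicit args would overlap explicit args")
    if implicit_offset + len(implicit_args) > total_size:
        raise ValueError("implicit args do not fit in the kernarg segment")

    # bytearray(n) is zero-initialized, so any bytes the caller omitted
    # (for example trailing ABI padding) are left as zeros.
    buf = bytearray(total_size)
    buf[:len(user_blob)] = user_blob
    # Place the implicit args at the suffix offset reported for the kernel.
    end = implicit_offset + len(implicit_args)
    buf[implicit_offset:end] = implicit_args
    return bytes(buf)


# Assumed example layout: an 8-byte pointer at offset 0 and a 4-byte int at
# offset 8, padded to 16 bytes. The caller supplies only the 12 used bytes.
blob = struct.pack("<QI", 0x1000, 7)
# Stand-in implicit payload (three little-endian uint32 values).
implicit = struct.pack("<III", 4, 1, 1)
packed = pack_kernargs(blob, explicit_size=16, implicit_offset=16,
                       implicit_args=implicit, total_size=32)
```

In this sketch the short blob is accepted as-is, the four missing padding bytes stay zero, and the implicit payload starts at offset 16 regardless of how many bytes the caller supplied.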

Use ELF symbol metadata and kernel descriptors to identify additional raw code object exports and keep their dispatch ABI consistent with the loaded HSA executable.
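As a rough illustration of pairing kernel symbols with their descriptors, the sketch below works on an already-parsed symbol list. It relies on the AMDGPU code object convention that a kernel's descriptor is an object symbol named `<kernel>.kd`; the `Symbol` type and `find_kernel_exports` function are hypothetical and are not taken from this PR.

```python
from typing import Iterable, List, NamedTuple

# Standard ELF symbol type values.
STT_OBJECT = 1
STT_FUNC = 2


class Symbol(NamedTuple):
    """Hypothetical, simplified view of one ELF symbol table entry."""
    name: str
    type: int
    defined: bool


def find_kernel_exports(symbols: Iterable[Symbol]) -> List[str]:
    """Return function symbols that have a companion '<name>.kd' descriptor."""
    symbols = list(symbols)
    # Collect the kernel names implied by defined descriptor objects.
    descriptors = {
        s.name[:-len(".kd")]
        for s in symbols
        if s.defined and s.type == STT_OBJECT and s.name.endswith(".kd")
    }
    # Keep only defined functions that have a matching descriptor.
    return sorted(
        s.name for s in symbols
        if s.defined and s.type == STT_FUNC and s.name in descriptors
    )


example_symbols = [
    Symbol("add_kernel", STT_FUNC, True),
    Symbol("add_kernel.kd", STT_OBJECT, True),
    Symbol("helper_fn", STT_FUNC, True),      # no descriptor: not a kernel
    Symbol("orphan.kd", STT_OBJECT, True),    # descriptor without a function
]
exports = find_kernel_exports(example_symbols)
```

A filter of this kind would pick up kernels that are present in the symbol table even if the code object metadata lists only some of them; how the PR reconciles the two sources is not shown here.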

Signed-off-by: Andrew Woloszyn <andrew.woloszyn@gmail.com>
@AWoloszyn AWoloszyn requested a review from benvanik as a code owner May 20, 2026 16:37
@AWoloszyn AWoloszyn force-pushed the users/awoloszyn/amdgpu-hsaco-custom-kernargs branch from cedf5e0 to 57ac1d0 Compare May 20, 2026 16:38