Skip to content

Zach/mi300a fixes#1819

Merged
zatkins-dev merged 4 commits into
mainfrom
zach/mi300a-fixes
May 6, 2025
Merged

Zach/mi300a fixes#1819
zatkins-dev merged 4 commits into
mainfrom
zach/mi300a-fixes

Conversation

@zatkins-dev
Copy link
Copy Markdown
Collaborator

Reworks the stream implementation for /gpu/hip/gen to avoid creating and destroying streams on every operator apply.

Updates hipblas calls to only sync stream, this matters on MI300A since hipblas seems to use an async stream. Avoids a full device sync.

Also makes working vectors come from the Vector object delegate to avoid bad ref behavior.

Comment thread backends/hip-gen/ceed-hip-gen-operator.c
Comment thread interface/ceed.c Outdated
@zatkins-dev zatkins-dev force-pushed the zach/mi300a-fixes branch from cd2c84c to b5cfe2e Compare May 6, 2025 21:39
Comment thread interface/ceed.c
@zatkins-dev zatkins-dev force-pushed the zach/mi300a-fixes branch from b5cfe2e to b46df0d Compare May 6, 2025 21:41
Copy link
Copy Markdown
Member

@jeremylt jeremylt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ok, as long as this does as expected on the MI300A machines, LGTM

@zatkins-dev zatkins-dev merged commit 297af36 into main May 6, 2025
29 checks passed
@zatkins-dev zatkins-dev deleted the zach/mi300a-fixes branch May 6, 2025 22:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants